apache/seatunnel

PR-I: Add dedicated knowledge sources

Open

#10,921 opened on May 19, 2026

(0 comments) (0 reactions) (0 assignees) Java (6,897 stars) (1,432 forks) batch import
help wanted

Description

Scope

Add dedicated knowledge sources after the unified document contract is stable.

Candidate sources include Confluence, Google Drive, and SharePoint.

Deliverables

  • Add Confluence source proposal.
  • Add Google Drive source proposal.
  • Add SharePoint source proposal.
  • Ensure each source follows the unified document contract.
  • Prefer the source-native version as the priority 1 input to DocumentHash wherever the source exposes one.
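
One possible reading of the DocumentHash priority rule, as a minimal sketch. `DocumentHashSketch`, `resolve`, and the `native:` / `sha256:` prefixes are hypothetical names chosen for illustration and do not come from the SeaTunnel codebase. The fallback to a SHA-256 content digest is likewise an assumption about what a lower-priority input might be.

```java
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Optional;

/** Hypothetical sketch; none of these names exist in SeaTunnel. */
final class DocumentHashSketch {

    private DocumentHashSketch() {}

    /**
     * Priority 1: use the source-native version when the source provides one.
     * Fallback (assumed): a SHA-256 digest of the document content.
     */
    static String resolve(Optional<String> nativeVersion, byte[] content) {
        return nativeVersion
                .filter(v -> !v.trim().isEmpty())
                .map(v -> "native:" + v)
                .orElseGet(() -> "sha256:" + sha256Hex(content));
    }

    /** Hex-encodes the SHA-256 digest of the given bytes. */
    static String sha256Hex(byte[] content) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            StringBuilder hex = new StringBuilder();
            for (byte b : digest.digest(content)) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is mandatory for every Java platform implementation.
            throw new IllegalStateException("SHA-256 unavailable", e);
        }
    }
}
```

Prefixing the result with its origin keeps a native version string from ever colliding with a content digest, which may or may not matter for the final contract.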

Acceptance Criteria

  • Dedicated sources emit or project the standard Knowledge Sync metadata.
  • Connector-specific logic stays inside connector modules.
  • Source-native versioning is used when available.
  • Sources integrate with the existing identity, parse, chunk, embedding, and lifecycle sink path.
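
A rough sketch of how the first two criteria could fit together: each connector module owns a small projector that maps its raw payload onto one source-neutral metadata record. Everything here is hypothetical, including `StandardMetadata`, `MetadataProjector`, `ConfluencePageStub`, and the three field names. The actual Knowledge Sync metadata fields are defined by the unified document contract, not by this example.

```java
/** Hypothetical sketch; names and fields are placeholders, not SeaTunnel APIs. */
final class KnowledgeMetadataSketch {

    private KnowledgeMetadataSketch() {}

    /** Source-neutral record that downstream stages would consume. */
    static final class StandardMetadata {
        final String sourceType;
        final String documentId;
        final String nativeVersion;

        StandardMetadata(String sourceType, String documentId, String nativeVersion) {
            this.sourceType = sourceType;
            this.documentId = documentId;
            this.nativeVersion = nativeVersion;
        }
    }

    /** Implemented once per connector, inside that connector's module. */
    interface MetadataProjector<T> {
        StandardMetadata project(T raw);
    }

    /** Stand-in for a raw Confluence page payload (assumed shape). */
    static final class ConfluencePageStub {
        final String pageId;
        final int versionNumber;

        ConfluencePageStub(String pageId, int versionNumber) {
            this.pageId = pageId;
            this.versionNumber = versionNumber;
        }
    }

    /** Connector-specific mapping; shared code only ever sees StandardMetadata. */
    static final MetadataProjector<ConfluencePageStub> CONFLUENCE =
            page -> new StandardMetadata(
                    "confluence", page.pageId, String.valueOf(page.versionNumber));
}
```

With this shape, the identity, parse, chunk, embedding, and lifecycle sink stages would depend only on the shared record, so adding Google Drive or SharePoint means adding one more projector rather than touching shared code.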
