Support parse pdf to structured data (Parser + Normalization). · apache/seatunnel#9716 | Good First Issue

(2 comments) (0 reactions) (1 assignee)Java (1,432 forks)batch import

help wanted

Repository metrics

Stars: (6,897 stars)
PR merge metrics: (Avg merge 13d 21h) (143 merged PRs in 30d)

Description

This issue does not include a description.

Contributor guide

Research direction: Investigate existing Java PDF parsing libraries (e.g., Apache PDFBox) and understand the SeaTunnel connector architecture to implement a new connector for PDF data source.
Tech stack: java
Domain: backend
Issue type: Feature
Difficulty: 3
Estimated time: 1-2 days
Activity status: Active
Clarity: Clear
Prerequisites: JavaMaven
Newbie friendliness: 65