docling-project/docling
View on GitHubMarkdown serialization of headings in rich table cells
Open
#2722 opened on Dec 4, 2025
buggood first issuemarkdown
Description
Bug
In docling-core, the MarkdownTableSerializer transforms DoclingDocument headings and titles into markdown headings (i.e., text with preceding # symbols). According to the markdown specs, you can’t use headings, blockquotes, lists, horizontal rules, images, or most HTML tags when formatting text within tables and most applications will not render those headings properly.
Steps to reproduce
Convert Docling's test file table_with_heading_02.html to markdown or check its ground truth file table_with_heading_02.html.md
Docling version
Docling version: 2.64.0 Docling Core version: 2.51.1 Docling IBM Models version: 3.10.2 Docling Parse version: 4.7.1 Python: cpython-313 (3.13.5) Platform: macOS-14.7.1-arm64-arm-64bit-Mach-O
Python version
Python 3.13.5