
PDF Header Detection

Detects H1–H6 headings from font sizes, bold text, and the table of contents, preserving your document's structure as clean Markdown.


How It Works

1. Analyze text metrics

The font size, weight, and position of every text element in the PDF are measured and recorded.

2. Classify heading levels

Text elements are ranked by relative size and style to assign H1 through H6 levels.

3. Output Markdown headings

Headings are written as # through ###### with correct nesting in the final Markdown.
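
The three steps above can be sketched in a few lines of Python. This is an illustrative sketch only, not the converter's actual implementation: the `Span` record, the "most characters wins" rule for finding the body font size, and the function names are all assumptions made for the example.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Span:
    """One text element with the metrics from step 1 (hypothetical shape)."""
    text: str
    size: float  # font size in points
    bold: bool
    y: float     # vertical position on the page


def body_size(spans):
    """Assume the font size covering the most characters is body text."""
    weight = Counter()
    for s in spans:
        weight[s.size] += len(s.text)
    return weight.most_common(1)[0][0]


def classify(spans):
    """Step 2: rank sizes larger than body text, largest = H1, capped at H6."""
    base = body_size(spans)
    heading_sizes = sorted({s.size for s in spans if s.size > base}, reverse=True)
    level_of = {size: min(i + 1, 6) for i, size in enumerate(heading_sizes)}
    # Level 0 means "not a heading".
    return [(level_of.get(s.size, 0), s.text) for s in spans]


def to_markdown(classified):
    """Step 3: prefix each heading with one '#' per level."""
    lines = []
    for level, text in classified:
        lines.append(f"{'#' * level} {text}" if level else text)
    return "\n\n".join(lines)


spans = [
    Span("Title", 24, True, 0),
    Span("Intro paragraph text that is long.", 11, False, 10),
    Span("Section", 18, True, 20),
    Span("More body text here for the section.", 11, False, 30),
]
print(to_markdown(classify(spans)))
```

Ranking by relative size rather than fixed point values means the same logic applies whether a document sets its body text at 9 pt or 12 pt.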

What Makes It Useful

Font-size analysis

Compares font metrics across the document to identify the heading hierarchy.

TOC parsing

Uses a table of contents, when present, to confirm and refine heading levels.

Bold text detection

Identifies section titles that rely on bold styling rather than font size alone.

Hierarchy preservation

Maintains the exact H1 → H2 → H3 nesting structure from your original document.
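
As a rough illustration of how bold text detection and TOC parsing might complement font-size analysis, the sketch below flags short bold lines set at body size and then lets a table of contents override the detected level. The heuristics, thresholds, and function names are assumptions chosen for the example, not a description of the converter's internals.

```python
import re


def is_bold_heading(text, size, bold, body_size, max_words=12):
    """Heuristic: a short, bold, body-sized line that does not end like a sentence."""
    if not bold or abs(size - body_size) >= 0.5:
        return False
    stripped = text.strip()
    if not stripped or len(stripped.split()) > max_words:
        return False
    return not stripped.endswith((".", ",", ";", ":"))


def normalize(title):
    """Lowercase and drop leading numbering such as '2.1' so titles compare loosely."""
    title = re.sub(r"^\s*\d+(\.\d+)*\.?\s+", "", title)
    return re.sub(r"\s+", " ", title).strip().lower()


def refine_with_toc(headings, toc):
    """Prefer the TOC's level when a detected heading matches a TOC entry.

    headings: list of (level, title) pairs from font analysis
    toc: list of (level, title) pairs parsed from a table of contents
    """
    toc_levels = {normalize(title): level for level, title in toc}
    return [
        (toc_levels.get(normalize(title), level), title)
        for level, title in headings
    ]


detected = [(1, "1 Introduction"), (1, "1.1 Background"), (2, "Appendix")]
toc = [(1, "Introduction"), (2, "Background")]
refined = refine_with_toc(detected, toc)
```

Headings that have no TOC match, such as "Appendix" here, keep the level assigned by font analysis.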

Perfect For

Academic papers

Section headings, subsections, and sub-subsections are detected cleanly.

Technical docs

Chapter and section structure is preserved automatically. No manual cleanup needed.

Books & reports

Full heading hierarchy from title to sub-section comes through intact.

See Your Headings Detected Automatically

Upload a PDF and watch heading levels get identified and preserved as clean Markdown.
