PDF to Markdown guides
Answer-first, practical guides to turning PDFs into clean, LLM-ready Markdown – pick the one that matches your job.
Choose a guide
Each guide answers the question first, then walks through the steps and the edge cases.
Scanned PDF to Markdown (OCR)
Turn image-only and scanned PDFs into selectable Markdown with built-in OCR across many languages. Real tables, formulas kept.
PDF to Markdown for RAG
Prepare PDFs as clean, chunk-friendly Markdown for RAG and LLM ingestion, with a copyable REST + hosted-MCP recipe.
Extract tables from PDF to Markdown
Get real, aligned Markdown tables out of a PDF, keep formulas, and OCR scanned tables. Not screenshots.
Convert PDF to Markdown in Python
A step-by-step Python tutorial for the REST API: create a job, poll, and download clean Markdown, with error handling.
PDF to Markdown for Obsidian & Notion
Convert a PDF to clean Markdown and import it into an Obsidian vault or Notion, with headings, tables and links intact.
Building an integration or agent?
Beyond the guides, the converter is a REST API and a hosted MCP endpoint, with machine-readable discovery so scripts and agents can find and use it.