PDF to Markdown guides

Answer-first, practical guides to turning PDFs into clean, LLM-ready Markdown – pick the one that matches your job.

Choose a guide

Each guide answers the question first, then walks through the steps and the edge cases.

Scanned PDF to Markdown (OCR)

Turn image-only and scanned PDFs into Markdown with selectable text, using built-in OCR across many languages. Tables come out as real Markdown tables, and formulas are kept.


PDF to Markdown for RAG

Prepare PDFs as clean, chunk-friendly Markdown for RAG and LLM ingestion, with a copyable REST + hosted-MCP recipe.

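As a rough illustration of what "chunk-friendly" can mean in practice, the sketch below splits converted Markdown into one chunk per heading, so each chunk carries its own section title. The function name `chunk_by_heading` and the chunk shape are assumptions made for this example, not part of the converter or its API.

```python
import re

# ATX headings only ("# Title" through "###### Title"); an assumption for this sketch.
HEADING = re.compile(r"^(#{1,6})\s+(.*)$")


def chunk_by_heading(markdown):
    """Split Markdown into chunks that each start at a heading.

    Hypothetical helper: returns a list of {"title", "text"} dicts.
    Text before the first heading becomes a chunk with an empty title.
    """
    chunks = []
    title, lines = "", []
    in_fence = False

    def flush():
        text = "\n".join(lines).strip()
        if text:
            chunks.append({"title": title, "text": text})

    for line in markdown.splitlines():
        # Track fenced code blocks so "# comment" lines inside code are not headings.
        if line.lstrip().startswith("```"):
            in_fence = not in_fence
        match = None if in_fence else HEADING.match(line)
        if match:
            flush()
            title, lines = match.group(2).strip(), [line]
        else:
            lines.append(line)
    flush()
    return chunks
```

Splitting on headings keeps tables and code blocks inside the section they belong to, which tends to matter more for retrieval quality than hitting an exact chunk size.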

Extract tables from PDF to Markdown

Get real, aligned Markdown tables out of a PDF rather than screenshots, keep formulas, and OCR scanned tables.


Convert PDF to Markdown in Python

A step-by-step Python tutorial for the REST API: create a job, poll, and download clean Markdown, with error handling.

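The create, poll, download flow might look roughly like the sketch below. Everything service-specific here is a placeholder assumption: the base URL, the `/jobs` paths, and the `id`, `state`, `error` and `markdown` field names are invented for illustration and should be replaced with the values from the actual API reference.

```python
import json
import time
import urllib.request

# Hypothetical placeholder; not the real service URL.
BASE_URL = "https://api.example.com/v1"


def send(method, url, payload=None):
    """Send one HTTP request and return the decoded JSON body."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    req = urllib.request.Request(
        url, data=data, method=method,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.loads(resp.read().decode("utf-8"))


def convert(pdf_url, request=send, interval=2.0, max_polls=30, sleep=time.sleep):
    """Create a job, poll until it finishes, and return the Markdown text.

    All paths and JSON field names below are assumed for this sketch.
    """
    job = request("POST", f"{BASE_URL}/jobs", {"source_url": pdf_url})
    job_id = job["id"]
    for _ in range(max_polls):
        status = request("GET", f"{BASE_URL}/jobs/{job_id}")
        if status["state"] == "done":
            result = request("GET", f"{BASE_URL}/jobs/{job_id}/result")
            return result["markdown"]
        if status["state"] == "failed":
            # Surface the server-side reason instead of polling forever.
            raise RuntimeError(f"conversion failed: {status.get('error', 'unknown')}")
        sleep(interval)
    raise TimeoutError(f"job {job_id} did not finish after {max_polls} polls")
```

Passing `request` and `sleep` as parameters keeps the polling logic testable without network access; a real client would likely also add authentication headers and retry on transient HTTP errors.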

PDF to Markdown for Obsidian & Notion

Convert a PDF to clean Markdown and import it into an Obsidian vault or Notion, with headings, tables and links intact.


Building an integration or agent?

Beyond the guides, the converter is a REST API and a hosted MCP endpoint, with machine-readable discovery so scripts and agents can find and use it.