Comparison

PDF to Markdown vs Marker

Marker is an excellent open-source library you self-host. pdf2md.dev is a hosted service that converts a PDF in seconds with no install and no GPU – in the browser, by REST API, or through a hosted MCP (Model Context Protocol) endpoint. Here is an honest side-by-side comparison so you can pick the right one.

Short answer

Self-host Marker, or let pdf2md.dev host the conversion for you

Choose Marker when you want to run the conversion on your own machines: it is open-source, reaches near-perfect accuracy on messy documents with its optional --use_llm flag, and reads many input formats (PDF, images, DOCX, PPTX, XLSX, HTML, EPUB). It is built for a GPU, and you operate the stack. Choose pdf2md.dev when you want the result now with nothing to set up: free anonymous conversion in the browser, built-in OCR, real Markdown tables and formulas, and a REST API plus a hosted MCP for agents – no Python, no GPU, no account.

Side by side

pdf2md.dev vs Marker, feature by feature

Both produce clean Markdown, keep tables and formulas, and run OCR on scanned pages. The real difference is hosted-for-you versus self-hosted.

| Feature | pdf2md.dev | Marker |
| --- | --- | --- |
| Setup | Nothing to install – browser, REST API or hosted MCP | pip install marker-pdf + a Python environment |
| Hardware | None – we host it; no GPU | Runs on CPU/MPS, but built for a GPU (fast in batch on an H100) |
| Cost | Free anonymous tier; paid tiers raise limits | Free to self-host under the licence; you pay GPU + ops. Datalab API is paid |
| Account | None to convert | None to self-host; an API key for the Datalab hosted API |
| Scanned PDF OCR | Built in, many languages, no flags | Yes, via Surya OCR when self-hosted |
| Tables & formulas | Yes – real Markdown tables, math kept | Yes – tables, equations and inline math |
| Input formats | PDF and images, by upload or URL | PDF, image, DOCX, PPTX, XLSX, HTML, EPUB |
| Highest-accuracy mode | Pick the engine – MinerU (dense) or Docling (clean, fast) | Optional --use_llm for near-perfect output on messy pages |
| Automation | REST API + hosted MCP for agents | Python library; optional paid Datalab API |
| Licensing | Hosted SaaS; your files are not used to train models | Code GPL; weights Open Rail-M (free under $2M rev/funding, non-compete) |

Marker details are taken from its public project documentation; pdf2md.dev limits are the current free-tier values. Both projects evolve, so check each source for the latest.

More options? See the full roundup of the best PDF to Markdown converters for the whole field at a glance.

Be fair

When Marker is the better choice

Marker is a genuinely strong tool. Reach for it when these fit.

Fully self-hosted

You need conversion to run entirely on your own machines, offline or on-prem, with no third-party service in the path.

Many input formats

You convert DOCX, PPTX, XLSX, HTML or EPUB as well as PDF, and want one tool for all of them.

LLM-boosted accuracy

You want near-perfect output on messy layouts via the --use_llm flag and have the GPU and MLOps to run it.
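If you go the self-hosted route, a typical run is one command per file. The sketch below only builds the argument list for Marker's marker_single command so you can inspect it before running anything. It assumes Marker is installed with pip install marker-pdf; the helper name marker_command is ours, not part of Marker. As far as we understand Marker's documentation, the LLM mode also needs credentials for an LLM service, so check the project README before relying on it.

```python
import shlex
from pathlib import Path


def marker_command(pdf_path, output_dir, use_llm=False):
    """Build the argument list for Marker's `marker_single` CLI.

    `marker_command` is a hypothetical helper for this page; only the
    command name and the two flags come from Marker itself.
    """
    cmd = ["marker_single", str(Path(pdf_path)), "--output_dir", str(Path(output_dir))]
    if use_llm:
        # Optional higher-accuracy mode described above.
        cmd.append("--use_llm")
    return cmd


cmd = marker_command("report.pdf", "out", use_llm=True)
print(shlex.join(cmd))  # prints: marker_single report.pdf --output_dir out --use_llm

# To actually run the conversion on a machine with Marker installed:
#   import subprocess
#   subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell-quoting problems with file names that contain spaces.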

Where we win

When pdf2md.dev fits better

The hosted route removes the setup, the GPU and the account.

Result in seconds

Drop a PDF and get clean Markdown with nothing to install – no Python, no GPU, no account.

API + hosted MCP

Call the same conversion from your code or an agent pipeline without managing any infrastructure.

OCR & tables built in

Scanned PDFs, tables and formulas are handled out of the box, no flags to tune.

Occasional or free use

Convert anonymously in the browser on the free tier instead of standing up a GPU box for a few files.

Engine choice

Switch between MinerU and Docling per document instead of tuning one pipeline yourself.

No training on your files

Files are converted and then deleted after a short retention window (1 hour on the free tier); they are not used to train models.

Converting at scale or from an agent?

pdf2md.dev offers a REST API and a hosted MCP endpoint as well as a web app. Convert a PDF and get clean Markdown back in your own code or an LLM agent – no GPU box to run. See the Python tutorial on the developer hub.
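To give a feel for what a call from code could look like, here is a minimal sketch that prepares (but does not send) a conversion request using only the Python standard library. Everything about the wire format is an assumption for illustration: the endpoint URL is a placeholder, and the JSON field names url and engine, the engine values and the bearer-token header are hypothetical. Take the real endpoint, parameters and authentication from the developer hub.

```python
import json
import urllib.request

# Engine names as listed on this page; the exact wire values are assumed.
ENGINES = {"mineru", "docling"}


def build_convert_request(endpoint, pdf_url, engine="docling", api_key=None):
    """Prepare a POST request asking a hosted converter to fetch and convert a PDF.

    Hypothetical request shape: a JSON body with "url" and "engine" fields.
    """
    if engine not in ENGINES:
        raise ValueError(f"unknown engine: {engine!r}")
    payload = json.dumps({"url": pdf_url, "engine": engine}).encode("utf-8")
    headers = {"Content-Type": "application/json", "Accept": "application/json"}
    if api_key:
        # Assumed auth scheme; anonymous use would omit this header.
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(endpoint, data=payload, headers=headers, method="POST")


# Placeholder endpoint: replace it with the documented one.
req = build_convert_request(
    "https://api.example.invalid/convert",
    "https://example.com/paper.pdf",
    engine="mineru",
)
print(req.get_method(), req.full_url)

# Sending is left to the caller once the real endpoint is known:
#   with urllib.request.urlopen(req, timeout=60) as resp:
#       result = json.load(resp)
```

Keeping request construction separate from sending makes the call easy to inspect and unit-test without touching the network.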

FAQ

Common questions

Is pdf2md.dev a drop-in replacement for Marker?

Not exactly: they are different kinds of tool. Marker is a self-hosted Python library you run yourself; pdf2md.dev is a hosted service you reach from the browser, a REST API or a hosted MCP with nothing to install. Pick by whether you want to host the conversion yourself or have it hosted for you.

Is Marker free?

Marker's code is open-source under GPL. Its model weights use a modified Open Rail-M licence, which is free for research, for personal use, and for organisations under $2M in funding or revenue that do not compete with the Datalab API. You still pay for the GPU, compute and operations to run it. Datalab also offers a paid hosted API.

Does pdf2md.dev need a GPU?

No. We host the conversion, so you need nothing locally: no GPU, no Python and no install. Marker runs on CPU but is built for a GPU and is much faster on one.

Which is more accurate?

Both are strong. Marker can reach near-perfect output on messy layouts with its optional --use_llm flag. pdf2md.dev lets you pick the engine (MinerU for dense, complex pages; Docling for clean, fast ones) and handles OCR, tables and formulas with no flags to tune.

Can I call either from code or an agent?

pdf2md.dev exposes a REST API and a hosted MCP endpoint, so an agent can convert a PDF without any setup. Marker is a Python library you import in your own code, with an optional paid Datalab API for a hosted endpoint.

What about scanned PDFs?

Both tools run OCR on scanned and image-only PDFs. Marker uses Surya OCR when you self-host. On pdf2md.dev, OCR is built in across many languages with no flags to set – see converting scanned PDFs.

Is pdf2md.dev really free with no sign-up?

Yes. Convert anonymously in the browser on the free tier (3 slots, 10 MB files, a 15-minute time budget, 1-hour retention). Paid tiers raise every limit.
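If you script uploads, a quick pre-flight check saves a round trip for files that are too large. The sketch below checks only the 10 MB file-size limit quoted above. It assumes "10 MB" means 10,000,000 bytes (the more conservative reading), and the names FREE_TIER_MAX_BYTES and fits_free_tier are ours, not part of any pdf2md.dev client.

```python
from pathlib import Path

# Assumption: "10 MB" read as decimal megabytes, the stricter interpretation.
FREE_TIER_MAX_BYTES = 10 * 1000 * 1000


def fits_free_tier(size_bytes, max_bytes=FREE_TIER_MAX_BYTES):
    """Return True if a non-empty file of this size is within the size limit."""
    return 0 < size_bytes <= max_bytes


def file_fits_free_tier(path):
    """Check a file on disk against the same limit."""
    return fits_free_tier(Path(path).stat().st_size)


print(fits_free_tier(4_500_000))   # prints: True
print(fits_free_tier(12_000_000))  # prints: False
```

The other free-tier values (slots, time budget, retention) are enforced by the service and are not modelled here.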