Blog

The PDF to Markdown blog

Technical deep dives, product notes and honest write-ups on the messy, interesting parts of turning real PDFs into clean, LLM-ready Markdown.

Latest posts

Writing

Long-form, practical posts on PDF parsing, OCR, document structure and shipping the tool.

Secure file upload in Go: 7 attacks and how we mitigated them

An engineering case study: type spoofing, disk DoS, path traversal, SSRF, replay and device spoofing – and the Go-backend defenses for each.

June 29, 2026 · ~13 min read

Read the post

What actually breaks when you turn PDFs into Markdown

A practical breakdown of the failure modes – tables, reading order, OCR, formulas, long-running jobs – and the trade-offs a converter has to make.

June 29, 2026 · ~7 min read

Read the post

Looking for how-tos?

The blog is for deep dives and opinion. For task-focused, answer-first walkthroughs – OCR, tables, RAG, per-LLM recipes – see the guides.