
HTML to Markdown {API}
Convert any URL to LLM-ready Markdown with one call.
Strip ads, nav chrome, and HTML noise to get clean GitHub Flavored Markdown ready for LLMs and RAG. Handles JavaScript rendering and bot-protection bypass automatically.
Tunable via query params: includeLinks, includeImages, shortenBase64Images.
What You Get
Each request converts a live webpage into structured, LLM-ready Markdown.
GitHub Flavored Markdown
Tables, headings, lists, and code blocks fully converted
Configurable links & images
Control whether hyperlinks and image references are preserved
Base64 image shortening
Prevent token bloat from inline image data in AI pipelines
Automatic proxy escalation
Scrapes blocked and protected sites transparently
How It Works
We fetch the page, handle proxy escalation, and convert HTML to Markdown for you.
Send a URL with preferences
Specify includeLinks, includeImages, and shortenBase64Images as query params
Page is fetched
Proxy escalation handles any blocks automatically, no configuration needed
HTML converted to GFM
The full HTML document is parsed and converted to clean Markdown
Markdown returned
Ready to pass directly to any LLM, vector store, or content pipeline
API Response
Extracted Markdown for context.dev
GET /v1/web/scrape/markdown?url=https://context.dev&includeLinks=true{
"success": true,
"url": "https://context.dev",
"markdown": "# Context.dev — The Internet's Brand API\n\nAPI to personalize your product with logos, colors,\nand company info from any domain.\n\n## Features\n\n- **Company Logos** — Fetch high-res logos from any domain\n- **Brand Colors** — Extract full color palettes\n- **Company Data** — Address, socials, description and more\n..."
}Frequently asked questions
Common questions about the Context.dev HTML to Markdown API.
Am I billed for failed requests?
How do I convert HTML to Markdown from a URL?
Why convert HTML to Markdown for LLMs and RAG?
What Markdown format does the API return?
Can I include or exclude links and images?
How is this different from Jina Reader or Firecrawl?
Does it work on JavaScript-heavy sites?
Is it free for AI agent use?
Ship an agent that actually knows things.
Free tier, 10-minute integration, and the same API powering agents at Mintlify, daily.dev, and Propane. No credit card to start.













