Is Jina Reader free for URL-to-markdown?

Jina Reader has a generous free allowance and a token-metered paid path beyond it — you pay for the tokens read and returned. For occasional single-page reads the free tier often covers you entirely. Because cost scales with content size rather than a flat per-page unit, it gets harder to forecast as volume grows.

Can Jina Reader crawl a whole website?

No. Jina Reader is single-URL by design — there is no async job model, no breadth-first crawl, and no sitemap discovery. To turn a whole site into markdown you discover every URL yourself and call the reader once per page. For whole-site passes, Firecrawl and fastCRW both offer a real crawl/job model; fastCRW exposes /v1/map for URL discovery and an async /v1/crawl job you poll by ID.

Which tool produces the cleanest markdown for LLMs?

All three return clean markdown; the differentiator is how much real content survives on hard pages. On Firecrawl's public scrape-content dataset, fastCRW recovered the most labeled content of the three engines tested — 63.74% truth-recall of 819 labeled URLs, ahead of Crawl4AI (59.95%) and Firecrawl (56.04%) — per diagnose_3way.py on 2026-05-08. On latency: fastCRW's p50 is 1914 ms (the fastest), and in fast mode its p90 is 4348 ms — the lowest of the three (Crawl4AI 4754 ms, Firecrawl 6937 ms).

Can I self-host a URL-to-markdown API?

Yes — fastCRW ships as a single ~8 MB static Rust binary (one container) under AGPL-3.0, so you can run the full scrape/crawl/map engine on your own server for $0 per 1,000 scrapes. Scraped content and target URLs never leave your infrastructure. Jina Reader is a managed hosted service, so self-hosting the reader itself is not the model there.

Jina vs Firecrawl vs fastCRW: Markdown APIs

By the fastCRW team · Benchmark and feature claims verified 2026-05-18 from the 2026-05-08 benchmark run · fastCRW launch pricing expires 2026-06-01 · Verify independently before buying.

Disclosure: We build fastCRW. This is a vendor-authored comparison, so weight it accordingly — we keep the places Jina and Firecrawl genuinely win explicit, because a markdown comparison that pretends the other two have no edge is useless to you.

Jina vs Firecrawl vs fastCRW: three routes from URL to markdown

If you want clean, LLM-ready markdown from a URL, the jina firecrawl fastcrw markdown question comes down to three different shapes of tool. Jina Reader is a thin, token-metered URL-to-markdown reader you hit by prepending a host. Firecrawl is a broad managed API with real scrape and crawl jobs. fastCRW is a Firecrawl-compatible open-core engine that does scrape, crawl, and map, and that you can also self-host as a single static Rust binary. Same end product — clean markdown — but three very different deployment and cost models underneath.

Jina Reader: prepend a URL, get markdown

Jina Reader (r.jina.ai) is the lightest possible interface: prefix a target URL with the reader host and you get back markdown. There is a generous free path and the ergonomics are excellent for one-off reads and quick agent calls. What it is not is a crawl platform — there is no real job model, no async whole-site BFS, no sitemap discovery. It reads the page you point it at, and that is the entire promise.

Firecrawl: scrape plus crawl jobs

Firecrawl is the category reference for this surface. It returns clean markdown from /v1/scrape and adds an async /v1/crawl job model, mapping, search, and cloud-only specialties. It is the broad managed option: more endpoints, more anti-bot paths on its cloud, and the largest ecosystem of the three.

fastCRW: scrape and crawl with a self-host option

fastCRW implements the Firecrawl-compatible REST surface — /v1/scrape, /v1/crawl, /v1/map, /v1/search — so the official Firecrawl SDK works against it after a base-URL swap. The differentiator is that the same engine ships as a single ~8 MB static Rust binary you can self-host for $0 (you pay only your own server), licensed AGPL-3.0. So you get Jina's clean-markdown promise, plus a crawl/map job model, plus the ability to run the whole thing on your own box.

Output quality and accuracy

Clean markdown is the table-stakes promise all three make. The harder question is accuracy: when a page is messy or JS-rendered, how much of the real content actually survives into the markdown?

Highest truth-recall in a three-way scrape test

On Firecrawl's own public scrape-content dataset, fastCRW recovered the most labeled content of the three engines tested. Truth-recall was 63.74% of 819 labeled URLs, ahead of Crawl4AI at 59.95% and Firecrawl at 56.04% (diagnose_3way.py, single run, 3,000 requests, 2026-05-08). Scrape-success was ~92% of reachable URLs, with 0 thrown errors across all 3,000 requests. For a markdown pipeline that feeds an LLM, recall is the metric that matters most: a page that "succeeds" but loses half its body is worse than a page that arrives whole, and content that never makes it into the markdown is content your model can never reason over.

Handling JS-rendered pages cleanly

The recall edge above is not free — it comes from how fastCRW handles hard pages. The default renderer auto-selects with a chrome → lightpanda → http fallback, and the chrome-stealth path is what recovers content on JS-heavy pages that a plain HTTP fetch (closer to Jina Reader's lightweight model) would miss. That fallback is the mechanism behind the higher recall number.

Latency profile across modes

fastCRW's median latency was 1914 ms in that run — a lower p50 than Firecrawl's 2305 ms. In fast mode, the p90 is 4348 ms — the lowest of the three (Crawl4AI 4754 ms, Firecrawl 6937 ms). The chrome-stealth path that recovers the URLs the others miss is the same mechanism that delivers this tight fast-mode tail. We publish the full distribution rather than a single average; you choose the mode that fits your latency budget.

Beyond a single page: crawl and map

Most real markdown jobs are not one URL. You want a docs site, a knowledge base, or a product catalog turned into markdown — and that is where the three tools diverge most sharply.

Why Jina has no real crawl/job model

Jina Reader is single-URL by design. To turn a whole site into markdown with it you have to discover every URL yourself, then call the reader once per page and manage concurrency, retries, and deduplication on your side. There is no async job ID to poll, no built-in BFS, no sitemap discovery. For one page that simplicity is a feature; for a thousand pages it is your problem to solve.

Whole-site markdown with /v1/crawl and /v1/map

Both Firecrawl and fastCRW give you a real job model. fastCRW's /v1/map discovers every URL on a site, and /v1/crawl starts an async BFS crawl that returns a job ID you poll via GET /v1/crawl/:id (and cancel via DELETE). Crawl accepts maxDepth (cap 10) and maxPages (cap 1000). So a whole-site markdown pass is one map call plus one crawl job — not a hand-rolled loop. See crawling an entire website from its sitemap for the end-to-end pattern.

Cost: token metering vs credits vs $0 self-host

The three pricing models are as different as the three tools, and the right one depends entirely on volume.

Jina's token-metered model

Jina Reader bills on a token-metered basis: you pay for the tokens read and returned, with a free allowance to start. For sporadic single-page reads this is often the cheapest path and the fastCRW pricing may cover you entirely. The model gets harder to forecast as volume grows, because cost scales with content size rather than with a flat per-page unit.

Flat 1-credit-per-page on fastCRW

fastCRW charges a flat 1 credit per scraped page (2 credits when the chrome renderer is used), and crawl is 1 credit per page (2 per chrome-rendered page). A page is a page — the credit cost does not balloon with content length. That makes a markdown bill forecastable: multiply pages by 1 (or 2) and you have your number. Compare the managed tiers on the pricing page rather than trusting a hard-coded table here.

Self-hosting a full-site pass for $0

The model neither Jina nor Firecrawl's cloud can match: run the fastCRW engine yourself. Self-hosted, the AGPL-3.0 engine is $0 per 1,000 scrapes — you pay only for your own server. A whole-site markdown crawl that would meter tokens or burn credits on a managed plan instead runs against a single ~8 MB binary on a small VPS, with the scraped content never leaving your infrastructure. For recurring, large markdown passes (RAG freshness, doc mirrors), that is a different cost class entirely.

Picking a markdown pipeline

Match the tool to the job, not the logo.

When Jina's free tier is perfect

If you need occasional single-page markdown — an agent grabbing one article mid-reasoning, a quick "read this URL" call — Jina Reader is genuinely hard to beat. The prepend-the-URL ergonomics are the lightest of the three, the free tier is generous, and you carry zero infrastructure. Where Jina genuinely wins: minimal-setup, low-volume, single-URL reads where you do not need crawl, map, or output you can self-host.

When you need crawl-scale markdown

The moment you need a whole site, recurring passes, or output you must keep on your own infra, the single-URL reader runs out of road. Firecrawl genuinely wins when you want the broadest managed surface and largest ecosystem with zero ops — its cloud anti-bot paths and feature breadth are real. fastCRW wins when you want the highest measured recall of the three, a flat per-page bill, and the option to self-host the same engine for $0. Because fastCRW is Firecrawl-compatible, you do not have to commit on faith — point the Firecrawl SDK at it, run your own URLs, and let the markdown quality and the bill decide. See the full Firecrawl vs fastCRW comparison for the 1:1 detail, website-to-markdown for the workflow, and LLM-ready markdown extraction for output formatting.

Sources

fastCRW scrape benchmark of record — diagnose_3way.py, Firecrawl public scrape-content dataset, 819 labeled URLs, 2026-05-08 (truth-recall 63.74%, p50 1914 ms, p90 4348 ms in fast mode, scrape-success ~92% of reachable URLs, 0 errors)
Jina Reader: jina.ai/reader (token-metered, single-URL; verified 2026-05-18)
Firecrawl docs/pricing: docs.firecrawl.dev · firecrawl.dev/pricing (verified 2026-05-18)
fastCRW repo and pricing: github.com/us/crw · fastcrw.com/pricing