Why choose CRW over Firecrawl?

CRW is built in Rust — not Node.js. That means no V8 overhead, no garbage collection pauses, and native async I/O, shipped as one self-contained binary you can run locally. The engine is AGPL-3.0 with a public, one-command-reproducible benchmark, and the API is identical self-hosted or in our Cloud — so there is no exit cost.

How is CRW cheaper?

CRW's Standard plan is $69/mo for 100,000 credits during launch pricing (normally $99/mo), and one credit pool covers search, scrape, crawl, map, and extract, with no separate metering per endpoint. Self-hosting the AGPL-3.0 engine is free: $0 per 1,000 scrapes, versus $0.83–5.33 on Firecrawl's hosted tiers.
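As a rough worked example of the figures above, the sketch below converts the Standard plan into a price per 1,000 credits. It uses only the prices quoted here and assumes one credit per scrape; the function name is illustrative, not part of any CRW SDK.

```python
def price_per_thousand(monthly_price_usd: float, monthly_credits: int) -> float:
    """Return the effective price in USD for 1,000 credits on a plan."""
    return round(monthly_price_usd * 1000 / monthly_credits, 2)


# Standard plan: 100,000 credits per month.
launch = price_per_thousand(69, 100_000)   # launch pricing
regular = price_per_thousand(99, 100_000)  # normal pricing

print(f"launch:  ${launch:.2f} per 1,000 credits")
print(f"regular: ${regular:.2f} per 1,000 credits")
```

Both results can be set against the $0.83–5.33 range quoted for Firecrawl's hosted tiers.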

Is the output quality the same?

CRW supports the same core output formats — Markdown, structured JSON via jsonSchema, plus raw and cleaned HTML — on the overlap surface (/scrape, /crawl, /search, /map). Response field names and error envelopes have minor divergence, and screenshot output is not currently exposed. See the compatibility matrix in docs for the row-level diff.

Can I migrate from Firecrawl?

CRW is Firecrawl-compatible on the overlap surface (/scrape, /crawl, /map, /search). Endpoint structure and request shape match closely; response field names and error envelopes have minor divergence, so most migrations are small adjustments rather than a pure URL swap. The migration guide in our docs walks through it.

Can I self-host CRW?

Yes. CRW is fully open source (AGPL-3.0). Docker setup takes under 5 minutes. Self-hosting is free; you only pay for the cloud API if you want managed infrastructure.

Does CRW handle JavaScript-rendered pages?

Yes. Built-in browser rendering handles SPAs, React apps, and dynamic content. No Puppeteer or Playwright to install — the browser engine is orchestrated by the CRW binary itself.

What are credits?

Each API call uses credits: /scrape = 1 credit, /crawl = 1 credit per page, /search = 1 credit per query, /map = 1 credit, /browse = 1 credit per session. The free tier gives you a one-time lifetime 500 credits (not a monthly meter), no credit card required.
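To make the credit arithmetic concrete, here is a minimal sketch that totals the credits for a planned workload using the per-call costs listed above. The names `CREDIT_COST` and `estimate_credits` are hypothetical helpers, not CRW APIs.

```python
# Credits charged per unit of work, taken from the list above.
CREDIT_COST = {
    "scrape": 1,  # per request
    "crawl": 1,   # per page crawled
    "search": 1,  # per query
    "browse": 1,  # per session
}

FREE_TIER_CREDITS = 500  # one-time allowance, not a monthly meter


def estimate_credits(usage: dict[str, int]) -> int:
    """Sum the credits for a usage dict such as {"scrape": 10, "crawl": 200}."""
    unknown = set(usage) - set(CREDIT_COST)
    if unknown:
        raise ValueError(f"no known credit cost for: {sorted(unknown)}")
    return sum(CREDIT_COST[kind] * count for kind, count in usage.items())


planned = {"scrape": 120, "crawl": 300, "search": 50}
needed = estimate_credits(planned)
verdict = "fits within" if needed <= FREE_TIER_CREDITS else "exceeds"
print(f"{needed} credits needed; this {verdict} the free tier")
```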

Is there an MCP server?

Yes. Native Model Context Protocol support. Connect CRW to Claude, Cursor, Windsurf, or any MCP-compatible AI agent. One command to start.

Does CRW support structured JSON extraction with a schema?

Yes. Send formats: ["json"] together with a jsonSchema in your scrape request and CRW returns validated structured data using LLM function calling. Works with Anthropic and OpenAI models — compatible with Firecrawl v2 extraction format.
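A minimal sketch of what such a request body might look like. Only `formats: ["json"]` and `jsonSchema` come from the answer above; the `url` field name, the top-level placement of `jsonSchema`, and the example schema are assumptions, so check the API reference before relying on them.

```python
import json


def build_extract_request(url: str, schema: dict) -> dict:
    """Build a scrape request body asking for schema-validated JSON output."""
    return {
        "url": url,              # assumed field name for the target page
        "formats": ["json"],
        "jsonSchema": schema,
    }


# A hypothetical JSON Schema describing the fields we want back.
product_schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "price": {"type": "number"},
    },
    "required": ["name", "price"],
}

body = build_extract_request("https://example.com/product", product_schema)
print(json.dumps(body, indent=2))
```

The printed body is what you would send as the JSON payload of a scrape call.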

500 Free Credits — Get Started

The web data API with
no exit cost

You keep your pipeline, your data, and your budget when you leave.
The engine is AGPL-3.0, the benchmark is public, and the API is identical self-hosted or in our Cloud.

AGPL-3.0 open engine · Public benchmark & methodology · Identical API self-hosted or Cloud

[ .JSON ]

{
  "success": true,
  "data": {
    "url": "https://stripe.com/docs/api",
    "markdown": "# Stripe API Reference\n\nThe Stripe API is organized around REST...",
    "metadata": {
      "title": "Stripe API Reference",
      "ogTitle": "Stripe API Reference",
      "language": "en"
    },
    "links": [
      "https://stripe.com/docs/api/authentication",
      "https://stripe.com/docs/api/charges",
      "https://stripe.com/docs/api/customers"
    ]
  }
}

Scraped in 813ms
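The sketch below shows one way to read a response shaped like the sample above. The field names come from that sample; the `summarize` helper and its error handling are illustrative assumptions.

```python
import json

# Trimmed copy of the sample response shown above.
raw = r"""
{
  "success": true,
  "data": {
    "url": "https://stripe.com/docs/api",
    "markdown": "# Stripe API Reference\n\nThe Stripe API is organized around REST...",
    "metadata": {"title": "Stripe API Reference", "language": "en"},
    "links": ["https://stripe.com/docs/api/authentication"]
  }
}
"""


def summarize(response: dict) -> dict:
    """Pull the commonly used fields out of a scrape response."""
    if not response.get("success"):
        raise RuntimeError("scrape did not succeed")
    data = response["data"]
    return {
        "url": data["url"],
        "title": data.get("metadata", {}).get("title"),
        "heading": data["markdown"].splitlines()[0],
        "link_count": len(data.get("links", [])),
    }


summary = summarize(json.loads(raw))
print(summary)
```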

Integrations
Works with your stack

Python

JavaScript

Rust

Ruby

cURL

LangChain

CrewAI

n8n

Vercel

Dify

Botpress

[ 01 / 08 ] · MAIN FEATURES

Developer First

Search, scrape, crawl, and extract — one API

Five endpoints that cover the full pipeline. No glue code, no separate tools.

Search

Search the web and get full content from results.

Scrape

Get LLM-ready data from websites. Markdown, JSON, raw or cleaned HTML.

Browse

Interactive browser sessions over CDP. Navigate, click, fill — perfect for agents.

Crawl

Crawl entire sites. All pages structured and returned as clean data.

Map

Discover all URLs on a domain. Fast sitemap generation.

from crw import CRW

crw = CRW(api_key="your-api-key")

# Scrape any URL to clean markdown
result = crw.scrape("https://example.com")
print(result["markdown"])

# Search the web with full page content
results = crw.search("best rust web scrapers")
for r in results:
    print(r["title"], r["url"])

[ .MD ]

# CRW

CRW is a blazing-fast web scraping
tool built in Rust. Extract clean
data from any website.

## Features

- Scrape: Markdown from any page
- Search: Search + scrape the web
- Map: Discover all site URLs
- Crawl: Full site extraction

/scrape · 1 credit

Scrape

Any URL to clean markdown, JSON, or HTML. Full JS rendering without Puppeteer overhead.

/search · 1 credit / query

Search

Web search with full page content. Multi-engine aggregation with structured results.

browse (MCP) · MCP tool

Browse

Interactive browser sessions over CDP — exposed via the crw-browse MCP server, not a REST endpoint. Navigate, click, fill for agents that need state.

/crawl · 1 credit / page

Crawl

Crawl entire sites while respecting robots.txt. All pages structured and returned as clean data.
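For readers unfamiliar with what respecting robots.txt involves, here is a small illustration using Python's standard `urllib.robotparser`. It is not CRW's implementation; the robots.txt body, the user agent name, and the `allowed` helper are made up for the example.

```python
from urllib.robotparser import RobotFileParser

# An example robots.txt body; a real crawler would fetch this from the site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(url: str, user_agent: str = "example-crawler") -> bool:
    """Return True if robots.txt permits this user agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


print(allowed("https://example.com/docs/intro"))
print(allowed("https://example.com/private/data"))
```

A crawler that honors these rules skips any URL for which the check returns False.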

/map · 1 credit

Map

Discover all URLs on a domain. Fast sitemap generation without full page loads.

[ 02 / 08 ] · POWER YOUR AGENT

Integrations

Works where your
agents already live

Python, Node.js, cURL, MCP — drop into your existing stack in minutes.

One command

Skill. Give your agent easy access to real-time web data.

npx crw-mcp

Quick config

MCP. Connect any MCP-compatible client to the web in seconds.

{
  "mcpServers": {
    "crw": { "command": "npx", "args": ["crw-mcp"] }
  }
}

For AI agents

Agent Onboarding. Are you an AI agent? Fetch this skill to sign up, get an API key, and start building.

View the skill

MCP Compatible

Claude Code · Claude Desktop · Cursor · Windsurf · Cline · Continue.dev · OpenAI Codex CLI

Python SDK

crw on PyPI. Scrape, crawl, map with zero-config subprocess or cloud mode.

npm / npx

crw-mcp on npm. Cross-platform binary distribution for MCP and CLI.

LangChain

langchain-crw on PyPI. Document loader for RAG chains.

CrewAI

crewai-crw on PyPI. Full tool suite for CrewAI agents.

n8n

n8n-nodes-crw on npm. Visual workflow automation node.

Dify

crw-dify-plugin. Scrape, crawl, and map tools for Dify workflows.

[ 03 / 08 ] · CORE

Built for Performance

Local-first by design.
Open source transparency

A public, one-command-reproducible benchmark. Every line of code is open.

No proxy headaches

Industry-leading reliability. Handles JS-heavy pages, anti-bot protections, and dynamic content. No proxies, no puppets, just clean data.

See benchmarks

Reproducible, not marketing math

Public benchmark. Methodology, dataset, and run scripts are open. Reproduce the full latency distribution yourself with one command.

See the benchmark & methodology

Local-first — Self-host next to your app — requests never leave your network.
No exit cost — Identical API self-hosted or in our Cloud. Switch either way.
AGPL-3.0 — Audit every line of the engine. No vendor lock-in.

Scrape Benchmark

A labeled URL set, run against production APIs.

Open methodology

Full latency distribution — median, P95, and errors.
Same dataset, different run conditions, disclosed.
One command reproduces every number yourself.

See the benchmark

Search Benchmark

A labeled query set, run against production APIs.

Open methodology

Methodology and run scripts are public.
No cherry-picked headline — read the whole distribution.
Reproduce it on your own infrastructure.

See the benchmark

Integrations

Use well-known tools

Already integrated with the tools and workflows you use.

See all integrations

Open Source

Code you can trust

Developed transparently and collaboratively. Join our community of contributors.

Check out our repo

[ 04 / 08 ] · FEATURES

Zero configuration

Ship faster without building infrastructure

JS rendering, anti-detection, media parsing, smart caching — handled. You focus on the product.

Docs to data

Media parsing. CRW can parse and output content from web-hosted PDFs, DOCX, and more.

Knows the moment

Smart wait. CRW intelligently waits for content to load, making scraping faster and more reliable.

Selective persistence

Cached, when you need it. Selective caching: you choose your caching patterns, backed by a growing web index.

Advanced web coverage

Enhanced mode. Reaches every corner of the web with comprehensive coverage and high reliability.

Interactive scraping

Actions. Click, scroll, write, wait, press and more before extracting content.

Built-in stealth

Anti-detection. Rotates headers, handles CAPTCHAs, and mimics real browsers. No proxy setup needed.

[ 05 / 08 ] · PRICING

Transparent

Predictable pricing

Credit-based pricing with no surprises. Start with 500 free credits, scale as you grow.

Monthly · Annual (save up to 15%)

Free

$0/mo

500 credits

A lightweight way to try scraping.

Get Started

500 credits (one-time, not monthly)
2 concurrent requests
Community support

Hobby

$13/mo (normally $19/mo)

5,000 credits

Great for side projects and small tools.

Launch pricing — ends June 1

Get Started

5,000 credits/month
5 concurrent requests
Basic support
+1,000 for $9

Popular

Standard

$69/mo (normally $99/mo)

100,000 credits

Perfect for scaling with less effort.

Launch pricing — ends June 1

Get Started

100,000 credits/month
50 concurrent requests
Standard support
+35,000 for $47

Growth

$279/mo (normally $399/mo)

500,000 credits

Built for high volume and speed.

Launch pricing — ends June 1

Get Started

500,000 credits/month
100 concurrent requests
Priority support
+175,000 for $177

Scale

$549/mo (normally $749/mo)

1,000,000 credits

For teams scaling their data pipelines.

Launch pricing — ends June 1

Get Started

1,000,000 credits/month
150 concurrent requests
Priority support
+500,000 for $299

Concurrency figures show each plan's provisioned target. During the current capacity rollout (as of 2026-05-17), sustained concurrent throughput may be lower than the figure shown while infrastructure scaling completes; requests over live capacity receive a 503 with a Retry-After header. See Rate Limits for retry guidance.
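A client can absorb these 503 responses by waiting for the interval the server suggests. The sketch below is one possible approach, not the official retry guidance: `call_with_retry`, the `(status, headers, body)` tuple shape, and the default wait are all assumptions, and the transport is passed in as a callable so that any HTTP library can be used.

```python
import time
from typing import Callable, Mapping, Tuple

Response = Tuple[int, Mapping[str, str], str]  # (status, headers, body)


def call_with_retry(
    send: Callable[[], Response],
    max_attempts: int = 4,
    default_wait: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call send(), waiting and retrying while the server answers 503."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        status, headers, body = send()
        if status != 503 or attempt == max_attempts:
            break
        # Retry-After may be absent or an HTTP date; fall back to a default.
        try:
            wait = float(headers.get("Retry-After", default_wait))
        except ValueError:
            wait = default_wait
        sleep(max(wait, 0.0))
    return status, headers, body


# Simulated server: over capacity twice, then succeeds.
replies = iter([
    (503, {"Retry-After": "2"}, ""),
    (503, {}, ""),
    (200, {}, "ok"),
])
waits: list = []
result = call_with_retry(lambda: next(replies), sleep=waits.append)
print(result[0], waits)
```

Injecting `sleep` keeps the example testable without real delays; in production the default `time.sleep` applies. Header lookup here is case-sensitive, so normalize header names if your HTTP client does not.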

[ 06 / 08 ] · USE CASES

Use Cases

Built for teams shipping
production AI

See how teams use CRW for agents, RAG pipelines, research, and enrichment.

AI Chat & RAG

Feed clean web data to LLMs. Build RAG pipelines with structured markdown output.

Learn more

Lead Enrichment

Scrape company pages, LinkedIn profiles, and directories to enrich your CRM data.

Learn more

Market Research

Monitor competitors, track pricing changes, and analyze market trends at scale.

Learn more

AI Agents

Give your AI agents web access via MCP. Let them search, scrape, and interact autonomously.

Learn more

Content Aggregation

Crawl news sites, blogs, and forums. Aggregate content for analysis or republishing.

Learn more

Deep Research

Systematic web research with full-page extraction. Build knowledge bases from the open web.

Learn more

[ 07 / 08 ] · ENABLES

What CRW enables

Stop building scrapers.
Start shipping products.

Your agent needs live web data. CRW handles the hard part so you can focus on what matters.

Your agent needs fresh answers, not stale embeddings

Search + scrape live web results in one API call. Your agent always works with current information, not last month's crawl.

Stop assembling browser fleets to get clean data

One self-contained binary. No Docker, no Puppeteer, no proxy rotation. Just deploy and call the API.

Drop into your agent as an MCP tool

Expose scrape, crawl, map, and search as MCP tools. Claude Code, Cursor, and custom agents get web access in one config.

Embed directly in your stack — no sidecar needed

Self-host the binary alongside your app. Same machine, same network. No external calls when latency matters.

Ship RAG that survives real users

Clean markdown from JS-heavy pages, SPAs, and anti-bot sites. Your pipeline gets structured content, not broken HTML.

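Firecrawl-compatible API on the overlap surface

Endpoint structure and request shape match closely on /scrape, /crawl, /map, and /search. Response field names and error envelopes diverge slightly, so expect small adjustments rather than a pure URL swap.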

Know where every answer came from

Every response includes source URLs and metadata. Your users can verify, your team can debug, trust stays intact.

Lower-latency, local-first scraping

Self-host next to your app so requests never leave your network. Full latency distribution and one-command repro live on /benchmarks.

Public benchmark, not marketing math

Methodology, dataset, and run scripts are open. Reproduce the numbers yourself on /benchmarks.

Open source, self-host with zero license cost

AGPL-3.0. Run on your infra with unlimited requests. Audit every line. No vendor lock-in.

Rust core — no GC pauses, no runtime bloat

Memory-safe engine built for predictable latency. Your agents get consistent response times, not GC spikes.

5 endpoints cover the whole workflow

Scrape, crawl, map, search, and extract structured data. One tool instead of five different services.

Works with Python, Node.js, LangChain, n8n, and more

Official SDKs and native integrations. Add web data to your existing stack in minutes, not days.

Start free. Pay only when you scale.

500 free credits, no card required. Standard plan: $69/mo for 100K credits. Predictable, no surprise bills.

[ 08 / 08 ] · FAQ

FAQ

Frequently
asked questions

Everything you need to know about CRW.

Get started

Ready to ship?

Give your agent reliable web access in minutes. 500 free credits, no credit card required.

Get started (500 free credits) · See pricing

Are you an AI agent? Get an API key here

Frequently asked questions

Why choose CRW over Firecrawl?

How is CRW cheaper?

Is the output quality the same?

Can I migrate from Firecrawl?

Can I self-host CRW?

Does CRW handle JavaScript-rendered pages?

What are credits?

Is there an MCP server?

Does CRW support structured JSON extraction with a schema?
