The web data API with no exit cost
You keep your pipeline, your data, and your budget when you leave.
The engine is AGPL-3.0, the benchmark is public, and the API is identical self-hosted or in our Cloud.
{
"success": true,
"data": {
"url": "https://stripe.com/docs/api",
"markdown": "# Stripe API Reference\n\nThe Stripe API is organized around REST...",
"metadata": {
"title": "Stripe API Reference",
"ogTitle": "Stripe API Reference",
"language": "en"
},
"links": [
"https://stripe.com/docs/api/authentication",
"https://stripe.com/docs/api/charges",
"https://stripe.com/docs/api/customers"
]
}
}
Integrations
Works with your stack
Search, scrape, crawl, and extract — one API
Search
Search the web and get full content from results.
Scrape
Get LLM-ready data from websites. Markdown, JSON, raw or cleaned HTML.
Browse
Interactive browser sessions over CDP. Navigate, click, fill — perfect for agents.
Crawl
Crawl entire sites. All pages structured and returned as clean data.
Map
Discover all URLs on a domain. Fast sitemap generation.
from crw import CRW

crw = CRW(api_key="your-api-key")

# Scrape any URL to clean markdown
result = crw.scrape("https://example.com")
print(result["markdown"])

# Search the web with full page content
results = crw.search("best rust web scrapers")
for r in results:
    print(r["title"], r["url"])

Scrape
Any URL to clean markdown, JSON, or HTML. Full JS rendering without Puppeteer overhead.
Search
Web search with full page content. Multi-engine aggregation with structured results.
Browse
Interactive browser sessions over CDP — exposed via the crw-browse MCP server, not a REST endpoint. Navigate, click, fill for agents that need state.
Crawl
Crawls entire sites and respects robots.txt. All pages structured and returned as clean data.
Map
Discover all URLs on a domain. Fast sitemap generation without full page loads.
Works where your agents already live
Skill. Give your agent easy access to real-time web data.
MCP. Connect any MCP-compatible client to the web in seconds.
Agent Onboarding. Are you an AI agent? Fetch this skill to sign up, get an API key, and start building.
MCP Compatible
Python SDK
crw on PyPI. Scrape, crawl, and map in zero-config subprocess mode or cloud mode.
npm / npx
crw-mcp on npm. Cross-platform binary distribution for MCP and CLI.
LangChain
langchain-crw on PyPI. Document loader for RAG chains.
CrewAI
crewai-crw on PyPI. Full tool suite for CrewAI agents.
n8n
n8n-nodes-crw on npm. Visual workflow automation node.
Dify
crw-dify-plugin. Scrape, crawl, and map tools for Dify workflows.
Local-first by design.
Open source transparency
Industry-leading reliability. Handles JS-heavy pages, anti-bot protections, and dynamic content. No proxies, no puppets, just clean data.
Public benchmark. Methodology, dataset, and run scripts are open. Reproduce the full latency distribution yourself with one command.
- Local-first: Self-host next to your app so requests never leave your network.
- No exit cost: Identical API self-hosted or in our Cloud. Switch either way.
- AGPL-3.0: Audit every line of the engine. No vendor lock-in.
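If the API really is identical in both deployments, moving between them should mean changing only the base URL. The sketch below builds (but does not send) the same scrape request for two targets. It assumes a Firecrawl-style `POST /v1/scrape` endpoint with bearer-token auth, which this page's compatibility claim suggests but does not spell out. Both base URLs are placeholders, and the path, body fields, and auth header should be checked against the API reference.

```python
import json
import urllib.request

# Placeholder base URLs (assumptions): substitute your own host and the real Cloud URL.
SELF_HOSTED_BASE = "http://localhost:3000"
CLOUD_BASE = "https://cloud.example.invalid"


def build_scrape_request(base_url: str, api_key: str, target_url: str) -> urllib.request.Request:
    """Build a scrape request; only `base_url` differs between deployments."""
    # Assumed Firecrawl-style body: the page to fetch and the output formats wanted.
    payload = json.dumps({"url": target_url, "formats": ["markdown"]}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/scrape",  # assumed endpoint path
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


local_req = build_scrape_request(SELF_HOSTED_BASE, "your-api-key", "https://example.com")
cloud_req = build_scrape_request(CLOUD_BASE, "your-api-key", "https://example.com")

# Same method, same body, same headers; only the host differs.
print(local_req.full_url)
print(cloud_req.full_url)
```

Keeping the base URL in a single setting (an environment variable, for example) turns a switch in either direction into a config change rather than a code change.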
A labeled URL set, run against production APIs.
- Full latency distribution — median, P95, and errors.
- Same dataset, different run conditions, disclosed.
- One command reproduces every number yourself.
A labeled query set, run against production APIs.
- Methodology and run scripts are public.
- No cherry-picked headline — read the whole distribution.
- Reproduce it on your own infrastructure.
Use well-known tools
Integrates with the tools and workflows you already use.
Code you can trust
Developed transparently and collaboratively. Join our community of contributors.
Ship faster without building infrastructure
Media parsing. CRW can parse and output content from web-hosted PDFs, DOCX files, and more.
Smart wait. CRW intelligently waits for content to load, making scraping faster and more reliable.
Cached when you need it. Selective caching: you choose your caching patterns, on top of a growing web index.
Enhanced mode. Reaches every corner of the web with comprehensive coverage and high reliability.
Actions. Click, scroll, write, wait, press and more before extracting content.
Anti-detection. Rotates headers, handles CAPTCHAs, and mimics real browsers. No proxy setup needed.
Predictable pricing
Free
500 credits
A lightweight way to try scraping.
- 500 credits/month
- 2 concurrent requests
- Community support
Hobby
5,000 credits
Great for side projects and small tools.
Launch pricing — ends June 1
- 5,000 credits/month
- 5 concurrent requests
- Basic support
- +1,000 for $9
Standard
100,000 credits
Perfect for scaling with less effort.
Launch pricing — ends June 1
- 100,000 credits/month
- 50 concurrent requests
- Standard support
- +35,000 for $47
Growth
500,000 credits
Built for high volume and speed.
Launch pricing — ends June 1
- 500,000 credits/month
- 100 concurrent requests
- Priority support
- +175,000 for $177
Scale
1,000,000 credits
For teams scaling their data pipelines.
Launch pricing — ends June 1
- 1,000,000 credits/month
- 150 concurrent requests
- Priority support
- +500,000 for $299
Concurrency figures show each plan's provisioned target. During the current capacity rollout (as of 2026-05-17), sustained concurrent throughput may be lower than the figure shown while infrastructure scaling completes; requests over live capacity receive a 503 with a Retry-After header. See Rate Limits for retry guidance.
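A client can absorb these 503 responses by waiting for the advertised interval before retrying. The following is a minimal sketch, not the official retry guidance. `send` is a hypothetical callable standing in for whatever HTTP call you make, returning `(status, headers, body)`. Only the delta-seconds form of `Retry-After` is handled: an HTTP-date value or a missing header falls back to a default delay.

```python
import time
from typing import Callable, Mapping, Tuple

# (status code, response headers, body) as returned by a hypothetical transport.
Response = Tuple[int, Mapping[str, str], bytes]


def retry_after_seconds(headers: Mapping[str, str], default: float = 1.0, cap: float = 60.0) -> float:
    """Read a delta-seconds Retry-After value, clamped to [0, cap]."""
    # Note: the lookup is case-sensitive; normalize header names if your client does not.
    raw = headers.get("Retry-After")
    try:
        delay = float(raw)
    except (TypeError, ValueError):
        # Missing header or an HTTP-date value: this sketch does not parse dates.
        delay = default
    return max(0.0, min(delay, cap))


def call_with_retry(
    send: Callable[[], Response],
    max_attempts: int = 5,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call `send` until it returns a non-503 status or attempts run out."""
    for attempt in range(1, max_attempts + 1):
        status, headers, body = send()
        if status != 503 or attempt == max_attempts:
            # Success, a different error, or the final 503: hand it to the caller.
            return status, headers, body
        sleep(retry_after_seconds(headers))
    raise ValueError("max_attempts must be at least 1")
```

Passing `sleep` as a parameter keeps the function testable without real delays, and the cap keeps one large `Retry-After` value from stalling a worker indefinitely.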
Built for teams shipping production AI
AI Chat & RAG
Feed clean web data to LLMs. Build RAG pipelines with structured markdown output.
Lead Enrichment
Scrape company pages, LinkedIn profiles, and directories to enrich your CRM data.
Market Research
Monitor competitors, track pricing changes, and analyze market trends at scale.
AI Agents
Give your AI agents web access via MCP. Let them search, scrape, and interact autonomously.
Content Aggregation
Crawl news sites, blogs, and forums. Aggregate content for analysis or republishing.
Deep Research
Systematic web research with full-page extraction. Build knowledge bases from the open web.
Stop building scrapers.
Start shipping products.
Your agent needs fresh answers, not stale embeddings
Search + scrape live web results in one API call. Your agent always works with current information, not last month's crawl.
Stop assembling browser fleets to get clean data
One self-contained binary. No Docker, no Puppeteer, no proxy rotation. Just deploy and call the API.
Drop into your agent as an MCP tool
Expose scrape, crawl, map, and search as MCP tools. Claude Code, Cursor, and custom agents get web access in one config.
Embed directly in your stack — no sidecar needed
Self-host the binary alongside your app. Same machine, same network. No external calls when latency matters.
Ship RAG that survives real users
Clean markdown from JS-heavy pages, SPAs, and anti-bot sites. Your pipeline gets structured content, not broken HTML.
Firecrawl-compatible API — switch your base URL
Same endpoint shapes, same SDK patterns. Migrate without rewriting a single line of integration code.
Know where every answer came from
Every response includes source URLs and metadata. Your users can verify, your team can debug, trust stays intact.
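Provenance can be read straight off a response. This sketch parses a trimmed copy of the sample response shown at the top of this page and extracts the source URL, title, and outbound links. The field names come from that sample. The helper name `extract_provenance` and the shape of the returned dict are illustrative choices, not part of the SDK.

```python
import json

# Trimmed copy of the sample response shown at the top of this page.
# A raw string keeps the JSON "\n" escapes intact for json.loads.
SAMPLE_RESPONSE = r'''
{
  "success": true,
  "data": {
    "url": "https://stripe.com/docs/api",
    "markdown": "# Stripe API Reference\n\nThe Stripe API is organized around REST...",
    "metadata": {"title": "Stripe API Reference", "language": "en"},
    "links": [
      "https://stripe.com/docs/api/authentication",
      "https://stripe.com/docs/api/charges",
      "https://stripe.com/docs/api/customers"
    ]
  }
}
'''


def extract_provenance(raw: str) -> dict:
    """Return where a scraped document came from (hypothetical helper)."""
    body = json.loads(raw)
    if not body.get("success"):
        raise ValueError("response did not report success")
    data = body["data"]
    metadata = data.get("metadata", {})
    return {
        "source_url": data["url"],
        "title": metadata.get("title"),
        "language": metadata.get("language"),
        "outbound_links": list(data.get("links", [])),
    }


provenance = extract_provenance(SAMPLE_RESPONSE)
print(provenance["source_url"], "-", provenance["title"])
```

Storing this record alongside each chunk you index lets an answer cite the page it came from.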
Lower-latency, local-first scraping
Self-host next to your app so requests never leave your network. Full latency distribution and one-command repro live on /benchmarks.
Public benchmark, not marketing math
Methodology, dataset, and run scripts are open. Reproduce the numbers yourself on /benchmarks.
Open source, self-host with zero license cost
AGPL-3.0. Run on your infra with unlimited requests. Audit every line. No vendor lock-in.
Rust core — no GC pauses, no runtime bloat
Memory-safe engine built for predictable latency. Your agents get consistent response times, not GC spikes.
5 endpoints cover the whole workflow
Scrape, crawl, map, search, and extract structured data. One tool instead of five different services.
Works with Python, Node.js, LangChain, n8n, and more
Official SDKs and native integrations. Add web data to your existing stack in minutes, not days.
Start free. Pay only when you scale.
500 free credits, no card required. Standard plan: $69/mo for 100K credits. Predictable, no surprise bills.
Ready to ship?
Give your agent reliable web access in minutes. 500 free credits, no credit card required.
Are you an AI agent? Get an API key here