Integrations/Integration / TypeScript

TypeScript Web Scraping API — fastCRW [Firecrawl-Compatible]

Type-safe web scraping with TypeScript and fastCRW — a Firecrawl-compatible REST API. Use Zod to derive types from JSON schemas, validate extraction output at the boundary, and catch schema drift at the hour it breaks. AGPL-3.0, self-host free.

Published

June 13, 2026

Updated

June 13, 2026

Verdict

fastCRW is Firecrawl-compatible REST — the official @mendable/firecrawl-js SDK works against fastCRW by setting apiUrl. For TypeScript, the integration goes deeper than a base-URL swap: define one Zod schema, convert it to JSON Schema for the /v1/scrape request, and parse the response back through the same schema. You get the TypeScript type, the runtime validation gate, and the API extraction spec all from one source — schema drift throws at the boundary with the exact failing field, not silently three steps downstream.

Who This Is For

TypeScript developers who want typed web data — extracted records that actually match the shape your code expects.
Teams migrating off Firecrawl TS — one apiUrl change, nothing else.
AI pipeline engineers — structured extraction that feeds cleanly into typed service layers.
Full-stack engineers adding web enrichment — a typed fetch wrapper that integrates into existing Express / Fastify / tRPC services.

Setup

1. Install

npm install @mendable/firecrawl-js zod zod-to-json-schema
# or with bun:
bun add @mendable/firecrawl-js zod zod-to-json-schema

TypeScript types are included in @mendable/firecrawl-js. For zod-to-json-schema:

npm install -D @types/zod-to-json-schema   # if needed

2. Get an API key

export FASTCRW_API_KEY="fcrw_..."

The free tier ships 500 one-time lifetime credits. Plain scrape is 1 credit; crawl is 1 credit per page; search is 1 credit per query.

3. Create a singleton client

// src/client.ts
import FirecrawlApp from "@mendable/firecrawl-js";

export const app = new FirecrawlApp({
  apiKey: process.env.FASTCRW_API_KEY ?? "",
  apiUrl: process.env.FASTCRW_API_URL ?? "https://api.fastcrw.com",
});

Quickstart: Scrape a Page to Markdown

import { app } from "./client.js";

const doc = await app.scrapeUrl("https://example.com", {
  formats: ["markdown"],
  onlyMainContent: true,
});

if (!doc.success) {
  throw new Error(`scrape failed: ${doc.error}`);
}

const markdown: string = doc.markdown ?? "";
console.log(`scraped ${markdown.length} chars`);
console.log("title:", doc.metadata?.title);

Type-Safe JSON Extraction with Zod

This is where TypeScript's reliability promise is actually delivered. The pattern: one Zod schema drives the JSON extraction spec, the TypeScript type, and the runtime validation gate.

Step 1: Define the schema once

import { z } from "zod";
import { zodToJsonSchema } from "zod-to-json-schema";

// Single source of truth — never duplicated by hand
const ProductSchema = z.object({
  productName: z.string().min(1),
  priceUsd: z.number().positive(),
  inStock: z.boolean(),
  imageUrl: z.string().url().optional(), // optional field — explicit, not accidental
});

// TypeScript type derived from the schema — never drifts from it
type Product = z.infer<typeof ProductSchema>;

// JSON Schema for the extraction request — generated, not hand-written
const jsonSchema = zodToJsonSchema(ProductSchema, { name: "Product" });

Step 2: Send the extraction request

import { app } from "./client.js";

const res = await app.scrapeUrl("https://example.com/products/widget", {
  formats: ["json"],
  jsonSchema,
});

if (!res.success) {
  throw new Error(`extraction failed: ${res.error}`);
}

Step 3: Validate at the boundary

// .parse() throws ZodError naming the exact failing field — not a silent wrong value
const product: Product = ProductSchema.parse(res.data?.json);

// Use .safeParse() if you want to handle the error without throwing:
const result = ProductSchema.safeParse(res.data?.json);
if (!result.success) {
  console.error("extraction schema mismatch:", result.error.flatten());
  return;
}
const typed: Product = result.data; // fully typed, validated at runtime

When the target site renames a field or restructures the page, parse() rejects it immediately with the exact field that failed — you learn about the break the same hour it happens, not weeks later when downstream data is corrupted.

Cost: formats: ["json"] is billed as 1 scrape credit plus the managed-LLM token cost for that request, settled from real usage, versus 1 credit for plain markdown. LLM extraction runs on fastCRW's managed LLM. /v1/extract also accepts a urls array (up to 50 URLs per request) when you need the same schema across several pages.

Typed Batch Scraping with a Concurrency Pool

import { app } from "./client.js";

interface ScrapeResult {
  url: string;
  ok: boolean;
  chars: number;
  error?: string;
}

// Dependency-free bounded concurrency pool
async function pool<T, R>(
  items: T[],
  limit: number,
  worker: (item: T) => Promise<R>,
): Promise<R[]> {
  const results: R[] = new Array(items.length);
  let next = 0;

  async function run(): Promise<void> {
    while (next < items.length) {
      const i = next++;
      results[i] = await worker(items[i]);
    }
  }

  await Promise.all(Array.from({ length: Math.min(limit, items.length) }, run));
  return results;
}

const urls = [
  "https://docs.fastcrw.com",
  "https://fastcrw.com/pricing",
  "https://fastcrw.com/alternatives/firecrawl",
];

const out: ScrapeResult[] = await pool(
  urls,
  4,
  async (url): Promise<ScrapeResult> => {
    try {
      const d = await app.scrapeUrl(url, { formats: ["markdown"] });
      return { url, ok: d.success, chars: d.markdown?.length ?? 0 };
    } catch (e) {
      return { url, ok: false, chars: 0, error: String(e) };
    }
  },
);

console.table(out);

Latency note: fastCRW's p50 was 1914 ms on the 2026-05-08 benchmark (819 labeled URLs, diagnose_3way.py) — the fastest of the three tools tested. In fast mode, fastCRW's p90 is 4348 ms — the lowest of the three (Crawl4AI 4754 ms, Firecrawl 6937 ms). That same chrome-stealth fallback is what gives fastCRW the highest truth-recall of the three tools (63.74%). Size your pool limit and fetch timeouts from the p90, not the median. Full breakdown at /benchmarks/firecrawl-dataset.

Typed Crawl

import { app } from "./client.js";

const job = await app.crawlUrl("https://docs.fastcrw.com", {
  limit: 25, // cap: 1000
  maxDepth: 2, // cap: 10
  scrapeOptions: { formats: ["markdown"], onlyMainContent: true },
});

if (!job.success) throw new Error(job.error);

// job.data is typed as FirecrawlDocument[]
const pages = job.data.map((page) => ({
  url: page.metadata?.sourceURL ?? "",
  words: (page.markdown ?? "").split(/\s+/).length,
}));

console.table(pages);

Typed Web Search

import { app } from "./client.js";

const res = await app.search("typescript web scraping api 2026", { limit: 5 });
if (!res.success) throw new Error(res.error);

// res.data is typed as SearchResult[]
for (const r of res.data) {
  console.log(`${r.title} → ${r.url}`);
}

Using plain fetch with a typed wrapper

If you prefer to avoid the SDK:

const FASTCRW_BASE = process.env.FASTCRW_API_URL ?? "https://api.fastcrw.com";
const FASTCRW_KEY = process.env.FASTCRW_API_KEY ?? "";

interface ScrapeApiResponse<T = unknown> {
  success: boolean;
  data?: {
    markdown?: string;
    json?: T;
    metadata?: { title?: string; sourceURL?: string; statusCode?: number };
  };
  error?: string;
}

async function scrape<T = string>(
  url: string,
  options: { formats: ("markdown" | "json")[]; jsonSchema?: object } = {
    formats: ["markdown"],
  },
): Promise<ScrapeApiResponse<T>> {
  const res = await fetch(`${FASTCRW_BASE}/v1/scrape`, {
    method: "POST",
    headers: {
      Authorization: `Bearer ${FASTCRW_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ url, onlyMainContent: true, ...options }),
    signal: AbortSignal.timeout(25_000), // set above the p90 to avoid premature kills
  });

  if (!res.ok) throw new Error(`HTTP ${res.status}`);
  return res.json() as Promise<ScrapeApiResponse<T>>;
}

MCP Setup

fastCRW ships an MCP server (crw-mcp on npm) for AI agents in Claude Code, Cursor, or Windsurf. Adds scrape, crawl, map, and search as agent-callable tools:

{
  "mcpServers": {
    "fastcrw": {
      "command": "npx",
      "args": ["-y", "crw-mcp@latest"],
      "env": {
        "FASTCRW_API_KEY": "fcrw_...",
        "FASTCRW_API_URL": "https://api.fastcrw.com"
      }
    }
  }
}

See /integrations/mcp for full configuration options.

Good to Know

LLM extraction runs on fastCRW's managed LLM — if your stack standardizes on another model, extraction is the exception.
Stateless per request — no session is carried across calls; multi-step authenticated flows must be reconstructed per request.
Screenshots — formats: ["screenshot"] is served by /v2/scrape.
/v1/extract accepts a urls array (up to 50 URLs per request) and returns a results array — for larger sites, iterate /v1/scrape concurrently (pool helper above), use /v1/crawl, or call /v2/batch/scrape directly.

fastCRWlive

Scrape any URL, live

Get 500 free credits →

Sources

fastCRW scrape docs

/docs/scrape

fastCRW 3-way scrape benchmark

/benchmarks/firecrawl-dataset

FAQ

How do I make web scraping type-safe in TypeScript?

Declare the record you want as a Zod schema, validate scraped data against it at runtime, and derive the TypeScript type from that same schema with z.infer. A type annotation alone is a compile-time promise the runtime can break — scraped data arrives at runtime, so you need a runtime gate (Zod .parse()) at the boundary. One schema then governs the API contract, the TypeScript type, and the runtime check — they cannot drift apart.

Can a single Zod schema drive both extraction and TypeScript types?

Yes — this is the strongest reliability pattern. Define one Zod schema, convert it to JSON Schema (zod-to-json-schema) and send it as jsonSchema in a /v1/scrape request with formats: ["json"]. The engine extracts data matching your spec. Parse the response back through the same Zod schema. One definition then governs the API contract, the TypeScript type, and the runtime validation gate.

Does TypeScript help when a website changes its layout?

Types do not prevent schema drift, but runtime validation catches it. When a site redesigns and a field goes missing, Zod .parse() rejects the response at the extraction boundary — you find out the same hour it breaks, not weeks later. Driving extraction from a JSON schema instead of hand-written selectors also shrinks the brittle DOM coupling surface that can break in the first place.

What does JSON extraction cost in fastCRW credits?

Any request using formats: ["json"] with a jsonSchema is billed as 1 scrape credit plus the managed-LLM token cost for that request (settled from real usage), versus 1 credit for plain markdown. LLM extraction runs on the managed LLM. /v1/extract also accepts a urls array (up to 50 URLs per request) when you need the same schema across several pages. Self-hosting under AGPL-3.0 has no per-request charge. See /pricing for managed-cloud tiers.

What is the best TypeScript type guard pattern for scrape responses?

Prefer Zod .parse() over a hand-written type guard — it gives you a ZodError naming the exact failing field rather than a boolean. Use .safeParse() when you want the error without throwing: const result = Product.safeParse(data); if (!result.success) { logZodError(result.error); return; } const product = result.data;. Treat the edge of an LLM extraction call the same as any external API: validate, then trust.

Recommended next step

Run a live scrape before you commit.

Use the hosted demo to test scrape, crawl, or map output with fastCRW semantics.

Try Playground

Continue exploring

More from Integrations

View all integrations

Previous in Integrations

Node.js Web Scraping API — fastCRW [Firecrawl-Compatible]

Next in Integrations

n8n Web Scraping Integration — fastCRW [Firecrawl-Compatible]

Integrations

MCP Web Scraping Integration — fastCRW [Firecrawl-Compatible]

fastCRW ships an official MCP server (crw-mcp) exposing scrape, search, crawl, map, and extract to any MCP-compatible client. Small single static binary, local-first, self-host free under AGPL-3.0.

mcp web scrapingOfficial crw-mcp server, single npx command

Integrations

Google ADK Web Scraping Integration — fastCRW [Firecrawl-Compatible]

Wire fastCRW into Google's Agent Development Kit as a FunctionTool. Firecrawl-compatible scrape and search, small single static binary, local-first, self-host free under AGPL-3.0.

google adk web scrapingFunctionTool wrappers for fastCRW scrape and search

Integrations

OpenAI Agents SDK Web Scraping Integration — fastCRW [Firecrawl-Compatible]

Give OpenAI Agents SDK agents a fastCRW scrape and search tool with the @function_tool decorator. Small single static binary, local-first, Firecrawl-compatible API, self-host free under AGPL-3.0.

openai agents sdk web scrapingNative function_tool wrappers for scrape and search

Related hubs

Keep the crawl path moving

Docs

Drop into endpoint reference once your integration is wired up.

Use Cases

See where this integration shape fits common AI-agent workloads.

Alternatives

Compare fastCRW against other scraping APIs your stack might consider.