Integrations/Integration / Flowise

Flowise Web Scraping Integration — fastCRW [Firecrawl-Compatible]

Add fastCRW to Flowise workflows with an HTTP node or custom tool definition. No-code web scraping for LangChain flows, RAG pipelines, and AI agents. 6.6 MB RAM runtime, 92% coverage on the 1,000-URL benchmark.

Published

May 12, 2026

Updated

May 12, 2026

Why Flowise + fastCRW

Flowise is the drag-and-drop interface for building LangChain applications without coding. It turns Claude, GPT, and other LLMs into production AI agents by orchestrating prompts, retrievers, tools, and memory. The missing link is web scraping — Flowise can ingest files and knowledge bases, but fetching live web data requires switching to code or calling external services. fastCRW fits inside Flowise through the HTTP Request node or a custom tool definition, enabling you to scrape, search, and extract web data in a visual workflow. The 6.6 MB RAM fastCRW runtime is lightweight and self-hostable, pairing naturally with self-hosted Flowise deployments.

Setup: HTTP Request Node

Flowise ships with a built-in HTTP Request node. Use it to call fastCRW:

Open Flowise and create or open a flow.
Add an HTTP Request node from the toolbox (search for "http").
Set Method to POST.
Set URL to https://fastcrw.com/api/v1/scrape (or /search, /crawl, /map depending on your use case).
Add a Header:
- Key: Authorization
- Value: Bearer fcrw_YOUR_API_KEY (store the key as a Flowise credential or environment variable)
Set Content-Type to application/json.
In the Body field, add JSON:

{
  "url": "{{ url }}",
  "formats": ["markdown"]
}

Connect an upstream node that provides the url variable (e.g., a Trigger, Text Input, or Code node).

Setup: Custom Tool (Advanced)

For more control, define a Custom Tool in your Flowise flow:

Add a Custom Tool node from the toolbox.
Set Tool Name to fastcrawl_scrape.
Set Tool Description to "Scrape a URL using fastCRW and return markdown content."
Add Tool Input Parameters (JSON):

{
  "url": {
    "type": "string",
    "description": "The URL to scrape"
  },
  "format": {
    "type": "string",
    "description": "Output format: markdown, html, or raw",
    "default": "markdown"
  }
}

Set Tool Function to JavaScript that calls fastCRW:

async function fastcrawl_scrape(url, format = "markdown") {
  const response = await fetch("https://fastcrw.com/api/v1/scrape", {
    method: "POST",
    headers: {
      "Authorization": `Bearer ${process.env.FASTCRW_API_KEY}`,
      "Content-Type": "application/json"
    },
    body: JSON.stringify({
      url: url,
      formats: [format]
    })
  });
  const data = await response.json();
  return data.data[format] || data.data.markdown;
}

Connect the Custom Tool to your LLM or downstream nodes.

Code Example: RAG with Flowise

A complete Flowise flow for RAG (Retrieval-Augmented Generation) with fastCRW:

Flow structure:

Trigger (accepts user input: URL to ingest)
HTTP Request (calls fastCRW scrape, fetches markdown)
Text Splitter (chunks the markdown into 512-token pieces)
Embeddings (generates vectors — use OpenAI, Hugging Face, or local)
Vector Store (Pinecone, Weaviate, Supabase, or in-memory)
Output (confirmation message)

In the HTTP Request node body:

{
  "url": "{{ trigger.url }}",
  "formats": ["markdown"],
  "timeout": 30
}

The HTTP node outputs an object with a data property containing markdown. Connect the output to the Text Splitter and set the input field to {{ http_request.data.markdown }}.

For inference (querying the RAG):

User Input (query text)
Retriever (searches the vector store with the query)
Prompt Template (composes: system prompt + retrieved docs + user query)
LLM (Claude, GPT, etc.)
Output (AI response)

This Flowise flow now retrieves grounded answers from scraped web pages.

Code Example: AI Agent with fastCRW

A Flowise agent that can scrape web pages autonomously:

Flow structure:

Trigger (user message)
Memory (conversation history)
Chat Prompt Template (system prompt for the agent)
LLM (Claude, GPT, etc.)
Agent Tools (define fastCRW scrape as a tool)
Agent Loop (LLM decides when to call tools)
Output (agent's final response)

Define fastCRW as a Custom Tool with:

async function fastcrawl_scrape(url) {
  const response = await fetch("https://fastcrw.com/api/v1/scrape", {
    method: "POST",
    headers: {
      "Authorization": `Bearer ${process.env.FASTCRW_API_KEY}`,
      "Content-Type": "application/json"
    },
    body: JSON.stringify({ url, formats: ["markdown"] })
  });
  const result = await response.json();
  return result.data.markdown;
}

Add this tool to the Agent Tools node. Now when the user asks "What is the pricing on example.com?", the LLM can invoke fastCRW autonomously, scrape the page, and summarize the pricing.

Advanced: Async Crawl with Polling

For large crawls that may timeout:

POST to https://fastcrw.com/api/v1/crawl with "async": true:

{
  "url": "https://example.com",
  "async": true,
  "max_depth": 3,
  "max_pages": 50
}

Capture the response job_id.
Use a Loop node to poll https://fastcrw.com/api/v1/crawl/{job_id} until status is completed.
Fetch the final results and continue your flow.

Flowise doesn't have a native polling loop, so implement this with a Code node and setInterval.

When to Use This

No-code RAG pipelines — scrape web pages, index them, and query them in a visual flow.
AI agent data fetching — give Claude or GPT a fastCRW tool so it can browse the web during conversations.
Dynamic knowledge base — schedule a Flowise flow to periodically scrape and re-index a knowledge base.
Lead enrichment — scrape company websites and feed the data into an LLM for enrichment and scoring.
Competitive intelligence — build an agent that scrapes competitor sites and generates reports.

Troubleshooting

"Bearer token missing" Ensure the HTTP Request node has an Authorization header with value Bearer fcrw_YOUR_API_KEY. If using an environment variable, set FASTCRW_API_KEY in your Flowise .env file or hosting platform's environment.

"Request timeout" The default HTTP timeout in Flowise is 30 seconds. For slower pages, increase the timeout in the HTTP node configuration. Alternatively, use async crawl mode with polling.

"404 on Custom Tool" Make sure the Custom Tool function is syntactically correct and process.env.FASTCRW_API_KEY is available in your Flowise runtime. Self-hosted Flowise may need environment variables explicitly set in the startup script.

"Vector store not indexing results" Verify that the Text Splitter node is receiving markdown from the HTTP Request. Check the flow's Variables inspector to see what data is being passed between nodes. The Vector Store node expects chunked text with metadata.

"Agent not calling the scrape tool" The agent calls tools only if its system prompt mentions them and the LLM decides they're relevant. Make sure the Custom Tool is added to the Agent Tools list and the system prompt says the agent has access to a scraping tool.

When to Choose fastCRW

Speed and lightweight runtime: fastCRW is 5.5x faster and 75x lighter on memory than Firecrawl, making it ideal for self-hosted Flowise.
Firecrawl compatibility: Flowise flows built on Firecrawl's HTTP API port to fastCRW by changing one URL.
Self-hosting: fastCRW's single binary runs on a $5 VPS or Docker container alongside self-hosted Flowise.
Cost: Consumption-based pricing means you pay only for what you scrape, not for idle infrastructure.
MCP support: If you also use Claude Code or other MCP clients, fastCRW provides a unified scraping API across tools.

Sources

Flowise HTTP Request node documentation

https://docs.flowiseai.com/integrations/langchain-tools/http-request

Flowise Custom Tool documentation

https://docs.flowiseai.com/integrations/langchain-tools/custom-tool

fastCRW scrape API docs

/docs/scrape

fastCRW search API docs

/docs/search

FAQ

What's the easiest way to add fastCRW to Flowise?

Use the built-in HTTP Request node. No custom code needed. POST to https://fastcrw.com/api/v1/scrape with your API key as a Bearer token.

Can I build a RAG pipeline with Flowise and fastCRW?

Yes. Chain: HTTP Request (fastCRW) → Text Splitter → Vector Store → Retriever. The HTTP node fetches markdown, the text splitter chunks it, and the vector store indexes it for retrieval.

Does fastCRW work with Flowise AI agents?

Yes. Add fastCRW as a Custom Tool with a scrape or search function. The agent's LLM can invoke the tool during reasoning.

What is a Custom Tool in Flowise?

A Custom Tool is a JavaScript function that wraps an HTTP call or external API. You define the tool's name, description, input parameters, and handler function. The LLM can call it like any LangChain tool.

Is fastCRW compatible with self-hosted Flowise?

Yes. Self-hosted Flowise calls fastCRW's cloud API just like Flowise Cloud. You can also self-host fastCRW and point Flowise at your internal endpoint.

How do I pass dynamic URLs to the HTTP node?

Use Flowise variable syntax: `{{ variable_name }}`. For example, in the HTTP body, use `{{ url }}` and Flowise will substitute the actual URL from upstream nodes.

Recommended next step

Run a live scrape before you commit.

Use the hosted demo to test scrape, crawl, or map output with fastCRW semantics.

Try Playground

Continue exploring

More from Integrations

View all integrations

Previous in Integrations

Cursor Web Scraping Integration — fastCRW [Firecrawl-Compatible]

Next in Integrations

Pydantic AI Web Scraping Integration — fastCRW [Firecrawl-Compatible]

Integrations

Vercel AI SDK Web Scraping Integration — fastCRW [Firecrawl-Compatible]

Register fastCRW as a tool in Vercel AI SDK so generateText and streamText can scrape live web pages. Drop-in alternative to Firecrawl with 6.6 MB RAM runtime and 833 ms average latency on 1,000-URL benchmark.

vercel ai sdk web scrapingRegister fastCRW scrape/crawl/search as native Vercel AI SDK tools via tool() helper

Integrations

LlamaIndex Web Scraping Integration — fastCRW [Firecrawl-Compatible]

Build a custom LlamaIndex web reader that calls fastCRW for live ingestion, or wrap fastCRW scrape/crawl in a LlamaIndex tool for agent workflows. Markdown output feeds directly into embeddings pipelines. 6.6 MB RAM, 92% coverage on 1,000-URL benchmark.

llamaindex web scrapingCustom SimpleWebPageReader-style LlamaIndex reader wrapping fastCRW REST API

Integrations

Dify Web Scraping Integration — fastCRW [Firecrawl-Compatible]

Integrate fastCRW into Dify workflows via HTTP node or native plugin. Call scrape and search endpoints from Dify LLM apps, knowledge bases, and agents. 6.6 MB RAM runtime, 92% coverage on the 1,000-URL benchmark.

dify web scrapingWorks with the standard HTTP Request node — no custom plugin required

Related hubs

Keep the crawl path moving

Docs

Drop into endpoint reference once your integration is wired up.

Use Cases

See where this integration shape fits common AI-agent workloads.

Alternatives

Compare fastCRW against other scraping APIs your stack might consider.