GitHub - zuhabul/Fetchium: Rust-native universal retrieval layer for humans and AI agents — search, extract, rank, and synthesize the web. CLI + REST API + MCP server, with adaptive extraction (CEP), token-budgeted packing, multi-signal ranking, and cited research.

███████╗███████╗████████╗ ██████╗██╗  ██╗██╗██╗   ██╗███╗   ███╗
██╔════╝██╔════╝╚══██╔══╝██╔════╝██║  ██║██║██║   ██║████╗ ████║
█████╗  █████╗     ██║   ██║     ███████║██║██║   ██║██╔████╔██║
██╔══╝  ██╔══╝     ██║   ██║     ██╔══██║██║██║   ██║██║╚██╔╝██║
██║     ███████╗   ██║   ╚██████╗██║  ██║██║╚██████╔╝██║ ╚═╝ ██║
╚═╝     ╚══════╝   ╚═╝    ╚═════╝╚═╝  ╚═╝╚═╝ ╚═════╝ ╚═╝     ╚═╝

The universal retrieval layer for humans and AI agents

Rust-native search, extraction, ranking, and synthesis — delivered as a CLI, a REST API, and an MCP server.

Install · Architecture & innovations · Quick start · For AI agents · Docs

What is Fetchium?

Most "search" tools hand an LLM a wall of raw HTML and hope for the best. Fetchium is built for the opposite: it finds, fetches, extracts, ranks, validates, and packages information so that the output is clean, cited, and token-efficient — whether the consumer is a human at a terminal or an AI agent over the network.

It runs as a single Rust binary with no required runtime dependencies and exposes one engine three ways:

CLI — fetchium search, fetch, summarize, compare, research, and more.
REST API (fetchium-api) — the engine over HTTP for any language.
MCP server (fetchium-mcp) — a first-class tool for AI agents (Codex, Claude, any MCP client).

Why Fetchium

Token-efficient by design — extraction and packaging are query-aware, so agents spend context on signal, not boilerplate.
Evidence-first — results carry sources and citations, with a validation pass before output.
Adaptive — extraction and ranking escalate only as far as a query needs, keeping latency and cost down.
Agent-native — MCP + REST + LangChain/CrewAI adapters, not an afterthought.
Resilient — per-backend circuit breakers and bulkheads keep one slow/broken source from sinking a query.

How it compares

	Naive retrieval (raw HTML → LLM)	Fetchium
Extraction	dump full HTML, hope the model copes	adaptive 5-layer CEP cascade + QADD DOM distillation
Token cost	pays for nav, ads, scripts, boilerplate	query-aware QATBE/SCS packing into a fixed budget
Ranking	first result / single relevance score	HyperFusion — 8 fused signals (relevance, trust, recency, diversity…)
Trust	unattributed text	evidence validation + source citations
Depth control	all-or-nothing	PDS tiers: `key_facts` → `summary` → `detailed` → `complete`
Research	one query, one shot	AMRS swarm: decompose → search → synthesize → verify
Reliability	one bad source breaks the run	circuit breakers + bulkheads + SimHash dedup
Agent access	bespoke glue per agent	native MCP + REST + LangChain/CrewAI adapters

Architecture & innovations

Fetchium's engine (fetchium-core) is a pipeline of purpose-built components. The novel pieces:

CEP — Content Extraction Protocol (5-layer adaptive cascade)

Extraction escalates layer by layer and stops as soon as the content is good enough, so the cheap path handles the common case and expensive paths run only when needed.

Layer	Technique	Handles
1	HTML + CSS selectors (`scraper`)	~85% of pages
2	Streaming HTML rewriter (`lol_html`)	enhanced boilerplate removal
3	Headless JS rendering	SPAs / dynamic content (`headless` feature)
4	PDF / document extraction	PDF, DOCX, RTF
5	Screenshot OCR	image-heavy / canvas content (`headless` feature)

A learned predictor (cep_predictor) chooses where to start instead of always running Layer 1.

QADD — Query-Aware DOM Distillation

Prunes the DOM against the query before extraction, dropping nav/ads/chrome to cut tokens dramatically (design target ~10–20×) while keeping query-relevant content.

QATBE + SCS — token-budgeted, semantically segmented extraction

SCS splits content into typed semantic segments; QATBE (Query-Aware Token-Budgeted Extraction) scores segments with BM25 and packs the best ones into a fixed token budget (greedy knapsack) — you get the most relevant content that fits, not an arbitrary truncation.

PDS — Progressive Detail Streaming

The same result can be served at four tiers — key_facts (~200 tok) → summary (~1k) → detailed (~5k) → complete — so callers request exactly the depth they need.

HyperFusion — 8-signal ranking

Final ranking fuses eight independent signals rather than a single relevance score: BM25, semantic, temporal, authority/trust, evidence, diversity, quality, and consensus/cluster (rank/{bm25,semantic,temporal,trust,evidence,diversity,quality,cluster}.rs).

AMRS — Adaptive Multi-Agent Research Swarm

fetchium research decomposes a question and runs a swarm of cooperating agents (research/amrs/) — decompose → search → synthesize → verify — to produce a cited report.

Evidence, validation & citations

Results flow through a validation stage and a citation layer (validate/, citation/, rank/evidence.rs) so claims are backed by sources before they reach the output.

PIE — Persistent Intelligence Engine

Cross-session learning (source trust, failure patterns, query prediction) persisted locally, so the engine improves with use instead of starting cold each time.

RAR — Retry-and-Refine

A multi-checkpoint self-correction loop that detects weak results and refines the query/extraction rather than returning a poor answer.

Resilience

Every backend call is wrapped in a circuit breaker and bulkhead (resilience/), with SimHash-based de-duplication across federated backends — one degraded source can't stall a query.

Pipeline

            ┌──────────── research/ (AMRS) ────────────┐
            ▼                                           │
search/ → extract/ (CEP + QADD) → token/ (QATBE/SCS) → rank/ (HyperFusion) → validate/ → citation/ → output/ (PDS)
            ▲                                                                                   │
            └───────────────── cache/ · index/ · intelligence/ (PIE) ───────────────────────────┘

Installation

Requires Rust 1.75+ for the Cargo/source methods.

# Shell installer (Linux + macOS — all architectures)
curl -sSfL https://install.fetchium.com | sh

# Cargo
cargo install fetchium-cli

# npm / npx
npm install -g fetchium-cli
npx fetchium-cli --help

# Homebrew
brew install zuhabul/fetchium/fetchium

# Python adapters
pip install fetchium-langchain   # LangChain retriever
pip install fetchium-crewai      # CrewAI tool

# Build from source
git clone https://github.com/zuhabul/Fetchium && cd Fetchium
cargo build -p fetchium-cli --release

Then run fetchium doctor to check optional tools.

Quick start

# ── Core retrieval ────────────────────────────────────────────────────────────
fetchium search "best rust async runtimes"           # federated web search (17+ backends)
fetchium fetch https://example.com                   # fetch + clean extraction (CEP)
fetchium research "impact of LLMs on engineering"    # multi-step cited report (AMRS)
fetchium compare "rust vs go vs python"              # structured side-by-side comparison
fetchium summarize https://example.com               # AI summarization of a URL or text
fetchium ai "what causes the northern lights?"       # grounded answer (needs AI provider)
fetchium deep "history of Byzantine Empire"          # deep multi-agent research (Mode E)

# ── YouTube intelligence ──────────────────────────────────────────────────────
fetchium youtube search "rust programming tutorial"  # search YouTube
fetchium youtube analyze https://youtube.com/watch?v=dQw4w9WgXcQ  # analyze video
fetchium youtube transcript https://youtube.com/watch?v=dQw4w9WgXcQ  # extract transcript
fetchium transcribe https://youtube.com/watch?v=dQw4w9WgXcQ  # transcribe any audio/video

# ── Social media intelligence ─────────────────────────────────────────────────
fetchium social "AI regulation news"                 # unified search across all platforms
fetchium reddit search "mechanical keyboards"        # Reddit posts + sentiment
fetchium twitter search "Rust lang"                  # X/Twitter (via Nitter)
fetchium hackernews search "open source tools"       # Hacker News
fetchium tiktok search "programming tips"            # TikTok trends

# ── Productivity / monitoring ─────────────────────────────────────────────────
fetchium monitor add https://example.com             # watch URL for content changes
fetchium monitor check                               # check all monitored URLs now
fetchium digest "AI weekly"                          # generate a research digest
fetchium radar                                       # personalized research radar from history

# ── API / server ──────────────────────────────────────────────────────────────
fetchium serve                                       # start REST API (port 3000)
fetchium serve --mode mcp                            # start MCP server (stdio)
fetchium tui                                         # interactive terminal UI

Full command reference: docs/guide/commands.md.

For AI agents

MCP server (fetchium-mcp) exposes retrieval as Model Context Protocol tools for Codex, Claude, and other MCP clients.
REST API (fetchium-api) serves the same engine over HTTP — fetchium serve.
Adapters for LangChain and CrewAI live in adapters/.

See docs/guide/agent-integration.md.

Configuration

Configuration lives in ~/.fetchium/config.toml (with env-var overrides). API keys you provide are stored locally and never committed. Run fetchium doctor to verify provider/tool setup. Optional integrations: an AI provider (e.g. Ollama) for ai/research/deep, and Chromium for CEP Layers 3/5. Details: docs/guide/configuration.md.

Workspace layout

Crate	Role
`fetchium-core`	The engine: search, extract (CEP/QADD), rank (HyperFusion), validate, research (AMRS), cache, intelligence (PIE)
`fetchium-cli`	The `fetchium` command-line binary
`fetchium-mcp`	Model Context Protocol server
`fetchium-api`	REST API server

Deeper notes in docs/architecture/; the full design spec is in prd.md.

Contributing

Contributions are welcome — see CONTRIBUTING.md for setup and the checks CI runs, and the Code of Conduct. Report security issues per SECURITY.md.

License

Licensed under either of MIT (LICENSE-MIT) or Apache-2.0 (LICENSE-APACHE) at your option.

This dual license is the Rust ecosystem standard (Rust itself and most crates use it). Two files are included because each license has its own canonical text: MIT is short and maximally permissive, while Apache-2.0 adds an explicit patent grant that some organizations require. "At your option" means any user may choose whichever terms suit them — maximizing compatibility.

Unless you state otherwise, any contribution you submit shall be dual-licensed as above, without additional terms.

Name		Name	Last commit message	Last commit date
Latest commit History 90 Commits
.cargo		.cargo
.claude		.claude
.githooks		.githooks
.github/workflows		.github/workflows
adapters		adapters
apps		apps
benches		benches
crates		crates
docs		docs
fuzz		fuzz
infra		infra
issues		issues
packages/npm		packages/npm
scripts		scripts
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.npmrc		.npmrc
.release-please-manifest.json		.release-please-manifest.json
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Cross.toml		Cross.toml
Jenkinsfile		Jenkinsfile
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
README.md		README.md
SECURITY.md		SECURITY.md
dist-workspace.toml		dist-workspace.toml
package.json		package.json
prd.md		prd.md
release-please-config.json		release-please-config.json
rust-toolchain.toml		rust-toolchain.toml
turbo.json		turbo.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The universal retrieval layer for humans and AI agents

What is Fetchium?

Why Fetchium

How it compares

Architecture & innovations

CEP — Content Extraction Protocol (5-layer adaptive cascade)

QADD — Query-Aware DOM Distillation

QATBE + SCS — token-budgeted, semantically segmented extraction

PDS — Progressive Detail Streaming

HyperFusion — 8-signal ranking

AMRS — Adaptive Multi-Agent Research Swarm

Evidence, validation & citations

PIE — Persistent Intelligence Engine

RAR — Retry-and-Refine

Resilience

Pipeline

Installation

Quick start

For AI agents

Configuration

Workspace layout

Contributing

License

About

Licenses found

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

The universal retrieval layer for humans and AI agents

What is Fetchium?

Why Fetchium

How it compares

Architecture & innovations

CEP — Content Extraction Protocol (5-layer adaptive cascade)

QADD — Query-Aware DOM Distillation

QATBE + SCS — token-budgeted, semantically segmented extraction

PDS — Progressive Detail Streaming

HyperFusion — 8-signal ranking

AMRS — Adaptive Multi-Agent Research Swarm

Evidence, validation & citations

PIE — Persistent Intelligence Engine

RAR — Retry-and-Refine

Resilience

Pipeline

Installation

Quick start

For AI agents

Configuration

Workspace layout

Contributing

License

About

Topics

Resources

License

Licenses found

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages