A fully offline Windows AI desktop assistant powered by Ollama + Llama 3.1 8B model.
Built by MrLexCoder 🚀
- 🎤 Voice Input - Offline speech-to-text using faster-whisper
- 🔊 Voice Output - Offline text-to-speech using pyttsx3
- 💬 Modern GUI - Floating chat interface with PyQt6
- 🖥️ Full Computer Control - Apps, files, mouse, keyboard automation
- 🧠 AI-Powered - Llama 3.1 8B model via Ollama (localhost:11434)
- 📊 System Monitoring - CPU, RAM, disk usage tracking
- 🌐 Browser Control - Open URLs, search, navigate
- 📸 Screenshots - Capture and save screenshots
- ⏰ Task Scheduling - Schedule and automate tasks
- 🎯 Human-like Input - Realistic typing with delays
- Windows 10/11 (64-bit)
- 12GB RAM minimum
- Python 3.9+
- 10GB free disk space
# Download from https://ollama.com
ollama pull llama3.1:8b-instruct-q4_K_M# Clone the repository
git clone https://github.com/mrlexcoder/zox-ai.git
cd zox-ai
# Install dependencies
pip install -r requirements.txtpython main.pyOr use the quick launcher:
scripts\run.bat- 📘 Quick Start Guide - Get started in 5 minutes
- 📗 Setup Guide - Detailed installation instructions
- 📙 Commands Reference - Complete command list
- 📕 Project Summary - Technical overview
- 📔 Checklist - Verification steps
- 📓 File Structure - Project organization
- Click "Start Listening" to use voice input
- Type in the chat box for text input
- Zox AI will respond with voice and execute actions
- "Open Chrome and go to YouTube"
- "Create a file called notes.txt with hello world"
- "Take a screenshot"
- "What's my CPU usage?"
- "Type 'Hello World' with human-like speed"
- "Move mouse to center of screen"
- "Set volume to 50%"
┌─────────────────────────────────────────────────────────────┐
│ ZOX AI ARCHITECTURE │
└─────────────────────────────────────────────────────────────┘
┌─────────────────────────────────────────────────────────────┐
│ USER INTERFACE │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Voice Input │ │ Chat GUI │ │ Voice Output │ │
│ │ (Whisper) │ │ (PyQt6) │ │ (pyttsx3) │ │
│ └──────┬───────┘ └──────┬───────┘ └──────▲───────┘ │
└─────────┼──────────────────┼──────────────────┼─────────────┘
│ │ │
└──────────────────┼──────────────────┘
▼
┌─────────────────────────────────────────────────────────────┐
│ AI BRAIN (LLM) │
│ ┌───────────────────────────────────────────────────────┐ │
│ │ Ollama + Llama 3.1 8B (localhost:11434) │ │
│ │ • Understands natural language │ │
│ │ • Generates JSON action plans │ │
│ │ • Fully offline, no API keys │ │
│ └───────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ ACTION EXECUTOR │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ App Control │ │ File Control │ │ Mouse/Keys │ │
│ │ • Open apps │ │ • Create │ │ • Type text │ │
│ │ • Close apps │ │ • Read │ │ • Move mouse │ │
│ │ • Focus │ │ • Delete │ │ • Click │ │
│ └──────────────┘ └──────────────┘ └──────────────┘ │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Browser │ │ System │ │ Scheduler │ │
│ │ • Open URLs │ │ • Volume │ │ • Schedule │ │
│ │ • Search │ │ • Brightness │ │ • Recurring │ │
│ │ • Navigate │ │ • Screenshot │ │ • Reminders │ │
│ └──────────────┘ └──────────────┘ └──────────────┘ │
└─────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ WINDOWS SYSTEM │
│ • Applications • Files • Mouse • Keyboard • Settings │
└─────────────────────────────────────────────────────────────┘
zox-ai/
├── 📄 main.py # Main application entry point
├── 📄 config.py # Configuration settings
├── 📄 test_components.py # Component testing suite
├── 📄 requirements.txt # Python dependencies
├── 📄 LICENSE # MIT License
├── 📄 README.md # This file
│
├── 📁 core/ # Core functionality
│ ├── llm.py # Ollama LLM interface
│ ├── voice_input.py # Speech-to-text (Whisper)
│ └── voice_output.py # Text-to-speech (pyttsx3)
│
├── 📁 actions/ # Action handlers
│ ├── app_control.py # Application management
│ ├── file_control.py # File operations
│ ├── mouse_keyboard.py # Input control
│ ├── browser_control.py # Browser automation
│ ├── system_control.py # System control
│ └── scheduler.py # Task scheduling
│
├── 📁 docs/ # Documentation
│ ├── QUICKSTART.md # 5-minute setup guide
│ ├── SETUP_GUIDE.md # Detailed installation
│ ├── COMMANDS.md # Command reference
│ ├── PROJECT_SUMMARY.md # Technical overview
│ ├── CHECKLIST.md # Verification checklist
│ ├── FILE_STRUCTURE.md # Project organization
│ └── BUILD_COMPLETE.md # Build summary
│
└── 📁 scripts/ # Utility scripts
├── install.bat # Installation script
└── run.bat # Quick launcher
| Layer | Technology | Purpose |
|---|---|---|
| Voice Input | faster-whisper | Offline speech-to-text (CPU) |
| AI Brain | Ollama + Llama 3.1 8B | Natural language understanding |
| Voice Output | pyttsx3 | Offline text-to-speech |
| GUI | PyQt6 | Modern chat interface |
| Automation | pyautogui | Mouse/keyboard control |
| System | psutil, pycaw | System monitoring & control |
| Browser | webbrowser | URL opening & navigation |
- ✅ 100% Offline - No internet required after setup
- ✅ No Cloud Services - Everything runs locally
- ✅ No API Keys - No external dependencies
- ✅ No Data Collection - Your data stays on your machine
- ✅ Open Source - All code is visible and auditable
- RAM Usage: ~7GB (within 12GB system)
- Response Time: 1-3 seconds per command
- Startup Time: ~5 seconds (after Ollama warm-up)
- Model Size: ~5GB (Llama 3.1 8B Q4)
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature) - Commit your changes (
git commit -m 'Add some AmazingFeature') - Push to the branch (
git push origin feature/AmazingFeature) - Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Ollama - Local LLM runtime
- Meta - Llama 3.1 model
- OpenAI - Whisper model
- PyQt6 - GUI framework
- All open-source contributors
- 📧 Email: mrlexcder@gmail.com
- 🐛 Issues: GitHub Issues
- 📖 Docs: Documentation
If you find Zox AI useful, please consider giving it a star! ⭐
Built with ❤️ by MrLexCoder
Your personal AI assistant, completely offline, completely yours.