🎬 ViralContent Factory

Autonomous AI-Powered Viral Content Generation Pipeline

Fully automated Reddit story scraping → AI voice synthesis → viral short-form video generation

Features • Architecture • Installation • Usage • Tech Stack

🚀 Overview

ViralContent Factory is an end-to-end automated content generation system that transforms Reddit stories into professionally edited, viral-ready short-form videos for TikTok, YouTube Shorts, and Instagram Reels. The pipeline handles everything from content discovery to final video rendering with zero manual intervention.

💡 What Makes This Special?

🤖 Fully Autonomous: Set it and forget it. The system runs daily via scheduled tasks
🧠 AI-Powered Intelligence: Multi-provider LLM router with automatic failover across 5+ AI services
🎯 Production-Ready: Includes failover systems, cold storage backups, and email alerting
⚡ Optimized Performance: Multi-threaded rendering, smart caching, and resource management
📊 Scalable Architecture: Modular phase-based design for easy extension and maintenance
🔄 Smart LLM Routing: Automatic failover between Groq, Cerebras, Gemini, HuggingFace, and OpenRouter

✨ Features

🔍 Phase 1: Intelligent Content Acquisition

Multi-Source Scraping: Waterfall system across 10+ high-engagement subreddits (AITA, TIFU, TrueOffMyChest, etc.)
Smart Filtering:
- Language detection (English-only)
- Optimal word count (120-200 words for 60-second videos)
- Duplicate prevention via persistent database
- Automatic removal of deleted/removed posts
AI Enhancement:
- Multi-provider LLM router with automatic quota management
- Gender detection for voice matching (fast models)
- Viral hook generation with creative reasoning (strong models)
- Slang/acronym normalization (AITA → "Am I the jerk", etc.)
Failover System: Falls back to local cold storage if all live sources fail
Upload Automation: YouTube and Instagram automation modules (in development)

🎙️ Phase 2: Professional Audio Synthesis

Edge TTS Integration: Microsoft's neural voices for natural-sounding narration
Dynamic Voice Selection: Gender-matched voices (3 female variants, 1 male)
Word-Level Timing: Precise timestamp extraction for perfect subtitle synchronization
Fallback Mechanisms: Sentence-level heuristics if word boundaries fail

🎥 Phase 3: Viral Video Composition

9:16 Vertical Format: Optimized for mobile-first platforms
Dynamic Background Selection: Random gameplay footage (Minecraft, GTA 5)
Animated Subtitles:
- Impact font with stroke for maximum readability
- 2-word chunks with pop-in animations
- Mathematically synced to audio timestamps
Smart Cropping: Automatic center-crop from 16:9 to 9:16
Random Start Points: Prevents repetitive background footage

🤖 LLM Router System

Multi-Provider Architecture: Supports 5 AI providers with automatic failover
Intelligent Task Routing:
- Fast models (Gemini, HuggingFace, OpenRouter) for classification and tagging
- Strong models (Groq, Cerebras) for creative writing and reasoning
Quota Management: Automatically detects rate limits and switches providers
Error Recovery: Retry logic with exponential backoff
Cost Optimization: Routes cheap tasks to free tiers, expensive tasks to premium models

🔧 Production Features

Automated Cleanup: Removes temporary files after each run
Batch Management: Collects 7 videos before triggering upload alert
Email Notifications: Gmail SMTP alerts when batch is ready
Sanitized Filenames: OS-safe naming with ID-based uniqueness
Error Handling: Comprehensive try-catch blocks with detailed logging
Video Path Utilities: Batch processing helpers for upload automation

🏗️ System Architecture

┌─────────────────────────────────────────────────────────────┐
│                    MAIN PIPELINE ORCHESTRATOR                │
│                     (main_pipeline.py)                       │
└────────────┬────────────────────────────────────────────────┘
             │
    ┌────────┴────────┐
    │                 │
    ▼                 ▼
┌─────────┐      ┌─────────┐
│ Phase 1 │──────│ Phase 2 │
│ Scraper │      │  Audio  │
└────┬────┘      └────┬────┘
     │                │
     │                ▼
     │           ┌─────────┐
     │           │ Phase 3 │
     └───────────│  Video  │
                 └────┬────┘
                      │
                      ▼
              ┌───────────────┐
              │  Cleanup &    │
              │  Notification │
              └───────┬───────┘
                      │
                      ▼
              ┌───────────────┐
              │   Upload      │
              │  Automation   │
              └───────────────┘

📁 Project Structure

ViralContent-Factory/
├── 📜 main_pipeline.py      # Orchestrator - coordinates all phases
├── 🔍 phase1.py             # Content acquisition & AI processing
├── 🎙️ phase2.py             # Audio synthesis & timestamp extraction
├── 🎥 phase3.py             # Video composition & rendering
├── 🤖 llm_router.py         # Multi-provider LLM failover system
├── 📥 yt_downloader.py      # Background footage downloader
├── 📧 reminder.py           # Batch management & email alerts
├── 📤 yt_automation.py      # YouTube upload automation
├── 📱 insta_automation.py   # Instagram upload automation (WIP)
├── 🔧 get_videopaths.py     # Video path utility for batch processing
├── ⚙️ run_factory.bat       # Windows Task Scheduler entry point
├── 📦 requirements.txt      # Python dependencies
├── 🗄️ scripts.json          # Persistent story database
├── 🎬 downloads/            # Background video assets
├── 📤 reels/                # Final rendered videos
└── 📦 ready_to_upload/      # Batched videos ready for upload

🛠️ Tech Stack

Category	Technology	Purpose
Language	Python 3.11+	Core runtime
AI/LLM	Multi-Provider Router	Groq, Cerebras, Gemini, HuggingFace, OpenRouter
Voice Synthesis	Edge-TTS	Neural text-to-speech
Video Processing	MoviePy 1.0.3	Compositing & rendering
Image Processing	ImageMagick	Text rendering backend
Web Scraping	Requests	Reddit API interaction
NLP	langdetect	Language filtering
Video Download	yt-dlp	Background footage acquisition
Email	smtplib	Gmail notifications
Environment	python-dotenv	Secure credential management

📦 Installation

Prerequisites

# Required System Dependencies
- Python 3.11 or higher
- FFmpeg (for audio/video processing)
- ImageMagick (for subtitle rendering)
- Deno or Node.js (for yt-dlp)

Step 1: Clone the Repository

git clone https://github.com/indiser/viralcontent-factory.git
cd viralcontent-factory

Step 2: Install Python Dependencies

pip install -r requirements.txt

Step 3: Install System Dependencies

Windows (via winget):

winget install Gyan.FFmpeg
winget install ImageMagick.ImageMagick
winget install DenoLand.Deno

macOS (via Homebrew):

brew install ffmpeg imagemagick deno

Linux (Ubuntu/Debian):

sudo apt update
sudo apt install ffmpeg imagemagick
curl -fsSL https://deno.land/install.sh | sh

Step 4: Configure Environment Variables

Create a .env file in the project root:

# LLM API Keys (at least one required, more = better failover)
GROQ_API_KEY=your_groq_api_key_here
CEREBRAS_API_KEY=your_cerebras_api_key_here
GEMINI_API_KEY=your_gemini_api_key_here
HUGGINGFACE_API_KEY=your_huggingface_api_key_here
OPENROUTER_API_KEY=your_openrouter_api_key_here

# Gmail SMTP (for notifications)
EMAIL_USER=your_email@gmail.com
EMAIL_APP_PASS=your_gmail_app_password

Note: For Gmail, you need to generate an App Password (not your regular password)

LLM Keys: You only need ONE API key to start, but having multiple provides better reliability through automatic failover

Step 5: Download Background Videos

python yt_downloader.py "https://youtube.com/watch?v=MINECRAFT_VIDEO_ID"
python yt_downloader.py "https://youtube.com/watch?v=GTA5_VIDEO_ID"

Or manually place 9:16 or 16:9 gameplay videos in the downloads/ folder.

Step 6: Configure ImageMagick Path (Windows Only)

Edit phase3.py line 5 to match your ImageMagick installation:

os.environ["IMAGEMAGICK_BINARY"] = r"C:\Program Files\ImageMagick-7.1.2-Q16-HDRI\magick.exe"

🎯 Usage

Manual Execution

python main_pipeline.py

Automated Daily Execution (Windows)

Open Task Scheduler
Create a new task:
- Trigger: Daily at 3:00 AM
- Action: Run run_factory.bat
The system will automatically:
- Generate 1 video per day
- Collect 7 videos per week
- Send email alert when batch is ready

Batch Management

python reminder.py

This checks if 7+ videos are ready and moves them to ready_to_upload/ folder.

Get Video Paths for Upload

python get_videopaths.py

Returns absolute paths of all videos in ready_to_upload/ for batch upload scripts.

📊 Workflow Example

1. [03:00 AM] Task Scheduler triggers run_factory.bat
2. [03:00:05] Phase 1 scrapes r/AmItheAsshole
3. [03:00:12] LLM Router tries Groq → generates viral hook
4. [03:00:15] Gender detected: Female → Voice: en-US-AriaNeural
5. [03:00:45] Phase 2 generates audio + word timestamps
6. [03:01:30] Phase 3 renders 60-second vertical video
7. [03:02:00] Cleanup removes temporary files
8. [03:02:05] Reminder script checks inventory (3/7 videos)
9. [Day 7] Email sent: "🟢 FACTORY ALERT: Weekly Batch Ready"
10. [Manual] Run upload automation scripts

🎨 Customization

Add More Subreddits

Edit phase1.py:

SUBREDDITS = [
    "AmItheAsshole",
    "YourNewSubreddit",  # Add here
]

Change Voice Models

Edit phase2.py:

WOMAN_VOICE_LIST = [
    "en-US-JennyNeural",
    "en-GB-SoniaNeural",  # Add British accent
]

Adjust Video Length

Edit phase1.py line 175:

if 120 < len(words) < 200:  # Change word count range

Modify Subtitle Style

Edit phase3.py lines 50-60:

txt_clip = TextClip(
    chunk_text,
    font="Arial",           # Change font
    fontsize=100,           # Increase size
    color="yellow",         # Change color
    stroke_width=8,         # Thicker outline
)

Configure LLM Provider Priority

Edit llm_router.py:

CHEAP_PROVIDERS = [openrouter_chat, hf_chat, gemini_chat]
STRONG_PROVIDERS = [groq_chat, cerebras_chat]

🐛 Troubleshooting

Issue: "ImageMagick not found"

Solution: Update the path in phase3.py line 5 to match your installation

Issue: "No viable stories found"

Solution: The subreddit may have no posts matching criteria. The system will automatically try the next subreddit

Issue: "FFmpeg not found"

Solution: Ensure FFmpeg is in your system PATH. Run ffmpeg -version to verify

Issue: "Email sending failed"

Solution:

Enable 2FA on Gmail
Generate an App Password
Use the App Password in .env, not your regular password

Issue: "All LLM providers failed"

Solution:

Check that at least one API key is valid in .env
Verify API quotas haven't been exceeded
Check internet connection

Issue: "Word boundaries missing"

Solution: The system automatically falls back to sentence-level timing. This is expected behavior for some voices

📈 Performance Metrics

Average Runtime: 2-3 minutes per video
Video Quality: 1080x1920 @ 30fps
Audio Quality: 192kbps MP3
Storage: ~15-25MB per final video
Success Rate: 95%+ (with failover systems)
LLM Failover: <2 seconds between provider switches

🔒 Security & Privacy

✅ No user data collection
✅ API keys stored in .env (gitignored)
✅ Reddit scraping complies with API terms
✅ All content is public domain (Reddit posts)
✅ No personal information in generated videos
✅ Multi-provider LLM routing prevents vendor lock-in

🚧 Roadmap

🤝 Contributing

Contributions are welcome! Please follow these steps:

Fork the repository
Create a feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Reddit API - Content source
Microsoft Edge TTS - Neural voice synthesis
Groq, Cerebras, Gemini, HuggingFace, OpenRouter - LLM infrastructure
MoviePy - Video processing framework
yt-dlp - Video download utility

📞 Contact

Project Link: https://github.com/yourusername/viralcontent-factory

⭐ If this project helped you, please consider giving it a star!

Made with ❤️ and Python

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
LICENSE		LICENSE
Readme.md		Readme.md
hidden_depedencies.txt		hidden_depedencies.txt
insta_automation.py		insta_automation.py
llm_router.py		llm_router.py
main_pipeline.py		main_pipeline.py
phase1.py		phase1.py
phase2.py		phase2.py
phase3.py		phase3.py
reminder.py		reminder.py
requirements.txt		requirements.txt
run_factory.bat		run_factory.bat
yt_automation.py		yt_automation.py
yt_downloader.py		yt_downloader.py

License

indiser/ViralContent-Factory

Folders and files

Latest commit

History

Repository files navigation

🎬 ViralContent Factory

Autonomous AI-Powered Viral Content Generation Pipeline

🚀 Overview

💡 What Makes This Special?

✨ Features

🔍 Phase 1: Intelligent Content Acquisition

🎙️ Phase 2: Professional Audio Synthesis

🎥 Phase 3: Viral Video Composition

🤖 LLM Router System

🔧 Production Features

🏗️ System Architecture

📁 Project Structure

🛠️ Tech Stack

📦 Installation

Prerequisites

Step 1: Clone the Repository

Step 2: Install Python Dependencies

Step 3: Install System Dependencies

Step 4: Configure Environment Variables

Step 5: Download Background Videos

Step 6: Configure ImageMagick Path (Windows Only)

🎯 Usage

Manual Execution

Automated Daily Execution (Windows)

Batch Management

Get Video Paths for Upload

📊 Workflow Example

🎨 Customization

Add More Subreddits

Change Voice Models

Adjust Video Length

Modify Subtitle Style

Configure LLM Provider Priority

🐛 Troubleshooting

Issue: "ImageMagick not found"

Issue: "No viable stories found"

Issue: "FFmpeg not found"

Issue: "Email sending failed"

Issue: "All LLM providers failed"

Issue: "Word boundaries missing"

📈 Performance Metrics

🔒 Security & Privacy

🚧 Roadmap

🤝 Contributing

📄 License

🙏 Acknowledgments

📞 Contact

⭐ If this project helped you, please consider giving it a star!

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages