ComfyUI Image Editing + Gemini - WebUI - Cloud serverless GPU/VPS

AI-powered image editing service using Qwen-based workflows on RunPod and Google Gemini for prompt enhancement.

�️ Interfaces

Interface	Description	Status
Image Editor	Main Interface. Interactive AI image editing using Qwen workflow.	✅ Production
Content Pipeline	"Conveyor Belt" prototype for batch content creation (Idea -> Video).	🚧 Prototype
Studio Dashboard	Classic chat & generate interface with Gemini integration.	✅ Stable

📸 Screenshots

538451299-5e5f7d70-7c20-4ae3-90d6-98ad74d9b6ea

✨ Operations

AI Editing: Replace/modify objects in images using natural language.
Prompt Engineering: Use Gemini to expand simple ideas into professional prompts.
Cloud Backend: Serverless GPU execution via RunPod (no local GPU required).
Production Ready: Built-in rate limiting, logging, and PM2 support.

🚀 Quick Start

Install Dependencies
```
npm install
```

Configure Environment

cp .env.example .env
# Set GEMINI_API_KEY, RUNPOD_API_KEY, RUNPOD_ENDPOINT_ID

Run Locallly

npm run dev
# Open http://localhost:3000

📦 Deployment (VPS)

Setup PM2: npm install -g pm2
Start Server: pm2 start ecosystem.config.js
Docs: See DEPLOYMENT.md for full Nginx/HTTPS setup.

� Configuration

Required .env Variables:

GEMINI_API_KEY: Google AI Studio key.
RUNPOD_API_KEY: API Key from RunPod.
RUNPOD_ENDPOINT_ID: Endpoint ID for your deployed ComfyUI workflow.

Default Port: 3000 (Set PORT to change)

🏗️ Architecture

graph TD
    subgraph Client
        UI[Web UI / Browser]
        Ed[Image Editor]
    end

    subgraph Server_Layer
        API[Express Server]
        Auth[Auth Middleware]
        Job[Job Manager]
    end

    subgraph AI_Services
        Gemini[Google Gemini API]
        RunPod[RunPod Serverless GPU]
    end

    UI --> |/api/generate| API
    UI --> |/api/chat| API
    
    API --> Auth
    Auth --> Job
    
    Job --> |Enhance Prompt| Gemini
    Job --> |Generate/Edit| RunPod
    
    RunPod --> |Images| Job
    Gemini --> |Text| Job
    
    Job --> |JSON Response| UI

🔄 Workflow: Image Editing

sequenceDiagram
    participant User
    participant WebUI
    participant Server
    participant Gemini as Gemini AI
    participant RunPod as ComfyUI (RunPod)

    User->>WebUI: Enters Prompt & Uploads Image
    WebUI->>Server: POST /api/generate (edit)
    
    par Prompt Enhancement
        Server->>Gemini: Enhance Prompt
        Gemini-->>Server: Optimized Prompt
    end
    
    Server->>RunPod: Submit Job (Image + Prompt)
    RunPod-->>Server: Job ID
    Server-->>WebUI: Job ID (Pending)
    
    loop Polling
        WebUI->>Server: GET /api/status/:id
        Server->>RunPod: Check Status
        RunPod-->>Server: Status / Result
        Server-->>WebUI: Status update
    end
    
    RunPod-->>Server: Final Image
    Server-->>WebUI: Display Result

📝 License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
custom_node_dist		custom_node_dist
input		input
public		public
src		src
.env.example		.env.example
.gitignore		.gitignore
DEPLOYMENT.md		DEPLOYMENT.md
Dockerfile		Dockerfile
README.md		README.md
ecosystem.config.js		ecosystem.config.js
example-request.json		example-request.json
key-limits.json		key-limits.json
nginx.conf		nginx.conf
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
workflow-image-edit.json		workflow-image-edit.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ComfyUI Image Editing + Gemini - WebUI - Cloud serverless GPU/VPS

�️ Interfaces

📸 Screenshots

✨ Operations

🚀 Quick Start

📦 Deployment (VPS)

� Configuration

🏗️ Architecture

🔄 Workflow: Image Editing

📝 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ComfyUI Image Editing + Gemini - WebUI - Cloud serverless GPU/VPS

�️ Interfaces

📸 Screenshots

✨ Operations

🚀 Quick Start

📦 Deployment (VPS)

� Configuration

🏗️ Architecture

🔄 Workflow: Image Editing

📝 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages