infiniV/VoiceFlow

VoiceFlow

Hold a hotkey. Speak. Release. The transcript pastes itself at your cursor.

VoiceFlow dashboard

Local Whisper dictation for Windows and Linux. No account, no cloud, no monthly bill.
macOS builds and runs but isn't officially supported yet.

Download for Windows · Download for Linux · Website · MIT License

Latest: v1.6.0-rc1 (pre-release) · all releases


What it does

VoiceFlow lives in your system tray. Hold a global hotkey and a small popup appears with a live amplitude meter; you talk, you release, and the transcript is typed at the cursor. That's it.

Inference runs on your machine through faster-whisper: CUDA when you have it, CPU when you don't. The audio never touches a network socket.
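The CUDA-or-CPU fallback can be sketched roughly like this. This is a minimal illustration, not VoiceFlow's actual code; `pick_device` is a hypothetical helper built on the ctranslate2 runtime that faster-whisper uses under the hood:

```python
def pick_device() -> str:
    """Return "cuda" if a CUDA-capable GPU is visible, else "cpu"."""
    try:
        import ctranslate2  # faster-whisper's inference runtime
        if ctranslate2.get_cuda_device_count() > 0:
            return "cuda"
    except ImportError:
        pass  # runtime not installed here; CPU is the safe default
    return "cpu"

# With faster-whisper installed, loading a model would then look like:
#   from faster_whisper import WhisperModel
#   model = WhisperModel("small", device=pick_device())
#   segments, info = model.transcribe("clip.wav")
```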

Features

  • Fully local. Audio stays in RAM. No telemetry, no analytics, no phone-home.
  • 16+ Whisper models. Tiny (75 MB) through Large-v3 (3 GB), plus Turbo, distilled, and .en variants. The picker shows speed, accuracy, parameter count, and disk size for each.
  • CUDA when available. Auto-detects your GPU, falls back to CPU.
  • Hold or Toggle modes. Configurable hotkeys including modifier-only combos like Ctrl+Win.
  • Wayland and X11. Native evdev input on Linux, Hyprland window rules, wl-copy and wtype/ydotool for paste.
  • 99+ languages. Whisper handles language detection automatically.
  • Searchable history. SQLite log of every transcript, stored at ~/.VoiceFlow/.
  • Dark mode by default. Light and system themes if you want them.
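Because the history is plain SQLite, you can query it directly. The schema below is a guess for illustration only; VoiceFlow's real table layout may differ:

```python
import sqlite3

# Hypothetical schema for illustration; the real layout in
# ~/.VoiceFlow/VoiceFlow.db may differ.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE transcripts (id INTEGER PRIMARY KEY, created_at TEXT, text TEXT)"
)
conn.execute(
    "INSERT INTO transcripts (created_at, text) VALUES (?, ?)",
    ("2025-01-01T09:00:00", "ship the release notes today"),
)

# Substring search over the log, newest first.
rows = conn.execute(
    "SELECT created_at, text FROM transcripts "
    "WHERE text LIKE ? ORDER BY created_at DESC",
    ("%release%",),
).fetchall()
print(rows)  # [('2025-01-01T09:00:00', 'ship the release notes today')]
```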

Meetings (experimental)

New in v1.6.0-rc1. Long-form recording that captures mic input plus system audio (Zoom, Meet, anything that plays through your speakers) into one stereo file, transcribes it locally, and lets you bring your own LLM for the summary.

Meeting detail with transcript, summary, and audio player

  • Pause, resume, and stop from the dashboard or the tray menu.
  • Re-transcribe any saved recording with a different model, device, or language without re-recording.
  • Bring your own LLM provider: OpenAI, Groq, OpenRouter, Ollama, or any OpenAI-compatible endpoint. API keys are stored in your OS keychain.
  • Export to Markdown, plain text, SRT, or structured JSON.
  • Auto-rename from a default timestamp to a real topic once the transcript is in.
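As an example of what the SRT export produces, timed segments map to numbered cues with comma-separated milliseconds. This is a generic sketch of the format, not VoiceFlow's exporter:

```python
def to_srt(segments):
    """Render (start_seconds, end_seconds, text) tuples as an SRT document."""
    def stamp(t: float) -> str:
        h, rem = divmod(int(t), 3600)
        m, s = divmod(rem, 60)
        ms = int(round((t - int(t)) * 1000))
        return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

    cues = []
    for i, (start, end, text) in enumerate(segments, 1):
        cues.append(f"{i}\n{stamp(start)} --> {stamp(end)}\n{text}\n")
    return "\n".join(cues)

print(to_srt([(0.0, 2.5, "Welcome to the standup."),
              (2.5, 5.0, "First item: release v1.6.")]))
# 1
# 00:00:00,000 --> 00:00:02,500
# Welcome to the standup.
#
# 2
# 00:00:02,500 --> 00:00:05,000
# First item: release v1.6.
```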

Recording, transcription, search, and storage stay local. The only network call is the optional summary request, and you can skip it, point it at a local Ollama, or send it to a provider you already pay for.
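"Any OpenAI-compatible endpoint" means the summary request is an ordinary chat-completions POST. The sketch below just assembles the JSON body; the model name and the Ollama URL in the comment are illustrative assumptions, not VoiceFlow's defaults:

```python
import json

def build_summary_request(transcript: str, model: str = "llama3") -> dict:
    """Assemble an OpenAI-compatible /v1/chat/completions request body."""
    return {
        "model": model,
        "messages": [
            {"role": "system",
             "content": "Summarize this meeting transcript in bullet points."},
            {"role": "user", "content": transcript},
        ],
    }

body = build_summary_request("Alice: let's ship Friday. Bob: agreed.")
# POST json.dumps(body) to your provider, e.g. a local Ollama at
# http://localhost:11434/v1/chat/completions, with your key from the OS keychain.
print(json.dumps(body)[:30])
```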

VoiceFlow vs cloud dictation

                   VoiceFlow   Cloud services
Cost               $0          ~$10–15/month
Where audio goes   Your RAM    Their servers
Works offline      Yes         No
Account required   No          Yes
License            MIT         Closed

Install

Grab the latest binary from Releases — currently v1.6.0-rc1 (pre-release):

  • Windows 10/11: .exe installer (Inno Setup)
  • Linux: .AppImage or .tar.gz

64-bit only. First launch walks you through a seven-step setup wizard covering microphone, compute device, Whisper model download, and hotkey. If you delete the model later, a recovery dialog lets you re-download it or pick a different one.

Build from source

git clone https://github.com/infiniV/VoiceFlow.git
cd VoiceFlow
pnpm run setup        # installs Node and Python deps
pnpm run dev          # Vite frontend + Pyloid backend

Platform installers (run on the matching OS):

pnpm run build:installer          # Windows (.exe via Inno Setup)
pnpm run build:installer:linux    # Linux (.AppImage and .tar.gz)
pnpm run build:installer:macos    # macOS (.dmg, unsupported)

Stack

Layer       Tech
Shell       Pyloid (PySide6 + Qt WebEngine)
Inference   faster-whisper (CTranslate2)
Frontend    React 18, Vite, Tailwind v4, shadcn/ui
Storage     SQLite at ~/.VoiceFlow/VoiceFlow.db

License

MIT. See LICENSE.

Releases · Issues · Website