Why voice dictation on Linux was historically hard
Until recently, Linux users had few real options for voice dictation:
- Google Chrome's Web Speech API — browser-only, requires internet, no system-wide injection
- Julius / CMU Sphinx — open-source but outdated accuracy, complex setup, no continuous improvement
- Kaldi-based tools — research-grade accuracy but no polished user interface
- Cloud services (Google, AWS) — excellent accuracy but audio leaves your device
The commercial leaders (Superwhisper, Dragon, Wispr Flow) simply didn't support Linux. Linux users were left with clunky, manual, or cloud-dependent solutions.
How Whisper AI changed Linux voice dictation
OpenAI's open-source Whisper model changed the equation. First released in September 2022 and improved through Whisper Large V3 in November 2023, Whisper delivers:
- Near-human accuracy — competitive with cloud services even for accented speech
- 100+ language support — multilingual transcription from a single model
- CPU-runnable — no GPU required, works on any modern Linux machine
- Fully open-source — can be run completely offline, self-hosted, no licensing fees
The result: a wave of Linux-compatible voice dictation apps built on top of Whisper, ranging from polished paid apps to developer-focused open-source tools.
Full comparison: best Linux voice dictation apps in 2026
| App | Offline | System-wide injection | Filler word removal | AI routing | Setup difficulty | Price |
|---|---|---|---|---|---|---|
| AirTypes | ✅ 100% | ✅ Yes | ✅ Yes | ✅ My Agent | ⭐ Install & go | $3.99/mo |
| Vocalinux | ✅ 100% | ✅ Yes | ❌ No | ❌ No | ⭐⭐⭐ Manual config | Free |
| OpenWhispr | ✅ 100% | ✅ Yes | ❌ No | ❌ No | ⭐⭐⭐ Terminal setup | Free |
| Nerd Dictation | ✅ 100% | ✅ Yes | ❌ No | ❌ No | ⭐⭐⭐⭐ Script-based | Free |
| Whisper.cpp + xdotool | ✅ 100% | ✅ Yes | ❌ No | ❌ No | ⭐⭐⭐⭐⭐ DIY only | Free |
| Speechnotes (browser) | ❌ Cloud | ❌ Browser only | ❌ No | ❌ No | ⭐ Open in browser | Free / paid |
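To give a sense of what the "DIY only" row involves, here is a minimal sketch of the glue a whisper.cpp + xdotool setup needs: record a short clip, transcribe it, and type the result into the focused window. The binary name `whisper-cli` (older whisper.cpp builds call it `main`), the model path, and the fixed five-second recording are assumptions for illustration, not a documented recipe. xdotool only works under X11; on Wayland a tool such as wtype or ydotool would have to take its place.

```python
import subprocess
import tempfile
from pathlib import Path

# Assumptions for illustration: adjust to match your own whisper.cpp build.
WHISPER_BIN = "whisper-cli"             # older builds name this binary "main"
MODEL_PATH = "models/ggml-base.en.bin"  # hypothetical path to a downloaded model


def record_cmd(wav_path: str, seconds: int = 5) -> list[str]:
    """arecord command for 16 kHz, mono, 16-bit audio (what whisper.cpp expects)."""
    return ["arecord", "-q", "-f", "S16_LE", "-r", "16000", "-c", "1",
            "-d", str(seconds), wav_path]


def transcribe_cmd(wav_path: str) -> list[str]:
    """whisper.cpp command that prints the transcript without timestamps."""
    return [WHISPER_BIN, "-m", MODEL_PATH, "-f", wav_path, "-nt"]


def type_cmd(text: str) -> list[str]:
    """xdotool command that types text into the focused X11 window."""
    # "--" ends option parsing, so dictated text that starts with "-" is safe.
    return ["xdotool", "type", "--clearmodifiers", "--", text]


def clean_transcript(raw: str) -> str:
    """Collapse line breaks and padding in the raw output into a single line."""
    return " ".join(raw.split())


def dictate_once(seconds: int = 5) -> str:
    """Record, transcribe, and type one clip. Requires arecord, whisper.cpp, xdotool."""
    with tempfile.TemporaryDirectory() as tmp:
        wav = str(Path(tmp) / "clip.wav")
        subprocess.run(record_cmd(wav, seconds), check=True)
        result = subprocess.run(transcribe_cmd(wav), check=True,
                                capture_output=True, text=True)
    text = clean_transcript(result.stdout)
    if text:
        subprocess.run(type_cmd(text), check=True)
    return text


if __name__ == "__main__":
    print(dictate_once())
```

Binding `dictate_once` to a desktop keyboard shortcut gets you a rough push-to-dictate flow. Hold-to-record, model management, and cleanup of the text are all left to you, which is what the five-star difficulty rating reflects.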
AirTypes — best polished offline voice dictation for Linux
AirTypes is the only actively developed, paid, offline voice dictation app that currently ships on Linux. It fills the gap left by Superwhisper and Wispr Flow: a polished, maintained tool you don't have to configure manually.
What you get
- Global hotkey (Ctrl+Shift+Space by default) — hold to record, release to transcribe and inject
- 6 Whisper model tiers — from Swift (~40 MB, <1 second) to Deep (Large V3, ~1.6 GB, maximum accuracy)
- Filler word removal — strips um, uh, like, you know automatically
- Writing tones — Formal, Professional, Neutral, Casual, Friendly
- 100+ language support via Whisper's multilingual models
- My Agent — voice to BYOK AI, response typed at cursor (GPT-4, Claude, Ollama, any OpenAI-compatible)
- Transcription history — searchable log of all past dictations
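For readers curious what filler word removal involves, the sketch below shows one naive, rule-based approach. It is not AirTypes' implementation, which is not documented here. The function name and the rules are assumptions: "um", "uh", and "er" are always dropped, while "like" and "you know" are dropped only when set off by commas, because they are often real words.

```python
import re

# Always-removable hesitation sounds, with an optional comma on either side.
_HESITATIONS = re.compile(r"(?:,\s*)?\b(?:um+|uh+|erm?)\b,?\s*", re.IGNORECASE)
# Ambiguous fillers: only removed when they appear between commas.
_COMMA_FILLERS = re.compile(r",\s*(?:like|you know)\s*,", re.IGNORECASE)


def strip_fillers(text: str) -> str:
    """Remove common spoken fillers from a transcript (naive, English-only)."""
    text = _COMMA_FILLERS.sub(",", text)        # "think, like, we" -> "think, we"
    text = _HESITATIONS.sub(" ", text)          # drop "um", "uh", "er", "erm"
    text = re.sub(r"\s+([,.!?])", r"\1", text)  # no space before punctuation
    text = re.sub(r"\s{2,}", " ", text).strip() # collapse leftover whitespace
    return text[:1].upper() + text[1:]          # re-capitalize the first letter


if __name__ == "__main__":
    print(strip_fillers("Um, so I think, like, we should uh ship it."))
```

A rule list like this will miss fillers that are not comma-delimited and cannot tell "I like this" from a filler "like" without the comma rule, which is why production tools typically lean on a language model or more context for this step.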
System requirements (Linux)
- Ubuntu 20.04+, Fedora 36+, Arch Linux, or equivalent
- 4 GB RAM minimum (8 GB recommended for Large models)
- x86_64 architecture
- No GPU required — runs entirely on CPU
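If you want to check a machine against these requirements before installing, a small script can read the same information the kernel exposes. The sketch below is an illustration only: the function names are made up, and the 10% slack is an assumption added because a machine sold with "4 GB" reports slightly less usable memory in /proc/meminfo.

```python
import platform
import re

MIN_RAM_GB = 4          # minimum from the requirements list
RECOMMENDED_RAM_GB = 8  # recommended for Large models
SLACK = 0.9             # assumption: tolerate about 10% of RAM reserved by firmware/kernel


def total_ram_gb(meminfo_text: str) -> float:
    """Parse MemTotal (reported in kB) from /proc/meminfo content; return GiB."""
    match = re.search(r"^MemTotal:\s+(\d+)\s+kB", meminfo_text, re.MULTILINE)
    if match is None:
        raise ValueError("MemTotal not found")
    return int(match.group(1)) / (1024 ** 2)


def assess(ram_gb: float, arch: str) -> str:
    """Compare RAM and CPU architecture against the listed requirements."""
    if arch != "x86_64":
        return "unsupported architecture"
    if ram_gb < MIN_RAM_GB * SLACK:
        return "below minimum RAM"
    if ram_gb < RECOMMENDED_RAM_GB * SLACK:
        return "ok for smaller models"
    return "ok for large models"


if __name__ == "__main__":
    with open("/proc/meminfo") as f:
        ram = total_ram_gb(f.read())
    arch = platform.machine()
    print(f"{ram:.1f} GiB RAM, {arch}: {assess(ram, arch)}")
```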
Installation
Download the AirTypes AppImage or .deb from airtypes.com. For the AppImage, mark the file executable and run it; for the .deb, install it with sudo dpkg -i. There are no dependencies to install manually. Models download on first use based on your tier selection.
Vocalinux — best free Linux voice dictation
Vocalinux is an open-source, offline voice dictation tool built on Whisper for Linux. It handles system-wide text injection and works in most desktop environments (GNOME, KDE, XFCE). No cost, no subscription.
Best for: Users comfortable with a manual setup who don't need filler word removal, AI routing, or ongoing updates. Good "it works" solution once configured.
Limitations: No Windows or macOS. Manual model management. No tone adjustment. Community-supported only.
OpenWhispr — best for self-hosting / cross-platform
OpenWhispr is a cross-platform open-source wrapper around Whisper that runs on Linux, macOS, and Windows. It handles model management and injection. More polished than raw whisper.cpp but still requires terminal setup.
Best for: Developers who want cross-platform offline transcription with full control and no subscription. Good if you're comfortable in the terminal.
Getting started: Linux voice dictation by distribution
Ubuntu 20.04+ / Debian
AirTypes: download the .deb package from airtypes.com, then install it with sudo dpkg -i airtypes.deb.
Vocalinux: follow the installation guide at vocalinux.com. The install uses pip and requires PortAudio.
Fedora 36+
AirTypes: use the AppImage download. Mark executable with chmod +x AirTypes.AppImage, then run directly.
Arch Linux / Manjaro
The AirTypes AppImage works on Arch. For Vocalinux or OpenWhispr, AUR packages may be available; check the respective project repositories.
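The per-distribution choice above comes down to one question: is the system Debian-based (use the .deb) or not (use the AppImage)? As a rough sketch, a script can answer that from /etc/os-release, whose ID and ID_LIKE fields identify the distribution family. The function names are hypothetical, and the file names match the ones used in this section.

```python
import shlex

DEB_FAMILIES = {"debian", "ubuntu"}


def parse_os_release(text: str) -> dict[str, str]:
    """Parse KEY=value lines from os-release content, removing shell quoting."""
    fields: dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        fields[key] = " ".join(shlex.split(value))
    return fields


def install_hint(fields: dict[str, str]) -> str:
    """Suggest the AirTypes install command for this distribution family."""
    # ID is the distro itself; ID_LIKE lists the families it derives from.
    ids = {fields.get("ID", "")} | set(fields.get("ID_LIKE", "").split())
    if ids & DEB_FAMILIES:
        return "sudo dpkg -i airtypes.deb"
    return "chmod +x AirTypes.AppImage && ./AirTypes.AppImage"


if __name__ == "__main__":
    with open("/etc/os-release") as f:
        print(install_hint(parse_os_release(f.read())))
```

Checking ID_LIKE as well as ID means derivatives such as Linux Mint or Pop!_OS are treated as Debian-based without being listed by name.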
Final recommendation
The right Linux voice dictation tool depends on your priorities:
- Want install-and-go, polished UI, AI features: AirTypes ($3.99/mo, 7-day free trial)
- Want free and fully offline, OK with manual setup: Vocalinux
- Developer, want cross-platform + self-hosted: OpenWhispr
- Want maximum DIY control: whisper.cpp + xdotool script
For most Linux users who want something that "just works", AirTypes is the only polished, maintained, offline option currently available.
Try AirTypes on Linux — free for 7 days
No credit card. Full access to all features. Works on Ubuntu, Fedora, Arch, and more.