$ hyprwhspr
System-wide speech‑to‑text.
Local models. Secure Cloud providers. Fully featured. Top
performance.
── Features
Cutting-edge local
Ships with support for Parakeet TDT V3, Cohere Transcribe, and the full Whisper family. On CPU? onnx-asr delivers wild speeds without a GPU. Models stay hot in memory.
GPU-intelligent
Auto-detects NVIDIA CUDA, AMD/Intel Vulkan, or falls back to CPU. Unload the model from VRAM on demand — free resources, then reload instantly without restarting the service.
Private by default
Local inference means nothing leaves your machine. When you do reach for cloud — OpenAI, ElevenLabs, and more — credentials are stored securely and never touch config files.
Audio ducking
System volume steps down while you record, back up when you're done.
Four recording modes
Toggle, push-to-talk, auto (tap vs. hold), and long-form with pause and save.
Paste anywhere
Text injects into any active buffer via ydotool. Auto-submit optional.
Themed visualizer
Mic-OSD overlay that auto-matches your Omarchy theme. Looks great.
Waybar tray
Live status indicator: idle, recording, processing, error — all at a glance.
Multi-lingual
Strong performance across many languages. Optional translate-to-English mode.
Text processing
Word overrides, filler word removal, symbol replacements, custom prompts.
WebSocket streaming
Stream in near realtime via 11Labs, OpenAI Realtime API or similar services.
Works everywhere
Hyprland, GNOME, KDE Plasma, Sway — any Wayland compositor with systemd.
Free and open source, forever. For the people!
MIT licensed. No subscriptions, no telemetry, no monetization — ever.
── Install
── Writing
Voice Typing on Linux in 2026
Speech-to-text on Linux has been broken for years. Local models and Wayland maturity have finally changed that.
Read → modelsBest Speech-to-Text Models for Linux
Parakeet TDT V3, Whisper, onnx-asr, Cohere — which model fits your hardware and workflow.
Read → opinionDictation is the Future of Programming
The keyboard made sense when code was the output. Now that language is the input, your voice is faster.
Read →