Handy STT
34 likes
A free, open source, and extensible speech-to-text application that works completely offline.
Features
Properties
- Lightweight
- Privacy focused
Features
- No registration required
- Push to talk
- No Tracking
- Ad-free
- Works Offline
- Support for Keyboard Shortcuts
- speech transcription
- Speech to text
- Support for Hotkeys
Tags
- stt
- tauri2
- tauri
Handy STT News & Activities
Highlights All activities
Recent activities
- olegsh liked Handy STT
- anchovylatte liked Handy STT
Sharrnah added Handy STT as alternative to Whispering Tiger
What is Handy STT?
A free, open source, and extensible speech-to-text application that works completely offline.
Handy is a cross-platform desktop application built with Tauri (Rust + React/TypeScript) that provides simple, privacy-focused speech transcription. Press a shortcut, speak, and have your words appear in any text field—all without sending your voice to the cloud.
Why Handy?
Handy was created to fill the gap for a truly open source, extensible speech-to-text tool.
- Free: Accessibility tooling belongs in everyone's hands, not behind a paywall
- Open Source: Together we can build further. Extend Handy for yourself and contribute to something bigger
- Private: Your voice stays on your computer. Get transcriptions without sending audio to the cloud
- Simple: One tool, one job. Transcribe what you say and put it into a text box
Handy isn't trying to be the best speech-to-text app—it's trying to be the most forkable one.






Comments and Reviews
It's a good app, but need some touch how on this working offline implementation a bit better. And yeah you need to download a model before it can work and it doesn't show how much space needed in hard drive for model to work.
You'd be hard pressed to find a better FREE speech to text that is as lightweight, straightforward, and accurate as Handy is.
Handy STT is a simple, lightweight, no frills text to speech app that handles day to day use well. To use it you do have to download at least one LLM. The models are presented to you to choose from so you don't have to go searching for them and each has a simple bar graph showing you (assuming you're an English speaker) how accurate and responsive each model is. Set up is as easy a picking a model, letting it download, changing settings like if you want the text copied to your clipboard or not then you are ready to go. You can easily be up and going in under 10 minutes. A few small things hold this software back from being a perfect 10 in my book ( The primarily issue being you're unable to train the model on how to spell words. As well as not being able to instruct the program to write numbers or special characters such as exclamation marks or question marks.) Finally while not an issue for me the current fastest and most accurate models used ( Parakeet V2 and V3) are known to work much better with English but they are working hard to add wider language support.
Pros:
##Cons:
Done. I've finished looking for an STT app, or trying to vibe-code my own. This is it.