A free, open source, and extensible speech-to-text application that works completely offline.



CMU Sphinx is described as 'Speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems' and is an app in the development category. There are more than 25 alternatives to CMU Sphinx for a variety of platforms, including Mac, Windows, Linux, iPhone and iPad apps. The best CMU Sphinx alternative is Handy STT, which is both free and Open Source. Other great apps like CMU Sphinx are Vibe Transcribe, Voxtral, FUTO Voice Input and Whisper.
A free, open source, and extensible speech-to-text application that works completely offline.



Vibe is an auto transcription service that utilizes local language learning models (LLMs) or artificial intelligence to provide transcriptions for a wide range of languages. The service prioritizes user privacy by offering fully offline transcription, ensuring that no data ever...

Voxtral models are state-of-the-art speech understanding models, which are available in two sizes — a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both versions are released under the Apache 2.0 license.

FUTO Voice Input is an application that lets you do speech-to-text on Android, integrating with third party keyboards or apps that use the generic speech-to-text APIs.



Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.








Moonshine is a family of speech-to-text models optimized for fast and accurate automatic speech recognition (ASR) on resource-constrained devices. It is well-suited to real-time, on-device applications like live transcription and voice command recognition.
Privacy-focused transcription tool for macOS. For free. No ads, no tracking, no data collection.


Provides instant, local voice-to-text transcription, searchable timeline for both dictated and clipboard text, fully private processing on macOS, adaptive learning for terminology, invisible activation, audio/video file transcription, and seamless pasting into apps.


Dictly turns speech into polished, structured text — instantly and entirely on your device. No servers. No data collection. No delay.




Aqua Voice is a voice-driven document editor that lets you edit documents using just your voice. Instead of transcribing what you said, Aqua Voice writes what you meant.

Switch between typing and voice to take notes with ease, supporting 40 languages. Real-time transcription captures paragraphs and emoticons while ensuring clear punctuation. Pro upgrade offers cloud storage and multiple note management options.



