


Vocol is described as 'AI transcription software and a one-stop voice collaboration platform designed to boost work efficiency by turning voice and data into actionable insights' and is an audio transcription tool in the office & productivity category. There are more than 50 alternatives to Vocol, including websites and apps for a variety of platforms such as Mac, iPhone, iPad and Android. The best Vocol alternative is Handy STT, which is both free and open source. Other great sites and apps similar to Vocol are Vibe Transcribe, Voxtral, FUTO Voice Input and Spokenly.
A free, open source, and extensible speech-to-text application that works completely offline.



Vibe is an automatic transcription service that uses local large language models (LLMs) or artificial intelligence to provide transcriptions for a wide range of languages. The service prioritizes user privacy by offering fully offline transcription, ensuring that no data ever...

Voxtral models are state-of-the-art speech understanding models, which are available in two sizes — a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both versions are released under the Apache 2.0 license.

FUTO Voice Input is an application that lets you do speech-to-text on Android, integrating with third party keyboards or apps that use the generic speech-to-text APIs.



Experience effortless voice-to-text on your Mac. Speak your thoughts, and let modern AI handle the typing—no hidden data collection, no distractions.




Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.




TalkNotes turns audio into structured notes, todos, flashcards, and transcripts using AI transcription in 100+ languages. Automatically capture lectures, meetings, or any speech into actionable text, ensuring tasks never get lost, with user-friendly accessibility for everyone.








Never pay to convert speech to text. Fluid transforms your voice into text instantly with NVIDIA's fastest AI model, processing locally on your Mac with complete privacy.




Moonshine is a family of speech-to-text models optimized for fast and accurate automatic speech recognition (ASR) on resource-constrained devices. It is well-suited to real-time, on-device applications like live transcription and voice command recognition.

Ito transforms your voice into perfect text anywhere on Mac. Speak naturally, and our AI crafts polished messages for any context.

Instantly converts spoken words to text using local AI for near-perfect accuracy and privacy, with real-time transcription, seamless macOS integration, user-supplied AI keys, multiple language support, and no cloud data upload required.



