Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers.
Cost / License
- Free
- Open Source
Platforms
- Mac
- Windows
- Linux
Whisper is described as 'End-to-end speech recognition model trained on 680,000 hours of multitask, multilingual audio data, offering robust transcription, translation, and language identification' and is a audio transcription tool in the audio & music category. There are more than 100 alternatives to Whisper for a variety of platforms, including Mac, Web-based, Windows, iPhone and iPad apps. The best Whisper alternative is Handy STT, which is both free and Open Source. Other great apps like Whisper are Vibe Transcribe, Voxtral, FUTO Voice Input and TypeWhisper.
Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers.
Transcripo is an intuitive tool crafted for converting audio and video files into precise text transcriptions. It supports multiple languages, most video/audio file formats and offers features such as AI-generated summaries and subtitle exports for videos.


Happy Scribe converts audio to text using online transcription software. Get accurate and affordable transcripts in minutes.

Transcriboar is a lightweight Android transcription app that uses the device’s built-in SpeechRecognizer to convert speech to text in real time.




Transcripts are your new secret weapon. Access the full potential of your audio and video content by converting it to searchable, editable interactive transcripts with Trint.
Glasscribe is a lightweight macOS menu bar app that transcribes speech in real time — entirely on your device. Built on Apple's native Speech framework (macOS 26 Tahoe), it captures both system audio and microphone input across 22+ languages with real-time on-device...
Whisper Dictator is a free desktop dictation application that runs entirely on your computer. Powered by OpenAI's Whisper AI model, it converts speech to text with high accuracy in over 90 languages — without sending any data to the cloud.

NotchLive is a macOS menu bar app that displays real-time AI-powered captions and translations directly in your MacBook's notch. It uses on-device Whisper AI (via CoreML) for speech recognition and Apple Translation for real-time translation — nothing ever leaves your Mac.


Private, on-device audio transcription for macOS. Your audio never leaves your Mac — no cloud uploads, no subscriptions, no data collection. Real-time ASR with Qwen3-ASR, MLX Whisper & Whisper, plus system-wide dictation, all 100% local.




Buzz Captions is an offline audio transcription and translation tool powered by OpenAI's Whisper model. It allows users to import audio and video files to generate transcripts in CSV, SRT, TXT and VTT formats.

Free Podcast Transcription is a 100% free automated transcription tool that works in the browser. Nothing to install. 100% privacy safe. And totally free.





