AI executive assistant that records Google Meet, Zoom, Teams, Webex; transcribes and makes instant summaries: decisions, action items, minutes, smart titles. Search/tag, jump to quotes, export/share.



Handy STT is described as 'Free, open source speech-to-text app for desktop, works fully offline, prioritizes privacy, extensibility, cross-platform support, and accessibility' and is a popular audio transcription tool in the audio & music category. There are more than 50 alternatives to Handy STT for a variety of platforms, including Mac, Web-based, Windows, iPhone and Linux apps. The best Handy STT alternative is Vibe Transcribe, which is both free and Open Source. Other great apps like Handy STT are OpenWhispr, Voxtral, FUTO Voice Input and TypeWhisper.
AI executive assistant that records Google Meet, Zoom, Teams, Webex; transcribes and makes instant summaries: decisions, action items, minutes, smart titles. Search/tag, jump to quotes, export/share.



Supernormal combines a desktop notetaker with an AI agent to transform your meetings into finished deliverables. The desktop app captures your Zoom, Teams, Google Meet, or Slack conversations without a bot joining the call.



Transcript LOL is a transcription service that converts video, podcast, or meeting content into text, supporting over 1500 platforms without requiring downloads or uploads.





Buzz Captions is an offline audio transcription and translation tool powered by OpenAI's Whisper model. It allows users to import audio and video files to generate transcripts in CSV, SRT, TXT and VTT formats.

CMU Sphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.
Windows Speech Recognition makes using a keyboard and mouse optional. You can control your PC with your voice and dictate text instead.
Open-source Rust based AI meeting assistant with Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization. 100% local processing. No cloud required.



Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Write with your voice in any app on macOS. Faster and more accurate than ChatGPT, Google and OpenAI Whisper. Start talking. Stop typing.

High-quality on-device transcription. Easily convert speech to text from meetings, lectures, and more.

