Glimpse is a local-first voice dictation app for Mac. No subscription, no cloud required. It's just fast, accurate transcription powered by models running entirely on-device.




Supernormal is described as 'Combines a desktop notetaker with an AI agent to transform your meetings into finished deliverables. The desktop app captures your Zoom, Teams, Google Meet, or Slack conversations without a bot joining the call' and is a audio transcription tool in the audio & music category. There are more than 50 alternatives to Supernormal for a variety of platforms, including Web-based, Mac, Windows, iPhone and iPad apps. The best Supernormal alternative is Handy STT, which is both free and Open Source. Other great apps like Supernormal are Vibe Transcribe, Voxtral, FUTO Voice Input and Whisper.
Glimpse is a local-first voice dictation app for Mac. No subscription, no cloud required. It's just fast, accurate transcription powered by models running entirely on-device.




Notiq is a privacy-first note-taking app for iPhone that lets you capture voice memos, transcribe speech, write text notes, and scan documents — all securely and completely offline.



Powered by deep AI, Deciphr timestamps and summarizes your entire podcast transcript for you.

SpeechText.AI's primary feature is domain-specific speech recognition technology. With this audio transcription software you can get accurate transcripts for wide range of domains: finance, HR, legal, education, medical, information technology, etc.


TranscribeMe offers a suite of transcription products that deliver the highest quality human readable text quickly and with the lowest prices.




Transcriptable is transcription software that lets you transcribe faster by focusing on improving even difficult to hear audio, along with powerful sound, speed and equalizer controls.




This is an online tool for recognition audio voice file(mp3,wav,ogg,wma etc) to text.
This tool base by CMU Sphinx, which a open source speech recognition toolkit from CMU. It is a free and online tool.
Transcribe audio and video files in a blink, automatically, all offline, and with highly accurate results. AI Transcription uses OpenAI’s Whisper technology and Apple Speech Recognition to convert speech (like in podcasts, presentations, lectures, or voice messages) into text...




Ebby will automatically convert your audio to text for a fraction of the time and cost of traditional services.




VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and...

