Glimpse is a local-first voice dictation app for Mac. No subscription, no cloud required. It's just fast, accurate transcription powered by models running entirely on-device.




Audiotype - Audio & Video Transcription is described as 'Audiotype is transcription software that converts audio and video files into editable text transcripts and subtitles. More than 10,000 users use Audiotype to transcribe their media files (video, podcast, recordings, MP4, MP3, interviews) into exportable transcripts or subtitles' and is an audio transcription tool in the audio & music category. There are more than 50 alternatives to Audiotype - Audio & Video Transcription, not only websites but also apps for a variety of platforms, including Mac, iPhone, Windows and SaaS apps. The best Audiotype - Audio & Video Transcription alternative is Handy STT, which is both free and Open Source. Other great sites and apps similar to Audiotype - Audio & Video Transcription are Vibe Transcribe, Voxtral, FUTO Voice Input and Whisper.




AI-driven online converter transcribes uploaded audio files into accurate text, supporting multiple languages and dialects. Works fully in-browser, requires no registration, offers fast speech recognition, and supports formats suited for interviews or meetings.

Local speech-to-text app for Windows 10/11 using Whisper AI, transcribes audio and video files offline, protects privacy, supports 90+ languages and multiple formats, offers GPU acceleration, drag-and-drop, and exports to SRT, VTT, TXT, or LRC formats.



NotchLive is a macOS menu bar app that displays real-time AI-powered captions and translations directly in your MacBook's notch. It uses on-device Whisper AI (via CoreML) for speech recognition and Apple Translation for real-time translation — nothing ever leaves your Mac.


WordWand is a system-wide AI assistant for macOS that works in any app through a single keyboard shortcut. No copy-pasting, no tab switching — just select text, press a hotkey, and transform it instantly.



VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and...


VoiceType AI is a revolutionary voice-first writing assistant designed to transform how you create content.



Transcriboar is a lightweight Android transcription app that uses the device’s built-in SpeechRecognizer to convert speech to text in real time.



