Private, on-device audio transcription for macOS. Your audio never leaves your Mac — no cloud uploads, no subscriptions, no data collection. Real-time ASR with Qwen3-ASR, MLX Whisper & Whisper, plus system-wide dictation, all 100% local.




Gladia is described as 'Production-ready Speech-to-Text API built for teams shipping real-world voice products—delivering high accuracy, multilingual coverage, real-time + async transcription, and a growing set of add-ons (diarization, translation, summarization, sentiment, formatting, and more)' and is an audio transcription tool in the audio & music category. There are more than 25 alternatives to Gladia, including websites and apps for a variety of platforms such as Mac, Windows, Self-Hosted and iPhone. The best Gladia alternative is Vibe Transcribe, which is both free and open source. Other great sites and apps similar to Gladia are Voxtral, Whisper, TranscribeX and Moonshine AI.

WordWand is a system-wide AI assistant for macOS that works in any app through a single keyboard shortcut. No copy-pasting, no tab switching — just select text, press a hotkey, and transform it instantly.



Meeting Recorder is your personal assistant for meetings. It listens and transcribes meetings and conferences for you, allowing you to search for words and phrases within your recording. You can record your most important conversations and save time, helping you work more...




Transcribe audio and video files in a blink, automatically, all offline, and with highly accurate results. AI Transcription uses OpenAI’s Whisper technology and Apple Speech Recognition to convert speech (like in podcasts, presentations, lectures, or voice messages) into text...




Are you tired of manually transcribing audio recordings, images, and videos into text?




Gazelle is a joint speech-language model by Tincans; for more details and prompt ideas, see our v0.2 announcement. This is an early research preview, so please temper expectations! Gazelle accepts text and audio as input (interchangeably) and generates text as output.

AI-driven online converter transcribes uploaded audio files into accurate text, supporting multiple languages and dialects. Works fully in-browser, requires no registration, offers fast speech recognition, and supports formats suited for interviews or meetings.

This is an iOS application written in Objective-C that helps people work through a piece of audio in order to write it out.




NotchLive is a macOS menu bar app that displays real-time AI-powered captions and translations directly in your MacBook's notch. It uses on-device Whisper AI (via CoreML) for speech recognition and Apple Translation for real-time translation — nothing ever leaves your Mac.

