This is Scriberr, a self-hostable AI audio transcription app. Scriberr uses the Whisper models from OpenAI to transcribe audio files offline, on your hardware.




Gladia is described as 'Production-ready Speech-to-Text API built for teams shipping real-world voice products—delivering high accuracy, multilingual coverage, real-time + async transcription, and a growing set of add-ons (diarization, translation, summarization, sentiment, formatting, and more)' and is an audio transcription tool in the audio & music category. There are more than 25 alternatives to Gladia, including websites and apps for a variety of platforms such as Mac, Windows, Self-Hosted and iPhone. The best Gladia alternative is Vibe Transcribe, which is both free and Open Source. Other great sites and apps similar to Gladia are Voxtral, Whisper, TranscribeX and Moonshine AI.




Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Write with your voice in any app on macOS. Faster and more accurate than ChatGPT, Google and OpenAI Whisper. Start talking. Stop typing.

The Tomedes Free AI Transcription Tool transforms audio and video files into clear, accurate text in seconds. Supporting formats like MP3, MP4, WAV, and more, it offers seamless transcriptions in nearly 100 languages.

BetterDictation is your personal scribe. You speak, and it quickly and flawlessly transcribes your words into any app.


BlabbyAI is a powerful speech-to-text extension for your browser that lets you voice-type 3x faster than typing.




SpeechText.AI's primary feature is domain-specific speech recognition technology. With this audio transcription software you can get accurate transcripts for a wide range of domains: finance, HR, legal, education, medical, information technology, etc.


AssemblyAI is an API for speech recognition. The team has built “accurate, simple and customizable” technology that it claims does for speech what “Stripe did to payments.” The voice technology industry is growing fast, due to the popularity of Siri, Alexa and Google Home.

Private Transcriber Pro is a Windows-based offline transcription tool that processes audio and video files. Key features include drag-and-drop functionality, multilingual transcription with optional English translation, and export options for text and subtitle files.



VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and...


Automatically convert all of your voice recordings into clean, organized, neat text files. Unlimited and free.


