Transcribe any audio and get fast and accurate transcripts with timestamps using AI. Generate new content from the transcripts such as summaries, blog-posts, social media posts or your own custom content with GPT prompts. No subscription required.


SpeechFlow is described as 'Accurate speech-to-text API for all languages beyond just English' and is a audio transcription tool in the audio & music category. There are more than 50 alternatives to SpeechFlow for a variety of platforms, including Web-based, Mac, SaaS, iPhone and Windows apps. The best SpeechFlow alternative is FUTO Voice Input, which is both free and Open Source. Other great apps like SpeechFlow are Whisper, Moonshine AI, MacWhisper and Dictanote.
Transcribe any audio and get fast and accurate transcripts with timestamps using AI. Generate new content from the transcripts such as summaries, blog-posts, social media posts or your own custom content with GPT prompts. No subscription required.


Podium is the ultimate podcast editing service that uses AI-generated show notes, summaries, chapters, transcripts, and highlights to supercharge your post-production workflow. With Podium's AI copywriting software, you can generate high-quality show notes and summaries in...



A free, secure, and easy automatic transcription service, that delivers astounding transcriptions in minutes. Made in Denmark for journalists and anyone else.


A straightforward macOS application that allows the user to use different Whisper services (OpenAI API, Runpod Faster Whisper) from your macOS desktop. You have the flexibility to use your own API key, ensuring that you only incur charges for the services you actively use.




Txtplay.ai delivers AI-powered real-time captioning, transcription, and translation for TV and online streaming. It integrates with encoders like PixelPower and Evertz, plus OVPs such as Kaltura and Brightcove. Cloud, hybrid, or on-prem — accessible and multilingual.

txtplay.ai is the most popular SaaS alternative to SpeechFlow.
AI executive assistant that records Google Meet, Zoom, Teams, Webex; transcribes and makes instant summaries: decisions, action items, minutes, smart titles. Search/tag, jump to quotes, export/share.








Transcript LOL is a transcription service that converts video, podcast, or meeting content into text, supporting over 1500 platforms without requiring downloads or uploads.




CMU Sphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.
Windows Speech Recognition makes using a keyboard and mouse optional. You can control your PC with your voice and dictate text instead.
Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.