Transcripts are your new secret weapon. Access the full potential of your audio and video content by converting it to searchable, editable interactive transcripts with Trint.
Cost / License
- Paid
- Proprietary
Platforms
- Online
Amphion is described as 'Toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development' and is an audio transcription tool in the AI tools & services category. There are more than 50 alternatives to Amphion for a variety of platforms, including Mac, Web-based, Windows, iPhone and iPad apps. The best Amphion alternative is Handy STT, which is both free and open source. Other great apps like Amphion are Vibe Transcribe, Voxtral, FUTO Voice Input and Whisper.
Transcriptions powered by artificial intelligence. Automatically convert audio into text.
Transform thoughts into text effortlessly with Audio Note. Speak your mind and let AI refine it into formats of your choice like Journal Entries, Tweets, Notes, Lists or LinkedIn Posts. Unlock the power of your words!




VoxCommando is a speech recognition and command utility that can be used for home automation, as an assistive tool to speed up everyday tasks, or simply because it is fun and easy to use.




BetterDictation is your personal scribe. You speak, and it quickly and flawlessly transcribes your words into any app.


Saylient speeds up writing meeting minutes, reviewing lectures, and analyzing interviews. Transcribe, review, and share snippets from your video and audio files.

Scriber Pro transforms your audio and video files into accurate text in seconds with the power of AI transcription. Whether you're transcribing meetings, interviews, lectures, or personal recordings, Scriber Pro makes it effortless.




FLUENT is a hotkey-activated speech-to-text recognition tool that conveniently displays the recognition results & copies them to the clipboard.




VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and...


Writevoice lets you write at the speed of thought. Click record, speak naturally, and get clean, accurate text ready for docs, tickets, or your CRM. It’s fast, precise, and privacy-first: we never store your recordings or transcripts.

