Amphion Alternatives

Audio Transcription Tools and other similar apps like Amphion

Amphion is described as 'Toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development' and is an audio transcription tool in the AI Tools & Services category. There are more than 50 alternatives to Amphion for a variety of platforms, including Mac, Web-based, Windows, iPhone and iPad apps. The best Amphion alternative is Handy STT, which is both free and open source. Other great apps like Amphion are Vibe Transcribe, Voxtral, FUTO Voice Input and Whisper.


Alternatives list

  1. Handy STT
     55 likes

    Free, open-source speech-to-text app for desktop that runs fully offline, keeping transcription and audio data private. Extensible and customizable, it supports accessibility, cross-platform use, and community-driven modifications. No subscription or internet connection required.

    87 Handy STT alternatives

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Mac
    • Windows
    • Linux
     
  2. Vibe Transcribe
     63 likes

    Delivers offline transcription of audio and video using local AI for privacy, supports batch processing, multi-language input, real-time preview, multi-format export, English translation, direct printing, system audio, microphone input, CLI, and setup customization on Windows, Linux, or macOS.

    158 Vibe Transcribe alternatives

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Mac
    • Windows
    • Linux
     
  3. Voxtral
     23 likes

    Advanced audio models for transcription, translation, and understanding, optimized for production and edge deployment. API-accessible and open source under Apache 2.0, delivering high accuracy, resource efficiency, and support for both large-scale and local use cases.

    98 Voxtral alternatives

    Platforms

    • Online
    • Self-Hosted
    • Hugging Face
     
  4. FUTO Voice Input

    FUTO Voice Input is an application that lets you do speech-to-text on Android, integrating with third-party keyboards or apps that use the generic speech-to-text APIs.

    Cost / License

    • Free
    • Open Source

    Platforms

    • Android
     
  5. Whisper
     26 likes

    An open-source, end-to-end speech recognition system trained on 680,000 hours of diverse audio, providing multilingual transcription, to-English translation, language identification, phrase-level timestamps, and high performance in real-world scenarios using a Transformer architecture.

    157 Whisper alternatives

    Cost / License

    • Freemium
    • Open Source (MIT)

    Platforms

    • Mac
    • Windows
    • Linux
     
  6. Spokenly
     14 likes

    Voice recognition software offering real-time transcription across any Mac app, instant customizable shortcuts, support for diverse accents, and visual cues. Privacy-focused with no hidden data collection, it reduces manual typing and integrates fully for note taking, chat, or drafting.

    112 Spokenly alternatives

    Cost / License

    • Freemium
    • Proprietary

    Platforms

    • Mac
    • iPhone
     
  7. Paraspeech
     6 likes

    Fully offline speech-to-text for macOS that prioritizes privacy by processing all transcriptions on device. Optimized for Apple Silicon, lightweight at under 200 MB of RAM, compatible with any app, and efficient in the background, with no recurring subscription and fast model setup.

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Mac
     
  8. Moonshine AI
     13 likes

    Optimized for edge hardware, this open-source tool provides fast, private on-device speech recognition for real-time transcription, command and speaker identification, lower word-error rates than Whisper models, and multi-language support across varied platforms.

    Cost / License

    • Free
    • Open Source

    Platforms

    • Self-Hosted
    • Python
     
  9. TalkNotes

    TalkNotes turns audio into structured notes, todos, flashcards, and transcripts using AI transcription in 100+ languages. Automatically capture lectures, meetings, or any speech into actionable text, ensuring tasks never get lost, with user-friendly accessibility for everyone.

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Online
    • Software as a Service (SaaS)
     
  10. VoiceInk
     7 likes

    Instantly converts spoken words to text using local AI for near-perfect accuracy and privacy, with real-time transcription, seamless macOS integration, user-supplied AI keys, multiple language support, and no cloud data upload required.

    101 VoiceInk alternatives

    Cost / License

    • Freemium
    • Open Source

    Platforms

    • Mac
     
  11. FluidVoice
     5 likes

    Processes voice-to-text instantly on Mac with privacy-first local AI, zero delay, no subscriptions, open-source code, optional context-enhanced transcription, direct input into any app, hardware-optimized performance, and full offline control.

    83 FluidVoice alternatives

    Platforms

    • Mac
     
  12. xcribe
     8 likes

    Free, privacy-focused transcription tool for macOS. No ads, no tracking, no data collection.

    Cost / License

    • Free
    • Proprietary

    Platforms

    • Mac
     
12 of 74 Amphion alternatives