Voxtral AlternativesAudio Transcription Tools and other similar apps like Voxtral

Voxtral is described as 'State-of-the-art speech models with transcription, translation, and audio understanding, available via API or self-hosted, optimized for cost and efficiency' and is a audio transcription tool in the ai tools & services category. There are more than 50 alternatives to Voxtral for a variety of platforms, including Mac, Web-based, Windows, iPhone and Linux apps. The best Voxtral alternative is Handy STT, which is both free and Open Source. Other great apps like Voxtral are Vibe Transcribe, FUTO Voice Input, ElevenLabs and Spokenly.

filter to find the best alternatives

Voxtral alternatives are mainly Audio Transcription Tools, but if you're looking for Video Transcription Tools or Text to Speech Services you can filter on that. These are just examples - use the filter bar below to find more specific alternatives to Voxtral.
Copy a direct link to this comment to your clipboard
Voxtral alternatives page was last updated

Alternatives list

  1. Handy STT icon
     51 likes

    Free open source speech-to-text app for desktop runs fully offline, keeping transcription and audio data private. Extensible and customizable, supports accessibility, cross-platform use, and community-driven modifications. No subscriptions or internet requirement.

    77 Handy STT alternatives

    Cost / License

    • Free
    • Open Source (MIT)

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
     
  2. Vibe Transcribe icon
     60 likes

    Delivers offline transcription of audio and video using local AI for privacy, supports batch processing, multi-language input, real-time preview, multi-format export, English translation, direct printing, system audio, microphone input, CLI, and setup customization on Windows, Linux, or macOS.

    144 Vibe Transcribe alternatives

    Cost / License

    • Free
    • Open Source (MIT)

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
     
  3. ElevenLabs icon
     19 likes

    ElevenLabs uses AI to deliver natural, expressive speech for diverse applications such as podcasts and videos. It features a user-friendly interface, customizable intonation, and offers seamless API integration. Privacy, scalability, and multilingual capabilities enhance its adaptability.

    51 ElevenLabs alternatives

    Cost / License

    • Freemium
    • Proprietary

    Platforms

    • Online
    • Android
    • iPhone
    • Android Tablet
    • iPad
     
  4. Spokenly icon
     13 likes

    Voice recognition software offering real-time transcription across any Mac app, instant customizable shortcuts, support for diverse accents, visual cues, privacy-focused with no hidden data collection, reduces manual typing, fully integrates for note taking, chat, or drafting.

    101 Spokenly alternatives

    Cost / License

    • Freemium
    • Proprietary

    Application type

    Platforms

    • Mac
    • iPhone
     
  5. Whisper icon
     24 likes

    An open-source, end-to-end speech recognition system trained on 680,000 hours of diverse audio, providing multilingual transcription, to-English translation, language identification, phrase-level timestamps, and high performance in real-world scenarios using transformer architecture.

    145 Whisper alternatives

    Cost / License

    • Freemium
    • Open Source (MIT)

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
     
  6. TranscribeX icon
     8 likes

    Local macOS solution for fast AI-powered transcription and translation of audio or video in over 100 languages, featuring speaker diarization, batch processing, GPU acceleration, file-type flexibility, drag-and-drop support, advanced editing, and secure privacy.

    71 TranscribeX alternatives

    Cost / License

    • Freemium
    • Proprietary

    Application type

    Platforms

    • Mac
     
  7. Paraspeech icon
     6 likes

    Fully offline speech-to-text for macOS, prioritizing privacy by processing all transcriptions on device, optimized for Apple Silicon, lightweight at under 200MB RAM, compatible with any app, efficient in the background, with no recurring subscription, and fast model setup.

    86 Paraspeech alternatives

    Cost / License

    • Paid
    • Proprietary

    Application type

    Platforms

    • Mac
     
  8. TalkNotes turns audio into structured notes, todos, flashcards, and transcripts using AI transcription in 100+ languages. Automatically capture lectures, meetings, or any speech into actionable text, ensuring tasks never get lost, with user-friendly accessibility for everyone.

    Cost / License

    • Paid
    • Proprietary

    Application type

    Platforms

    • Online
    • Software as a Service (SaaS)
     
  9. Moonshine AI icon
     12 likes

    Optimized for edge hardware, this open-source tool provides fast, private on-device speech recognition for real-time transcription, command and speaker identification, lower word-error rates than Whisper models, and multi-language support over varied platforms.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Self-Hosted
    • Python
     
  10. FluidVoice icon
     5 likes

    Processes voice-to-text instantly on Mac with privacy-first local AI, zero delay, no subscriptions, open-source code, optional context-enhanced transcription, direct input into any app, hardware-optimized performance, and full offline control.

    Cost / License

    Application type

    Platforms

    • Mac
     
  11. Murmure icon
     5 likes

    A privacy-first, open-source speech-to-text application that runs entirely on your machine, powered by a neural network via NVIDIA’s Parakeet model for fast, local transcription. Murmure turns your voice into text with no internet connection and zero data collection, and...

    69 Murmure alternatives

    Cost / License

    Application type

    Platforms

    • Windows
    • Linux
     
12 of 91 Voxtral alternatives