Voxtral Alternatives

Voxtral is described as 'Models are state-of-the-art speech understanding models, which are available in two sizes — a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both versions are released under the Apache 2.0 license' and is a audio transcription tool in the ai tools & services category. There are more than 50 alternatives to Voxtral for a variety of platforms, including Mac, Web-based, Windows, iPhone and Linux apps. The best Voxtral alternative is Vibe Transcribe, which is both free and Open Source. Other great apps like Voxtral are Handy STT, FUTO Voice Input, ElevenLabs and Whisper.

Copy a direct link to this comment to your clipboard
Voxtral alternatives page was last updated

Alternatives list

  1. AI Audio Kit icon
     1 like
    Copy a direct link to this comment to your clipboard

    A straightforward macOS application that allows the user to use different Whisper services (OpenAI API, Runpod Faster Whisper) from your macOS desktop. You have the flexibility to use your own API key, ensuring that you only incur charges for the services you actively use.

    Cost / License

    • Pay once
    • Proprietary

    Application type

    Platforms

    • Mac
     
  2. Whisper Mate icon
     1 like
    Copy a direct link to this comment to your clipboard

    Batch transcribe audio files or movie files into text with OpenAI's Whisper AI Model. With an embed subtitles editor to preview the transcription result segment by segment. All transcribe operation is processing in local machine. Keep your privacy safe.

    Cost / License

    • Freemium (Pay once)
    • Proprietary

    Platforms

    • Mac
     
  3. Letterly icon
     6 likes
    Copy a direct link to this comment to your clipboard

    Letterly is a mobile app that converts any speech to clear and well-structured text. It's more than just a transcription. With the help of AI, you can transform your voice into structured notes, catchy social media posts, readable meeting summaries, formal emails and much more

    Cost / License

    • Freemium (Subscription)
    • Proprietary

    Platforms

    • iPhone
     
  4. txtplay.ai icon
     2 likes
    Copy a direct link to this comment to your clipboard

    Txtplay.ai delivers AI-powered real-time captioning, transcription, and translation for TV and online streaming. It integrates with encoders like PixelPower and Evertz, plus OVPs such as Kaltura and Brightcove. Cloud, hybrid, or on-prem — accessible and multilingual.

    Cost / License

    • Pay once or Subscription
    • Proprietary

    Platforms

    • Online
    • Software as a Service (SaaS)
     
  5. Buzz Captions icon
     4 likes
    Copy a direct link to this comment to your clipboard

    Buzz Captions is an offline audio transcription and translation tool powered by OpenAI's Whisper model. It allows users to import audio and video files to generate transcripts in CSV, SRT, TXT and VTT formats.

    Cost / License

    • Pay once
    • Open Source

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
    • Flathub
    • Flatpak
     
  6. Notah AI icon
     2 likes
    Copy a direct link to this comment to your clipboard

    AI executive assistant that records Google Meet, Zoom, Teams, Webex; transcribes and makes instant summaries: decisions, action items, minutes, smart titles. Search/tag, jump to quotes, export/share.

    277 Notah AI alternatives

    Cost / License

    • Free
    • Proprietary

    Platforms

    • Google Chrome
     
  7. Copy a direct link to this comment to your clipboard

    Simple, hackable offline speech to text - using the VOSK-API.

    Cost / License

    • Free
    • Open Source

    Platforms

    • Linux
     
  8. Dia TTS icon
     1 like
    Copy a direct link to this comment to your clipboard

    Dia is a 1.6B parameter text to speech model created by Nari Labs. It was pushed to the Hub using the PytorchModelHubMixin integration.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Self-Hosted
    • Python
     
  9. Copy a direct link to this comment to your clipboard

    Supernormal is the AI platform that helps you write meeting notes 20x faster.

    Cost / License

    • Freemium (Subscription)
    • Proprietary

    Application type

    Platforms

    • Software as a Service (SaaS)
     
  10.  6 likes
    Copy a direct link to this comment to your clipboard

    CMU Sphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.

    Cost / License

    • Free
    • Open Source

    Platforms

    • Mac
    • Windows
    • Linux
     
  11. Copy a direct link to this comment to your clipboard

    Windows Speech Recognition makes using a keyboard and mouse optional. You can control your PC with your voice and dictate text instead.

    Cost / License

    • Free
    • Proprietary

    Application type

    Platforms

    • Windows
     
You are at page 5 of Voxtral alternatives