Optimized for edge hardware, this open-source tool provides fast, private on-device speech recognition for real-time transcription, command and speaker identification, lower word-error rates than Whisper models, and multi-language support over varied platforms.
Voxtral AlternativesOnly apps categorised as Large Language Model (LLM) Tools
The best Large Language Model (LLM) alternative to Voxtral is Moonshine AI, which is both free and Open Source. If that doesn't suit you, our users have ranked more than 50 alternatives to Voxtral and six of them are Large Language Model (LLM) Tools so hopefully you can find a suitable replacement. Other interesting Large Language Model (LLM) Tool alternatives to Voxtral are Echosy, Dia TTS, Amphion and Vocol.
filter to find the best alternatives
Alternatives list
Private, on-device audio transcription for macOS. Your audio never leaves your Mac — no cloud uploads, no subscriptions, no data collection. Real-time ASR with Qwen3-ASR, MLX Whisper & Whisper, plus system-wide dictation, all 100% local.


+4
Dia is a 1.6B parameter text to speech model created by Nari Labs. It was pushed to the Hub using the PytorchModelHubMixin integration.

Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Vocol is an AI transcription software and a one-stop voice collaboration platform designed to boost work efficiency by turning voice and data into actionable insights.



VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and...
Cost / License
- Free
- Open Source (MIT)
Platforms
- Python
- Self-Hosted
- Hugging Face



















