Voxtral
Advanced audio models for transcription, translation, and audio understanding. Optimized for both production and edge deployment, Voxtral is accessible via API and released as open source under Apache 2.0, offering high accuracy and resource efficiency for large-scale and local use cases.
Cost / License
- Freemium
- Open Source (Apache-2.0)
Application type
Platforms
- Online
- Self-Hosted
- Hugging Face
Features
- Ad-free
- AI-Powered
- Speech Transcription
- Speech Recognition
Voxtral News & Activities
Recent News
- POX published a news article about Voxtral
Mistral unveils Voxtral Transcribe 2, a cheap open source speech model that runs on-device
French company Mistral AI has released Voxtral Transcribe 2, introducing two next-generation speech...
- POX published a news article about Voxtral
Mistral introduces Voxtral, its first family of open source speech understanding AI models
Mistral has introduced Voxtral, a new family of state-of-the-art speech understanding AI models. Th...
What is Voxtral?
Voxtral models are state-of-the-art speech understanding models available in two sizes: a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both are released under the Apache 2.0 license. Both models are also available through the Mistral API, alongside a highly optimized transcription-only endpoint designed for cost-efficiency.
Voxtral Small (24B) builds on Mistral Small 3, and Voxtral Mini (3B) builds on Ministral 3B. Each adds state-of-the-art audio input capabilities to its base model while retaining that model's text performance, and each is designed for speech transcription, translation, and audio understanding.