Voxtral models are state-of-the-art speech understanding models available in two sizes: a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both variants are released under the Apache 2.0 license.
- Audio Transcription Tool
- Freemium • Open Source
- Hugging Face
- Self-Hosted
- Online