Moonshine is a family of speech-to-text models optimized for fast and accurate automatic speech recognition (ASR) on resource-constrained devices. It is well-suited to real-time, on-device applications like live transcription and voice command recognition.
Voxtral AlternativesOnly apps categorised as Large Language Model (LLM) Tools
The best Large Language Model (LLM) alternative to Voxtral is Moonshine AI, which is both free and Open Source. If that doesn't suit you, our users have ranked more than 50 alternatives to Voxtral and five of them are Large Language Model (LLM) Tools so hopefully you can find a suitable replacement. Other interesting Large Language Model (LLM) Tool alternatives to Voxtral are Dia TTS, Amphion, Vocol and VibeVoice.
filter to find the best alternatives
Alternatives list
Dia is a 1.6B parameter text to speech model created by Nari Labs. It was pushed to the Hub using the PytorchModelHubMixin integration.

Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Vocol is an AI transcription software and a one-stop voice collaboration platform designed to boost work efficiency by turning voice and data into actionable insights.



VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as podcasts, from text. It addresses significant challenges in traditional Text-to-Speech (TTS) systems, particularly in scalability, speaker consistency, and...
Cost / License
- Free
- Open Source (MIT)
Platforms
- Python
- Self-Hosted
- Hugging Face
















