VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




The best open source alternative to Murf AI is VoiceCraft. If that doesn't suit you, our users have ranked more than 50 alternatives to Murf AI and 12 is open source so hopefully you can find a suitable replacement. Other interesting open source alternatives to Murf AI are Speech Note, X to Voice, Kokoro and eSpeak.
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.








Open-source tool that analyzes your X/Twitter profile data to generate a custom voice with ElevenLabs Voice Design API, integrating with Hedra's video API for an innovative audio-visual experience.


Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient.

eSpeak is a compact open source software speech synthesizer for English and other languages, for Linux and Windows.


SherpaTTS is an Android Text-to-Speech engine based on Next-gen Kaldi using Piper or Coqui voices.


QwenVoice is a native SwiftUI macOS application that brings state-of-the-art text-to-speech to Apple Silicon Macs with no Python install, no terminal, and no dependencies required of the user — just download and run.



Free open source AI voice cloning and text to speech synthesis. Clone a voice in 5 seconds to generate arbitrary speech in real-time.
Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. You have full control over the voice output — simply select your preferred speech engine from those...

Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects.




Free and offline Text-to-Speech (TTS) engine that reads any text on your screen with high-quality voices powered by AI models.