VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




The best Text to Speech alternative to ElevenLabs is VoiceCraft, which is both free and Open Source. If that doesn't suit you, our users have ranked more than 50 alternatives to ElevenLabs and many of them are Text to Speech Services so hopefully you can find a suitable replacement. Other interesting Text to Speech Service alternatives to ElevenLabs are Voice Engine, Kokoro, Balabolka and SherpaTTS .
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




Voice Engine is a text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.


Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient.

Balabolka is a Text-To-Speech (TTS) program. All computer voices installed on your system are available to Balabolka. The on-screen text can be saved as a WAV, MP3, MP4, OGG or WMA file. The program can read the clipboard content, view the text from DOC, EPUB, FB2, HTML, ODT...



SherpaTTS is an Android Text-to-Speech engine based on Next-gen Kaldi using Piper or Coqui voices.


We're excited to introduce Chatterbox, Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations.
Audiomatic is a web app that seamlessly translates videos into other languages. Our state-of-the-art pipeline delivers contextually-accurate dubbed translations that preserve the tone, style, and emotion of the original speakers.



Transform text into speech with natural synthesis, offering smooth and fine-tuned audio export. Create high-quality voiceovers, download outputs for diverse applications, and experience excellent synthesis. Supports various languages and operates on multiple platforms.




TTSMaker is a free text-to-speech tool that provides speech synthesis services, supports multiple languages: English, French, German, Spanish, Arabic, Chinese, Japanese, Korean, Vietnamese... and a variety of voice styles, you can use it reads text and e-books aloud, and can...

VoiceDesi is an AI-powered voice design tool that lets you create unique, realistic voices from simple text prompts. Whether you need a playful character, a professional narrator, or a powerful mythical figure, you can generate a custom voice in seconds.

Convert text into natural-sounding speech using an API powered by the best of Google’s AI technologies.

Readivo converts written articles into natural-sounding audio automatically. It is designed for blogs and publishers who want to add a “listen to article” feature without recording audio manually.




i like to use free text to speech free commercial