TTSMaker is described as 'Free text-to-speech tool that provides speech synthesis services, supports multiple languages: English, French, German, Spanish, Arabic, Chinese, Japanese, Korean, Vietnamese... and a variety of voice styles, you can use it reads text and e-books aloud, and can' and is a text-to-speech service in the education & reference category. There are more than 25 alternatives to TTSMaker, including websites as well as apps for a variety of platforms such as SaaS, Windows, Android and iPhone. The best TTSMaker alternative is VoiceCraft, which is both free and open source. Other great sites and apps similar to TTSMaker are ElevenLabs, Balabolka, X to Voice and Kokoro.
VoiceCraft is a token-infilling neural codec language model that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data, including audiobooks, internet videos, and podcasts.




ElevenLabs uses AI to deliver natural, expressive speech for diverse applications such as podcasts and videos. It features a user-friendly interface and customizable intonation, and it offers seamless API integration. Privacy, scalability, and multilingual capabilities enhance its adaptability.




Balabolka is a Text-To-Speech (TTS) program. All computer voices installed on your system are available to Balabolka. The on-screen text can be saved as a WAV, MP3, MP4, OGG or WMA file. The program can read the clipboard content, view the text from DOC, EPUB, FB2, HTML, ODT...





X to Voice is an open-source tool that analyzes your X/Twitter profile data to generate a custom voice with the ElevenLabs Voice Design API, integrating with Hedra's video API for an innovative audio-visual experience.


Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient.

Natural Reader is a professional text to speech program that converts any written text into spoken words. The paid versions of Natural Reader have many more features.



Voice Engine is a text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.


eSpeak is a compact open source software speech synthesizer for English and other languages, for Linux and Windows.


VoiceDesi is an AI-powered voice design tool that lets you create unique, realistic voices from simple text prompts. Whether you need a playful character, a professional narrator, or a powerful mythical figure, you can generate a custom voice in seconds.

We're excited to introduce Chatterbox, Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations.

AIVocal is your all-in-one AI assistant for voice tasks, perfect for AI podcasting, speech generation, vocal editing, and voice control. From transcribing meetings to creating high-quality audio content, AIVocal makes voice work smarter and faster.

CloudTTS is a straightforward text-to-speech application. Simply type in or paste the text you'd like to hear, and it reads it back to you.
