VoiceCraft is a token-infilling neural codec language model that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data, including audiobooks, internet videos, and podcasts.




Real-Time Voice Cloning is an app described as 'Free open source AI voice cloning and text to speech synthesis. Clone a voice in 5 seconds to generate arbitrary speech in real-time'. There are more than 25 alternatives to Real-Time Voice Cloning across a variety of platforms, including web-based, SaaS, Mac, Windows, and Linux apps. The best alternative is VoiceCraft, which is both free and open source. Other great alternatives to Real-Time Voice Cloning are X to Voice, Kokoro, NaturalReader, and Voice Engine.




X to Voice is an open-source tool that analyzes your X/Twitter profile data to generate a custom voice with the ElevenLabs Voice Design API, and integrates with Hedra's video API to pair that voice with video.


Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers quality comparable to larger models while being significantly faster and more cost-efficient.

NaturalReader is a professional text-to-speech program that converts any written text into spoken words. The paid versions of NaturalReader have many more features.



Voice Engine is a text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.


SherpaTTS is an Android Text-to-Speech engine based on Next-gen Kaldi using Piper or Coqui voices.


Sing karaoke and transform any song with your AI voice. No singing skill is required: your AI voice can handle any song, even in other languages!




We're excited to introduce Chatterbox, Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations.

AIVocal is your all-in-one AI assistant for voice tasks, suited to AI podcasting, speech generation, vocal editing, and voice control. From transcribing meetings to creating high-quality audio content, AIVocal makes voice work smarter and faster.

Wondercraft AI is a tool that allows users to easily create studio-quality podcasts using generative AI technology. It eliminates the need for extensive recording and scripting by allowing users to record just a 60-second sample of their voice, which the AI uses to clone their...




AI voice platform features 60+ emotional voices in multiple languages and accents for commercial-grade text-to-speech, supports voice cloning for personal use, offers APIs for workflow integration, enables digital preservation, and fits various audio projects.


Convert text into realistic speech or short-form video with synthetic AI voices, over 700 options in 65+ languages, automatic subtitles, and fast web-based tools ideal for social, educational, e-learning, and marketing content while improving accessibility.



