



Voice Engine is described as 'Text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker' and is a Text to Speech service in the AI tools & services category. There are more than 25 alternatives to Voice Engine, including websites and apps for a variety of platforms such as SaaS, Windows, iPhone and Mac. The best Voice Engine alternative is VoiceCraft, which is both free and open source. Other great sites and apps similar to Voice Engine are ElevenLabs, X to Voice, AIVocal and Wondera.
VoiceCraft is a token-infilling neural codec language model that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data, including audiobooks, internet videos, and podcasts.




ElevenLabs uses AI to deliver natural, expressive speech for diverse applications such as podcasts and videos. It features a user-friendly interface, customizable intonation, and seamless API integration. Privacy, scalability, and multilingual capabilities enhance its adaptability.




X to Voice is an open-source tool that analyzes your X/Twitter profile data to generate a custom voice with the ElevenLabs Voice Design API, integrating with Hedra's video API for an innovative audio-visual experience.


AIVocal is your all-in-one AI assistant for voice tasks—perfect for AI podcasting, speech generation, vocal editing, and voice control. From transcribing meetings to creating high-quality audio content, AIVocal makes voice work smarter and faster.

Sing karaoke and transform any song into your AI voice. No singing skill is required: your AI voice can handle any song, even in other languages!




We're excited to introduce Chatterbox, Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations.
Wondercraft AI is a tool that allows users to easily create studio-quality podcasts using generative AI technology. It eliminates the need for extensive recording and scripting by allowing users to record just a 60-second sample of their voice, which the AI uses to clone their...




Choose from 60+ human-like, emotional voices in various accents, languages, and characters to turn any text into commercial-grade audio, or clone your own voice.


TTSMaker is a free text-to-speech tool that provides speech synthesis services and supports multiple languages (English, French, German, Spanish, Arabic, Chinese, Japanese, Korean, Vietnamese...) and a variety of voice styles. You can use it to read text and e-books aloud, and can...

Amazon Polly uses deep learning technologies to synthesize natural-sounding human speech, so you can convert articles to speech. With dozens of lifelike voices across a broad set of languages, use Amazon Polly to build speech-activated applications.



Voicebox is a state-of-the-art speech generative model built upon Meta’s non-autoregressive flow matching model. By learning to solve a text-guided speech infilling task with large-scale data, Voicebox outperforms single-purpose AI models across speech tasks through...

Transforms text into professional, browser-based HD videos using AI, offering 300+ voices in 40+ languages, scene merging, customizable visuals and music, quick production, unlimited downloads, and easy collaboration for marketing, training, or onboarding purposes.



