VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




Sceneo is described as 'Io is an AI-powered creative platform that helps teams turn ideas into high-quality visual stories, videos, and branded content in minutes' and is an app. There are more than 50 alternatives to Sceneo for a variety of platforms, including Web-based, SaaS, iPhone, Mac and Windows apps. The best Sceneo alternative is VoiceCraft, which is both free and Open Source. Other great apps like Sceneo are X to Voice, NaturalReader, Voice Engine and Pickle.
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.




Open-source tool that analyzes your X/Twitter profile data to generate a custom voice with ElevenLabs Voice Design API, integrating with Hedra's video API for an innovative audio-visual experience.


Natural Reader is a professional text to speech program that converts any written text into spoken words. The paid versions of Natural Reader have many more features.



Voice Engine is a text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.


AI clones lip-sync to your voice in real-time calls. Replace your camera on Zoom, Twitch, TikTok and more.


Meet the world's first AI model designed to generate UGC content. Mirage by Captions generates original actors with natural expressions and body language—completely free from licensing restrictions.




Audiomatic is a web app that seamlessly translates videos into other languages. Our state-of-the-art pipeline delivers contextually-accurate dubbed translations that preserve the tone, style, and emotion of the original speakers.



Vidnoz AI enables fast text-to-video creation with over 70 lifelike avatars and 100+ realistic voices. It offers pre-designed templates, subtitles, and effects—no editing skills needed. The user-friendly interface and customizable options support learning, social media, and more.




Effortlessly create videos with digital actors by inputting a script. Use AI for translating and dubbing videos, preserving voice and syncing lips seamlessly to new languages.




OneClip is an AI-driven platform designed to simplify and automate the creation of short-form, user-generated content (UGC) product demos. Its goal is to reduce the time and cost of producing engaging social videos by replacing manual filming and editing with AI-generated...




AI voice platform features 60+ emotional voices in multiple languages and accents for commercial-grade text-to-speech, supports voice cloning for personal use, offers APIs for workflow integration, enables digital preservation, and fits various audio projects.


Create engaging AI-powered videos with real actors within five minutes. Ideal for businesses, it offers 150+ avatars, auto translations in over 80 languages, and lets users create personal avatars. Perfect for enhancing content, reducing costs, and aligning with organizational goals.



