Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient.

Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient.



Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!.



AI receptionist answers calls 24/7, books appointments, captures leads, qualifies callers, answers FAQs, and sends summaries by text and email after every call.




KDAN PDF is a comprehensive, AI-powered document solution that enables professional PDF editing, conversion, and intelligence analysis across Windows, Mac, iOS, and Android.




Voice Engine is a text-to-voice generation platform from OpenAI, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.


Open-source AI assistant creates interactive UI tools like buttons and forms in chat, supports 24 services, offline mode, free tier, and encrypted local storage.




eSpeak is a compact open source software speech synthesizer for English and other languages, for Linux and Windows.


An open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system.




Convert text into realistic speech with smooth, natural synthesis, export fine-tuned audio files, and download high-quality output for various uses.




Anx Reader, a thoughtfully crafted e-book reader for book lovers. Featuring powerful AI capabilities and supporting various e-book formats, it makes reading smarter and more focused. With its modern interface design, we're committed to delivering pure reading pleasure.




PopTranslate is an AI-powered Mac app that instantly translates selected text, compares translations, and OCR.



Audiomatic is a web app that seamlessly translates videos into other languages. Our state-of-the-art pipeline delivers contextually-accurate dubbed translations that preserve the tone, style, and emotion of the original speakers.



We're excited to introduce Chatterbox, Resemble AI's first production-grade open source TTS model. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations.
Transform text into videos swiftly with lifelike avatars and 100+ realistic voices. Benefit from pre-designed templates and an intuitive interface.




Wondercraft AI is a tool that allows users to easily create studio-quality podcasts using generative AI technology. It eliminates the need for extensive recording and scripting by allowing users to record just a 60-second sample of their voice, which the AI uses to clone their...




NextUp.com develops Windows text to speech (TTS) software applications like TextAloud that let your computer talk with AT&T Natural Voices. TextAloud can also be in Microsoft Word as a plug-in.

Create videos with digital actors by typing a script. Translate and dub videos with AI, preserving original voice and syncing lip movements.




CloudTTS is a straightforward text-to-speech application. Simply type in or paste the text you'd like to hear, and it reads it back to you.

BlacktoothAI is an all-in-one AI platform for content generation, including text and images. It offers access to various AI tools like ChatGPT, Claude, Gemini, and more under one subscription. Features include an advanced dashboard, a library of templates and chatbots, and...

Expand to new markets by instantly translating your documents, apps, and webpages. Create multilingual chatbots to communicate with your customers on their terms.

WhisperTyping is voice typing software using the Whisper model for the best-in-class dictation experience. Make use of it’s AI modes to write better and faster, get answers to pending questions and run commands, all by using your voice.




Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms.




🌟 An AI desktop pet with long-term memory, expressive character sprites, computer control, and voice features—perfect for Galgame-style characters 🌟

Digital Life Project 2 (DLP3D) is an open-source real-time framework that brings Large Language Models (LLMs) to life through expressive 3D avatars. Users converse naturally by voice, while characters respond on demand with unified audio, whole-body animation, and physics...



