End-to-end speech recognition model trained on 680,000 hours of multitask, multilingual audio data, offering robust transcription, translation, and language identification.




Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms.




Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.




Project J.A.I.son is a fully customizable AI companion server designed for streaming, private companionship, or building interactive AI applications. Run it entirely locally or leverage cloud services—the choice is yours.
Leopard is an on-device speech-to-text engine offering private, accurate and affordable transcription experiences with zero latency. With its small model size, Leopard can run anywhere, from single-board computers such as Raspberry Pi to web browsers and servers.

Open Assistant is a private, open-source personal assistant system able to engage in conversations and complete an increasing number of tasks using vocal commands.




Gaupol is an editor for text-based subtitle files. It helps you with tasks such as creating and translating subtitles, timing subtitles to match video and correcting common errors. Gaupol includes a built-in video player and also supports launching an external one.



Cloud-based call center software with 50+ advanced features, quick setup, CRM and helpdesk integrations, centralized management, and real-time analytics.





hns is a privacy-focused open-source command-line tool for on-device speech-to-text. It records your voice, transcribes it completely locally using faster-whisper, and automatically copies the text to the clipboard for immediate use in any application.
Kalliope is a modular, always-on, voice-controlled personal assistant designed for home automation.
A text-based subtitles editor that supports basic operations as well as more advanced ones, aiming to become an improved version of Subtitle Workshop for every platform supported by KDE.

SEPIA is a server-based, extendable, personal, intelligent assistant. The SEPIA-Framework is a collection of open-source modules and tools that form a coherent ecosystem for self-hosted, privacy-compliant, voice-controlled applications and devices.




Dragonfire is an open-source virtual assistant for Linux operating systems, especially Ubuntu-based distributions. Dragonfire will be a preinstalled software package on the DragonOS Linux distribution.

Simon is an open source speech recognition program that can replace your mouse and keyboard. The system is designed to be as flexible as possible and will work with any language or dialect.

Robust yet lenient forced-aligner built on Kaldi. A tool for aligning speech with text.

Build bots easily. You tell us what your user said, we tell you what your bot should do next. Your users give us voice or text, you get back structured data.
Serenade allows you to write code and do other development-related tasks using voice commands and natural speech.



Vosk is an offline open source speech recognition toolkit. It enables speech recognition for 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino...
Simply put, Kara is a voice assistant that steals 0% of your data so you stay free! She is actively maintained, modular, and designed to be customized.

ConstEdit word processor is a Google Chrome / Microsoft Edge web browser extension. It writes documents in HTML, the standard format for web pages, so they are directly viewable in any web browser.




The Hidden Markov Model Toolkit (HTK) is a portable toolkit for building and manipulating hidden Markov models. HTK is primarily used for speech recognition research.
Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers.