A free, open source, and extensible speech-to-text application that works completely offline.



A free, open source, and extensible speech-to-text application that works completely offline.



Vibe is an auto transcription service that utilizes local language learning models (LLMs) or artificial intelligence to provide transcriptions for a wide range of languages. The service prioritizes user privacy by offering fully offline transcription, ensuring that no data ever...

Voxtral models are state-of-the-art speech understanding models, which are available in two sizes — a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both versions are released under the Apache 2.0 license.

FUTO Voice Input is an application that lets you do speech-to-text on Android, integrating with third party keyboards or apps that use the generic speech-to-text APIs.



Voice-to-text dictation app with local Whisper models and OpenAI API. Privacy-first, cross-platform, global hotkey activated.




Still trying to take notes all by yourself? Hyprnote is an AI-powered notepad that crafts personalized meeting notes based on conversation content. With it, you'll feel like having a personal assistant that's always there to help you take notes.



Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.




Ito transforms your voice into perfect text anywhere on Mac. Speak naturally, and our AI crafts polished messages for any context.

Transcribes speech instantly with local AI for near-perfect accuracy and privacy, featuring real-time native macOS support and user-provided AI enhancements.




A privacy-first, open-source speech-to-text application that runs entirely on your machine, powered by a neural network via NVIDIA’s Parakeet model for fast, local transcription. Murmure turns your voice into text with no internet connection and zero data collection, and...

audapolis aims to make the workflow for spoken-word-heavy media editing easier, faster and more accessible.

Transcribro is a private and on-device speech recognition keyboard and service for Android.




Speakr is a personal, self-hosted web application designed for transcribing audio recordings (like meetings), generating concise summaries and titles, and interacting with the content through a chat interface. Keep all your meeting notes and insights securely on your own server.



Free, open-source, real-time dictation for Windows. Runs locally (no cloud!), uses AI, and types directly into any application via a user-friendly GUI.

Parlatype is a minimal audio player for manual speech transcription, written for the GNOME desktop environment. It plays audio sources to transcribe them in your favourite text application.

No more switching between Quicktime and Word. Pause, rewind and fast-forward without taking your hands off the keyboard. Interactive timestamps to navigate through your transcript with. Automatically saved to your browser's storage every...

hns is a privacy-focused open-source command-line tool for on-device speech-to-text. It records your voice, transcribes it completely locally using faster-whisper, and automatically copies the text to clipboard for immediate use in any application.
Locally transcribes microphone or system audio on Linux desktops using an offline AI model. No external servers or proprietary services are required.

This is Scriberr, a self-hostable AI audio transcription app. Scriber uses the Whisper models from OpenAI, to transcribe audio files offline, on your hardware.




Buzz Captions is an offline audio transcription and translation tool powered by OpenAI's Whisper model. It allows users to import audio and video files to generate transcripts in CSV, SRT, TXT and VTT formats.

TranscriberAG is designed for assisting the manual annotation of speech signals. It provides a user-friendly graphical user interface (GUI) for segmenting long duration speech recordings, transcribing them, labeling speech turns, topic changes and acoustic conditions.




Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Scripto is a free, open source tool for enabling community transcriptions of document and multimedia files. It is designed for institutions and organizations such as libraries and museums engaging in a range of large- and small-scale collaborative transcription projects as well...





