Free, open source speech-to-text app for desktop, works fully offline, prioritizes privacy, extensibility, cross-platform support, and accessibility.



Free, open source speech-to-text app for desktop, works fully offline, prioritizes privacy, extensibility, cross-platform support, and accessibility.



Provides fully offline audio and video transcription with local AI models, batch processing, real-time preview, multi-format export, translation, and privacy on Windows, Linux, and macOS.




State-of-the-art speech models with transcription, translation, and audio understanding, available via API or self-hosted, optimized for cost and efficiency.

FUTO Voice Input is an application that lets you do speech-to-text on Android, integrating with third party keyboards or apps that use the generic speech-to-text APIs.



Dictation app transcribes speech with multiple AI backends, supports local and cloud processing, hotkey activation, auto-paste, and privacy-focused controls.




Still trying to take notes all by yourself? Hyprnote is an AI-powered notepad that crafts personalized meeting notes based on conversation content. With it, you'll feel like having a personal assistant that's always there to help you take notes.



End-to-end speech recognition model trained on 680,000 hours of multitask, multilingual audio data, offering robust transcription, translation, and language identification.




Private macOS speech-to-text tool with offline system-wide dictation, audio and video file transcription, batch processing, profiles, automation, and subtitle export.




A privacy-first, open-source speech-to-text application that runs entirely on your machine, powered by a neural network via NVIDIA’s Parakeet model for fast, local transcription. Murmure turns your voice into text with no internet connection and zero data collection, and...

Transcribes speech instantly with local AI for near-perfect accuracy and privacy, featuring real-time native macOS support and user-provided AI enhancements.




audapolis aims to make the workflow for spoken-word-heavy media editing easier, faster and more accessible.

Transcribro is a private and on-device speech recognition keyboard and service for Android.




Speakr is a personal, self-hosted web application designed for transcribing audio recordings (like meetings), generating concise summaries and titles, and interacting with the content through a chat interface. Keep all your meeting notes and insights securely on your own server.



Parlatype is a minimal audio player for manual speech transcription, written for the GNOME desktop environment. It plays audio sources to transcribe them in your favourite text application.

Free, open-source, real-time dictation for Windows. Runs locally (no cloud!), uses AI, and types directly into any application via a user-friendly GUI.

No more switching between Quicktime and Word. Pause, rewind and fast-forward without taking your hands off the keyboard. Interactive timestamps to navigate through your transcript with. Automatically saved to your browser's storage every...

Locally transcribes microphone or system audio on Linux desktops using an offline AI model. No external servers or proprietary services are required.

hns is a privacy-focused open-source command-line tool for on-device speech-to-text. It records your voice, transcribes it completely locally using faster-whisper, and automatically copies the text to clipboard for immediate use in any application.
Buzz Captions is an offline audio transcription and translation tool powered by OpenAI's Whisper model. It allows users to import audio and video files to generate transcripts in CSV, SRT, TXT and VTT formats.

This is Scriberr, a self-hostable AI audio transcription app. Scriber uses the Whisper models from OpenAI, to transcribe audio files offline, on your hardware.




HyperWhisper is a lightweight desktop application that provides real-time audio transcription supporting both on device local transcription or cloud based transcription. Record your voice, get instant transcriptions, and optionally auto-type the text directly into any...


TranscriberAG is designed for assisting the manual annotation of speech signals. It provides a user-friendly graphical user interface (GUI) for segmenting long duration speech recordings, transcribing them, labeling speech turns, topic changes and acoustic conditions.




Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Scripto is a free, open source tool for enabling community transcriptions of document and multimedia files. It is designed for institutions and organizations such as libraries and museums engaging in a range of large- and small-scale collaborative transcription projects as well...

