Free, open source speech-to-text app for desktop, works fully offline, prioritizes privacy, extensibility, cross-platform support, and accessibility.



Free, open source speech-to-text app for desktop, works fully offline, prioritizes privacy, extensibility, cross-platform support, and accessibility.



State-of-the-art speech models with transcription, translation, and audio understanding, available via API or self-hosted, optimized for cost and efficiency.

FUTO Voice Input is an application that lets you do speech-to-text on Android, integrating with third party keyboards or apps that use the generic speech-to-text APIs.



Still trying to take notes all by yourself? Hyprnote is an AI-powered notepad that crafts personalized meeting notes based on conversation content. With it, you'll feel like having a personal assistant that's always there to help you take notes.



End-to-end speech recognition model trained on 680,000 hours of multitask, multilingual audio data, offering robust transcription, translation, and language identification.




Streamline your meetings with real-time suggestions. Summarize key points, recap late joins, and generate follow-up emails effortlessly.




Transcribes speech instantly with local AI for near-perfect accuracy and privacy, featuring real-time native macOS support and user-provided AI enhancements.




Transcribro is a private and on-device speech recognition keyboard and service for Android.




Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.




Parlatype is a minimal audio player for manual speech transcription, written for the GNOME desktop environment. It plays audio sources to transcribe them in your favourite text application.

No more switching between Quicktime and Word. Pause, rewind and fast-forward without taking your hands off the keyboard. Interactive timestamps to navigate through your transcript with. Automatically saved to your browser's storage every...

A voice memos-like for Android and iOS. Written in Flutter. Transcription on iOS uses the native transcription APIs (mostly on-device) and on Android, uses Vosk.



HyperWhisper is a lightweight desktop application that provides real-time audio transcription supporting both on device local transcription or cloud based transcription. Record your voice, get instant transcriptions, and optionally auto-type the text directly into any...


Locally transcribes microphone or system audio on Linux desktops using an offline AI model. No external servers or proprietary services are required.

LLM Hub is an open-source Android app for on-device LLM chat and image generation. It's optimized for mobile usage (CPU/GPU/NPU acceleration) and supports multiple model formats so you can run powerful models locally and privately.




SenseVox is an open-source offline voice input app for Windows with a simple GUI, using the SenseVoice-Small model for efficient and accurate speech recognition.


Praat is a speech analysis tool used for doing phonetics by computer. Praat can analyse, synthesize, and manipulate speech, and create high-quality pictures for your publications.




hns is a privacy-focused open-source command-line tool for on-device speech-to-text. It records your voice, transcribes it completely locally using faster-whisper, and automatically copies the text to clipboard for immediate use in any application.
TranscriberAG is designed for assisting the manual annotation of speech signals. It provides a user-friendly graphical user interface (GUI) for segmenting long duration speech recordings, transcribing them, labeling speech turns, topic changes and acoustic conditions.




Robust yet lenient forced-aligner built on Kaldi. A tool for aligning speech with text.


Gazelle is a joint speech-language model by Tincans — for more details and prompt ideas, see our v0.2 announcement. This is an early research preview -- please temper expectations! Gazelle can take in text and audio as input (interchangeably) and generates text as output.




