Cost / License
- Free
- Open Source (Apache-2.0)
Platforms
- Self-Hosted
- Python
- Docker
- PyTorch




Includes a media player, a sound recorder, a speech recognition program, a text-to-speech reader, an audio/video converter, an audio joiner, FFmpeg, and an audio editor.



Voisi is an innovative AI-powered tool designed to revolutionize the creation of audio content by leveraging advanced voice synthesis technology. With an extensive library boasting hundreds of lifelike options, users can effortlessly create captivating content in multiple languages.




VoxWrite is a Chrome Extension that turns your speech into clear, professional text. It automatically cleans up, formats, and prepares your words so they’re ready to use. Just speak naturally in any language, set your preferences once, and watch your speech become polished...




AddSubtitle gives creators full control over how their message meets the world. Subtitles, voiceover, and translation come together in one tool to speed up the video workflow. Experience the perfect balance of efficiency and creative control.



Local, GPU-accelerated dictation for Windows. Hold CapsLock (finally a use for that key), speak, text appears at your cursor. Fully offline — no cloud, no subscription.


WordWand is a system-wide AI assistant for macOS that works in any app through a single keyboard shortcut. No copy-pasting, no tab switching — just select text, press a hotkey, and transform it instantly.



Transcribbit turns WhatsApp voice notes into clean, readable text instantly - no app required. Forward any voice message to Transcribbit's WhatsApp contact, and receive an accurate transcript back directly in your chat.




Translate any text or voice message into 80+ languages for free with Voice Messages: All Language Translator 2020. Are you looking for the best Voice Message 2020 - Speech to Text & Language Translator app?




This is an online tool for converting audio voice files (mp3, wav, ogg, wma, etc.) to text.
It is based on CMU Sphinx, an open source speech recognition toolkit from CMU. The tool is free and runs online.

Summarizes PDFs, transcribes lectures and videos, creates flashcards and quizzes, generates study plans, and manages notes from diverse input types efficiently.




Voiser offers high-quality text-to-speech and speech-to-text services in 75+ languages, helping businesses and individuals save time and resources while enhancing their content's accessibility and user experience.




Fully local, easy-to-use AI transcription tool based on OpenAI Whisper. Transcribe audio & video offline with high accuracy and without uploading or sharing your media data.



Gazelle is a joint speech-language model by Tincans. For more details and prompt ideas, see our v0.2 announcement. This is an early research preview, so please temper expectations! Gazelle can take in text and audio as input (interchangeably) and generate text as output.

Speech2Math Calculator translates your speech into mathematical expressions. It is a useful talking voice calculator application that lets you calculate easily by speaking. Speech2Math Calculator also has an on-screen keyboard which allows you to edit...


Voicetonotes.ai is an AI-powered voice-to-text transcription tool that converts spoken words into accurate, structured notes in real time.

This Chrome extension is useful for content creators, journalists, students, and professionals who need to transcribe audio recordings. It integrates with the Chrome browser for easy use. Its features include one-click recording, automatic transcription using AI, transcription...

A fast, simple transcription app that runs fully offline on your Mac. No cloud, no uploads, no servers. Your audio stays on your device.




AI-powered speech-to-text platform that converts audio and video into accurate transcripts, captions, and translations in 100+ languages.




Juxano is an AI meeting and interview assistant that records, transcribes, and analyzes conversations in real time. It delivers fast summaries, deep structured insights, and searchable memory, helping professionals review performance, extract key points, and retain critical knowledge.

(EdTech) focused on helping English students improve their accent, pronunciation, and clear communication skills through artificial intelligence.




AI tool that converts audio and video to text in 100+ languages, with speaker identification, transcript timestamps, instant translation, and cloud storage.




Voibe is the fastest dictation app for Mac, built for developers and AI power users. It’s highly accurate, runs locally, and is fully private—working entirely offline across all apps. With deep Cursor integration, it resolves files and folders you speak.




TiniText turns audio recordings into speaker-labeled transcripts, summaries, key insights, and action items you can edit, share, and export.


Voice-first AI for meeting notes, voice notes, and dictation. 5× faster than typing. Just speak, and it's done.
