Open Source Apps with 'Speech Transcription' feature

All apps in Open Source Apps with 'Speech Transcription' feature category. Use the filters below to narrow down your search. 
Copy a direct link to this comment to your clipboard
  1. Handy STT icon
     55 likes

    Free, open source speech-to-text app for desktop, works fully offline, prioritizes privacy, extensibility, cross-platform support, and accessibility.

    Cost / License

    • Free
    • Open Source (MIT)

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
    Handy STT screenshot 1
    Handy STT screenshot 1
    Handy STT screenshot 2
    87 alternatives
  2. Whisper icon
     26 likes

    End-to-end speech recognition model trained on 680,000 hours of multitask, multilingual audio data, offering robust transcription, translation, and language identification.

    Cost / License

    • Freemium
    • Open Source (MIT)

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
    Transcribing in different languages
    Using the whisper module in Python
    Approach
    +1
    Output of whisper --help
  3. Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.

    Cost / License

    Platforms

    • Windows
    • Linux
    • Docker
    • Snapcraft
    • iPhone
    • iPad
    • Self-Hosted
    • Python
    Lemonade Server screenshot 1
    Lemonade Server screenshot 1
    Lemonade Server screenshot 2
    +1
    Lemonade Server screenshot 3
    17 alternatives
  4. Voiceliner icon
     5 likes

    A voice memos-like for Android and iOS. Written in Flutter. Transcription on iOS uses the native transcription APIs (mostly on-device) and on Android, uses Vosk.

    Cost / License

    Application type

    Platforms

    • Android
    • iPhone
    • iPad
    Voiceliner screenshot 1
    Voiceliner screenshot 1
    Voiceliner screenshot 2
    34 alternatives
  5. HyperWhisper icon
     1 like

    HyperWhisper is a lightweight desktop application that provides real-time audio transcription supporting both on device local transcription or cloud based transcription. Record your voice, get instant transcriptions, and optionally auto-type the text directly into any...

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Linux
    HyperWhisper screenshot 1
    HyperWhisper screenshot 1
    4 alternatives
  6. Live Captions icon
     2 likes

    Locally transcribes microphone or system audio on Linux desktops using an offline AI model. No external servers or proprietary services are required.

    Cost / License

    Platforms

    • Mac
    • Linux
    Live Captions screenshot 1
  7. Echo STT icon
     1 like

    Echo is a private, offline speech-to-text app powered by Whisper. 100% local processing, cross-platform (macOS, Windows, Linux), free and open source, privacy and local first.

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Mac
    • Windows
    • Linux
    Echo STT screenshot 1
    20 alternatives
  8. SenseVox icon
     1 like

    SenseVox is an open-source offline voice input app for Windows with a simple GUI, using the SenseVoice-Small model for efficient and accurate speech recognition.

    Cost / License

    • Free
    • Open Source

    Platforms

    • Windows
    v0.2.0
    sensevox_wx_gtcrn_with_sensevoice_int8_model.zip
    17 alternatives
  9. Praat icon
     3 likes

    Praat is a speech analysis tool used for doing phonetics by computer. Praat can analyse, synthesize, and manipulate speech, and create high-quality pictures for your publications.

    Cost / License

    • Free
    • Open Source

    Platforms

    • Mac
    • Windows
    • Linux
    • Chrome OS
    Picture generator for reports
    Sound file window
    Main window
    +2
    TextGrid with sound
    3 alternatives
  10. hns icon
     1 like

    hns is a privacy-focused open-source command-line tool for on-device speech-to-text. It records your voice, transcribes it completely locally using faster-whisper, and automatically copies the text to clipboard for immediate use in any application.

    Cost / License

    • Free
    • Open Source (MIT)

    Application type

    Platforms

    • Windows
    • Mac
    • Linux
    11 alternatives
  11. TranscriberAG icon
     5 likes

    TranscriberAG is designed for assisting the manual annotation of speech signals. It provides a user-friendly graphical user interface (GUI) for segmenting long duration speech recordings, transcribing them, labeling speech turns, topic changes and acoustic conditions.

    Cost / License

    • Free
    • Open Source

    Application type

    Alerts

    • Discontinued

    Platforms

    • Mac
    • Windows
    • Linux
    TranscriberAG screenshot 1
    TranscriberAG screenshot 1
    TranscriberAG screenshot 2
    +3
    TranscriberAG screenshot 3
    53 alternatives
  12. Robust yet lenient forced-aligner built on Kaldi. A tool for aligning speech with text.

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Mac
    • Linux
    • Online
    • Self-Hosted
    • Docker
    Gentle (forced-aligner) screenshot 1
  13. Gazelle is a joint speech-language model by Tincans — for more details and prompt ideas, see our v0.2 announcement. This is an early research preview -- please temper expectations! Gazelle can take in text and audio as input (interchangeably) and generates text as output.

    Cost / License

    Platforms

    • Online
    Gazelle Speech Language Model screenshot 1
    4 alternatives