Linux Software filtered by 'Speech Recognition'

Browse 38 Linux Software filtered by 'Speech Recognition' on AlternativeTo with popular options like Linux and Linux + Open Source.

Copy a direct link to this comment to your clipboard
  1. Whisper icon
     24 likes

    End-to-end speech recognition model trained on 680,000 hours of multitask, multilingual audio data, offering robust transcription, translation, and language identification.

    Cost / License

    • Freemium
    • Open Source (MIT)

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
    Transcribing in different languages
    Using the whisper module in Python
    Approach
    +1
    Output of whisper --help
  2. Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.

    Cost / License

    Platforms

    • Windows
    • Linux
    • Docker
    • Snapcraft
    • iPhone
    • iPad
    • Self-Hosted
    • Python
    Lemonade Server screenshot 1
    Lemonade Server screenshot 1
    Lemonade Server screenshot 2
    +1
    Lemonade Server screenshot 3
    16 alternatives
  3. Project J.A.I.son is a fully customizable AI companion server designed for streaming, private companionship, or building interactive AI applications. Run it entirely locally or leverage cloud services—the choice is yours.

    Cost / License

    • Free
    • Open Source (MIT)

    Application types

    Platforms

    • Linux
    • Mac
    • Windows
    • Self-Hosted
    • PyTorch
    • Python
    8 alternatives
  4. Picovoice icon
     1 like

    Leopard is an on-device speech-to-text engine offering private, accurate and affordable transcription experiences with zero latency. Leopard, with its small model size, can run anywhere from single-board computers such as Raspberry Pi, web browsers and servers.

    Cost / License

    Platforms

    • Mac
    • Windows
    • Linux
    • Online
    • Android
    • iPhone
    • Android Tablet
    • iPad
    • Self-Hosted
    • Google Chrome
    • Software as a Service (SaaS)
    • Firefox
    SDKs supported by Leopard Speech-to-Text
    1 alternatives
  5. Open Assisant icon
     3 likes

    Open Assistant is a private open source personal assistant system able to engage in conversations and complete an increasing amount of tasks using vocal commands.

    Cost / License

    Platforms

    • Mac
    • Windows
    • Linux
    • Raspberry Pi
    Open Assisant screenshot 1
    Open Assisant screenshot 1
    Open Assisant screenshot 2
    +9
    Open Assisant screenshot 3
  6. Gaupol icon
     16 likes

    Gaupol is an editor for text-based subtitle files. It helps you with tasks such as creating and translating subtitles, timing subtitles to match video and correcting common errors. Gaupol includes a built-in video player and also supports launching an external one.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Windows
    • Linux
    • Xfce
    Gaupol screenshot 1
    Gaupol screenshot 1
    Gaupol screenshot 2
    27 alternatives
  7. CloudTalk.io icon
     7 likes

    Cloud-based call center software with 50+ advanced features, quick setup, CRM and helpdesk integrations, centralized management, and real-time analytics.

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Mac
    • Windows
    • Linux
    • Android
    • iPhone
    • Software as a Service (SaaS)
    CloudTalk Call History & Historical Reporting - View your team's call performance and all your historical call's data, and export reports for further analysis.
    CloudTalk takes the chaos out of your phone support process.
    CloudTalk - VoIP telephony system can be integrated with your favourite CRM, Helpdesk, and eCommerce solutions.
    +3
    Integrate CRM system Pipedrive with CloudTalk to boost performance of your sales team.
    115 alternatives
  8. hns icon
     1 like

    hns is a privacy-focused open-source command-line tool for on-device speech-to-text. It records your voice, transcribes it completely locally using faster-whisper, and automatically copies the text to clipboard for immediate use in any application.

    Cost / License

    • Free
    • Open Source (MIT)

    Application type

    Platforms

    • Windows
    • Mac
    • Linux
    11 alternatives
  9. A text-based subtitles editor that supports basic operations as well as more advanced ones, aiming to become an improved version of Subtitle Workshop for every platform supported by KDE.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Windows
    • Linux
    Subtitle Composer screenshot 1
    29 alternatives
  10. SEPIA is a server-based, extendable, personal, intelligent assistant. The SEPIA-Framework is a collection of open-source modules and tools that form a coherent ecosystem for self-hosted, privacy-compliant, voice-controlled applications and devices.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
    • Android
    • iPhone
    • Chrome OS
    SEPIA Framework screenshot 1
    SEPIA Framework screenshot 1
    SEPIA Framework screenshot 2
    +1
    SEPIA Framework screenshot 3
    16 alternatives
  11. Dragonfire icon
     7 likes

    Dragonfire is an open source virtual assistant for Linux operating systems especially for Ubuntu based distributions. Dragonfire will be preinstalled software package on DragonOS Linux Distribution.

    Cost / License

    • Free
    • Open Source (MIT)

    Application type

    Alerts

    • Discontinued

    Platforms

    • Linux
    Dragonfire screenshot 1
    23 alternatives
  12. Simon is an open source speech recognition program that can replace your mouse and keyboard. The system is designed to be as flexible as possible and will work with any language or dialect.

    Cost / License

    • Free
    • Open Source

    Alerts

    • Discontinued

    Platforms

    • Windows
    • Linux
    Simon Speech Recognition screenshot 1
    15 alternatives
  13. Robust yet lenient forced-aligner built on Kaldi. A tool for aligning speech with text.

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Mac
    • Linux
    • Online
    • Self-Hosted
    • Docker
    Gentle (forced-aligner) screenshot 1
  14. Wit.ai icon
     4 likes

    Build bots easily. You tell us what your user said, we tell you what your bot should do next. Your users give us voice or text, you get back structured data.

    Cost / License

    • Free
    • Proprietary

    Platforms

    • Mac
    • Windows
    • Linux
    • Android SDK
    • Raspberry Pi
    42 alternatives
  15. Serenade icon
     1 like

    Serenade allows you to write code and do other development-related tasks using voice commands and natural speech.

    Cost / License

    • Freemium
    • Proprietary

    Platforms

    • Mac
    • Windows
    • Linux
    Serenade screenshot 1
    Serenade screenshot 1
    Serenade screenshot 2
    2 alternatives
  16. Vosk icon
     1 like

    Vosk is an offline open source speech recognition toolkit. It enables speech recognition for 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino...

    Cost / License

    Platforms

    • Windows
    • Linux
    • Mac
  17. Kara icon
     3 likes

    Simply put, Kara is a voice assistant that steals 0% of your data so you stay free! She is a actively maintained, modular, and designed to customize.

    Cost / License

    Application type

    Alerts

    • Discontinued

    Platforms

    • Mac
    • Windows
    • Linux
    14 alternatives
  18. Wryte icon
     2 likes

    Dictation software for blogger and other webworkers.

    Cost / License

    • Free
    • Open Source

    Alerts

    • Discontinued

    Platforms

    • Windows
    • Linux
    Wryte screenshot 1
    17 alternatives
  19. ConstEdit icon
     5 likes

    ConstEdit word processor is a Google Chrome / Microsoft Edge web browser extension. It writes doc in the html format, which is the standard internet webpage format. Documents written are therefore directly viewable with any web browsers.

    Cost / License

    • Free
    • Proprietary

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
    Sections Structure Dialog for managing document sections structure intuitively
    Design Html Stylesheet function of ConstEdit lets you design your own custom css stylesheet.
    Assigning a different css stylesheet to a document makes it look totally different, without any change in document content.
    +2
    Editing panel of ConstEdit.
  20. HTK icon
     2 likes

    The Hidden Markov Model Toolkit (HTK) is a portable toolkit for building and manipulating hidden Markov models. HTK is primarily used for speech recognition research.

    Cost / License

    • Free
    • Open Source

    Platforms

    • Mac
    • Windows
    • Linux
    2 alternatives
  21. Kaldi icon
     2 likes

    Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers.

    Cost / License

    • Free
    • Open Source

    Platforms

    • Mac
    • Windows
    • Linux
    20 alternatives