
Whisper

An open-source, end-to-end speech recognition system built on a Transformer architecture and trained on 680,000 hours of diverse audio. It provides multilingual transcription, translation into English, language identification, and phrase-level timestamps, and performs well on real-world audio.

Transcribing in different languages

Cost / License

  • Freemium
  • Open Source (MIT)

Application type

Platforms

  • Mac
  • Windows
  • Linux

Features

  1.  Speech to text
  2.  Speech Recognition
  3.  Ad-free
  4.  Speech Transcription
  5.  AI-Powered

Comments and Reviews

   
Top Positive Comment
Will Smith

Works well and is not very hard to use if you are a developer, but the good models need a good GPU.

sciencek23

I don't speak English and this software helps me get any movie or video subtitles in my native language. Amazing!

Review by a new / low-activity user.

What is Whisper?

Whisper is a general-purpose speech recognition model. It is trained on a large, diverse audio dataset and is a multitask model that can perform multilingual speech recognition, speech translation, and language identification.

Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual, multitask supervised data collected from the web. The size and diversity of this dataset make it more robust to accents, background noise, and technical language. It transcribes speech in multiple languages and translates from those languages into English. The models and inference code are open source, so they can serve as a foundation for building applications and for further research.
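As a rough illustration of how the open-source inference code can be used, the sketch below transcribes (or translates into English) an audio file and formats the phrase-level timestamps as subtitles. It assumes the openai-whisper Python package and ffmpeg are installed; the helper names (format_timestamp, segments_to_srt, transcribe_to_srt), the "base" model choice, and the SubRip output format are illustrative choices, not part of Whisper itself.

```python
def format_timestamp(seconds: float) -> str:
    """Render a time in seconds as HH:MM:SS,mmm (SubRip subtitle style)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Turn a list of {'start', 'end', 'text'} dicts into SubRip text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        span = f"{format_timestamp(seg['start'])} --> {format_timestamp(seg['end'])}"
        blocks.append(f"{index}\n{span}\n{seg['text'].strip()}")
    return "\n\n".join(blocks) + "\n"


def transcribe_to_srt(path: str, model_name: str = "base", translate: bool = False) -> str:
    """Run Whisper on an audio file and return the result as subtitles.

    Hypothetical wrapper: the import is done lazily so the formatting
    helpers above work even when the whisper package is not installed.
    """
    import whisper  # assumption: installed via `pip install openai-whisper`

    model = whisper.load_model(model_name)
    # "translate" produces English text; "transcribe" keeps the spoken language.
    task = "translate" if translate else "transcribe"
    result = model.transcribe(path, task=task)
    return segments_to_srt(result["segments"])
```

Calling transcribe_to_srt("talk.mp3", translate=True) on a non-English recording would be expected to yield English subtitles, since translation is only into English. Larger models are generally more accurate but slower and heavier on GPU memory.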

Whisper takes an end-to-end approach, implemented as an encoder-decoder Transformer. Input audio is split into 30-second segments, each segment is converted into a log-Mel spectrogram, and the spectrogram is passed to the encoder. The decoder is trained to predict the corresponding text caption, intermixed with special tokens that direct the single model to perform tasks such as language identification, phrase-level timestamps, multilingual transcription, and translation into English. About a third of Whisper’s audio dataset is non-English, and during training the model is alternately given the task of transcribing in the original language or translating into English. This approach is particularly effective for speech-to-text translation: in a zero-shot setting, Whisper outperforms the supervised state of the art on CoVoST2 translation into English.
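The segmenting and task-selection steps described above can be sketched in plain Python. This is a simplified illustration, not Whisper's actual code: it assumes 16 kHz mono audio, uses fixed non-overlapping windows (the real inference code slides its window based on predicted timestamps), and skips the log-Mel spectrogram step entirely. The function names are hypothetical; the special-token strings follow the format used by the released tokenizer.

```python
SAMPLE_RATE = 16_000      # assumption: audio resampled to 16 kHz mono
CHUNK_SECONDS = 30        # from the description: 30-second segments
CHUNK_SAMPLES = SAMPLE_RATE * CHUNK_SECONDS


def split_into_chunks(samples, chunk_size=CHUNK_SAMPLES):
    """Split a sample sequence into fixed-size windows.

    The final window is zero-padded so every chunk has the same length,
    which is what a fixed-size encoder input requires.
    """
    chunks = []
    for start in range(0, len(samples), chunk_size):
        chunk = list(samples[start:start + chunk_size])
        chunk.extend([0.0] * (chunk_size - len(chunk)))
        chunks.append(chunk)
    return chunks


def decoder_prompt(language="en", task="transcribe", timestamps=True):
    """Build the special-token prefix that tells the decoder what to do.

    When the language is unknown, the model itself predicts the language
    token, which is how language identification is performed.
    """
    if task not in ("transcribe", "translate"):
        raise ValueError(f"unknown task: {task!r}")
    tokens = ["<|startoftranscript|>", f"<|{language}|>", f"<|{task}|>"]
    if not timestamps:
        tokens.append("<|notimestamps|>")
    return tokens
```

For example, decoder_prompt("fr", "translate") describes a request to translate French speech into English with timestamps, while split_into_chunks applied to 70 seconds of audio produces three 30-second windows, the last one mostly padding.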

Whisper information

AlternativeTo Category

Audio & Music

GitHub repository

  •  95,137 Stars
  •  11,802 Forks
  •  114 Open Issues

Our users have written 2 comments and reviews about Whisper, and it has received 24 likes.

Whisper was added to AlternativeTo by HeyNow.