Files
2025-07-18 23:13:11 +02:00

146 lines
14 KiB
Plaintext
Raw Permalink Blame History

This file contains invisible Unicode characters
This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.



 
 



 



 






Contents
- Official (#official)
- Model variants (#model-variants)
- Apps (#apps)
- Web apps (#web-apps)
- CLI tools (#cli-tools)
- Playgrounds (#playgrounds)
- Packages (#packages)
- Articles (#articles)
- Videos (#videos)
- Community (#community)
- Third-party APIs (#third-party-apis)
- Related lists (#related-lists)
Official
- Introduction (https://openai.com/research/whisper)
- Source code (https://github.com/openai/whisper)
- White paper (https://cdn.openai.com/papers/whisper.pdf)
Model variants
- Whisper.cpp (https://github.com/ggerganov/whisper.cpp) - Port of Whisper in C++.
- **Bindings for many languages** (https://github.com/ggerganov/whisper.cpp#bindings)
- WhisperX (https://github.com/m-bain/whisperX) - Adds fast automatic speaker recognition with word-level timestamps and speaker diarization.
- faster-whisper (https://github.com/guillaumekln/faster-whisper) - Faster reimplementation of Whisper using CTranslate2.
- Whisper JAX (https://github.com/sanchit-gandhi/whisper-jax) - JAX implementation of Whisper for up to 70x speed-up on TPU.
- whisper-timestamped (https://github.com/linto-ai/whisper-timestamped) - Adds word-level timestamps and confidence scores.
- whisper-openvino (https://github.com/zhuzilin/whisper-openvino) - Whisper running on OpenVINO.
- whisper.tflite (https://github.com/usefulsensors/openai-whisper) - Whisper running on TensorFlow Lite.
- Whisper variants (https://huggingface.co/models?other=whisper) - Various Whisper variants on Hugging Faces.
- Whisper-AT (https://github.com/YuanGongND/whisper-at) - Whisper that can recognize non-speech audio events in addition to speech.
Apps
- Aiko (https://sindresorhus.com/aiko) - Audio transcription iOS and macOS app.
- MacWhisper (https://goodsnooze.gumroad.com/l/macwhisper) - Audio transcription macOS app. (Freemium)
- Whisper Memos (https://apps.apple.com/app/id6443658039) - Audio transcription iOS app. (Freemium)
- FourYou (https://apps.apple.com/app/id1671616134) - Audio journal iOS app.
- Jojo Transcribe (https://apps.apple.com/app/id1659864300) - Audio transcription macOS app.
- Buzz (https://github.com/chidiwilliams/Buzz) - Audio transcription and translation macOS app.
- WhisperScript (https://store.getwavery.com/l/whisperscript) - Audio transcription macOS app. (Freemium · Electron)
- Audio Podium (https://apps.apple.com/app/id6449008295) - Audio/video management macOS app.
- superwhisper (https://superwhisper.com) - Global audio transcription macOS menu bar app.
- Speech Note (https://github.com/mkiol/dsnote) - Audio transcription Linux app.
- FridayGPT (https://www.fridaygpt.app) - Dictation macOS app powered by OpenAI API.
- EasyWhisper (https://easywhisper.io) - Windows and macOS app for audio transcription and speaker diarization. (Freemium)
- Audio Note (https://audionote.app) - Real-time audio transcription on macOS and Windows. (Freemium · Electron)
- Whisper (https://github.com/woheller69/whisperIME) - Android app for transcription and translation. (FOSS)
- VoiceInk (https://github.com/Beingpax/VoiceInk) - Dictation and transcription macOS app. (FOSS)
Web apps
Hosted
- bigWav (https://bigwav.app) - Audio transcription and annotation tool.
- Free Podcast Transcription (https://freepodcasttranscription.com) - Runs locally in your browser.
- Gladia (https://www.gladia.io) - Transcription with real-time processing.
Self-hosted
- Subs AI (https://github.com/abdeladim-s/subsai) - Subtitle generation.
- WaaS (https://github.com/schibsted/WAAS) - GUI and API for Whisper.
- writeout.ai (https://github.com/beyondcode/writeout.ai) - Laravel app to transcribe and translate audio files.
- Meeper (https://github.com/pas1ko/meeper) - Transcriptions, summary and more for meetings and any browser tab. (Chrome app)
CLI tools
- yt-whisper (https://github.com/m1guelpf/yt-whisper) - YouTube subtitle generation.
- phonix (https://github.com/platisd/phonix) - Generate captions for videos.
- whisper-standalone-win (https://github.com/Purfview/whisper-standalone-win) - Standalone Windows executable for Whisper and Faster Whisper.
- whisper-ctranslate2 (https://github.com/Softcatala/whisper-ctranslate2) - Whisper command-line tool based on CTranslate2, compatible with the original.
- insanely-fast-whisper-cli (https://github.com/ochen1/insanely-fast-whisper-cli) - Achieve transcription speeds near 30x real-time with several optimizations.
- whisper-diarization (https://github.com/MahmoudAshraf97/whisper-diarization) - Automatic speech recognition with speaker diarization.
Playgrounds
- Hugging Faces (https://huggingface.co/spaces/openai/whisper) - Whisper demo running on Hugging Faces. (Source (https://huggingface.co/spaces/openai/whisper/tree/main))
- Monster API (https://whisperui.monsterapi.ai) - Whisper demo running on Monster API. (Source (https://github.com/saharmor/whisper-playground))
- Web Whisper (https://whisper.r3d.red) - Whisper demo by Pluja. (Source (https://codeberg.org/pluja/web-whisper))
- YouTube Video Transcription (https://github.com/ArthurFDLR/whisper-youtube) - Running on Colab.
Packages
JavaScript
- use-whisper (https://github.com/chengsokdara/use-whisper) - React hook.
Articles
- Whispers of A.I.'s Modular Future (https://www.newyorker.com/tech/annals-of-technology/whispers-of-ais-modular-future) - The future of machine learning lies in adaptable and accessible open-source speech-transcription programs.
- How to Run Whisper Speech Recognition Model (https://www.assemblyai.com/blog/how-to-run-openais-whisper-speech-recognition-model/) - Explains how to install and run the model, as well as providing a performance analysis comparing Whisper to 
other models.
- Create your own speech to text app using Flask (https://blog.paperspace.com/whisper-openai-flask-application-deployment/) - The tutorial demonstrates Whisper's speech-to-text model, with a demo on running it in a Gradient Notebook and a guide 
for setting up a Flask app with Gradient Deployments.
- Convert Podcasts to Text (https://betterprogramming.pub/openais-whisper-tutorial-42140dd696ee) - Tutorial on the Whisper API with Python for speech-to-text transcription, showcasing GPU's faster transcription and advanced technology.
Videos
- Open AI's Whisper is Amazing! (https://www.youtube.com/watch?v=OCBZtgQGt1I) - Introduction to Whisper.
- How to do Free Speech-to-Text Transcription Better Than Google Premium API (https://www.youtube.com/watch?v=msj3wuYf3d8) - Tutorial.
- Multilingual AI Speech Recognition Live App (https://www.youtube.com/watch?v=ywIyc8l1K1Q) - Tutorial.
Community
- Discussions (https://github.com/openai/whisper/discussions)
- Discord (https://discord.com/invite/openai)
Third-party APIs
APIs that use Whisper.
- Whisper+ (https://www.oneai.com/speech-to-text) - Extension of the Whisper model which adds powerful features such as speaker identification custom vocabulary, summarization, and chapter generation.
- Replicate (https://replicate.com/openai/whisper) - Use Whisper running on Replicate.
Related lists
- awesome-chatgpt (https://github.com/sindresorhus/awesome-chatgpt) - ChatGPT resources.
whisper Github: https://github.com/sindresorhus/awesome-whisper