Files
awesome-awesomeness/html/whisper.md2.html
2025-07-18 23:13:11 +02:00

232 lines
9.6 KiB
HTML
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
<div data-align="center">
<pre><code>&lt;br&gt;
&lt;br&gt;
&lt;div&gt;
&lt;img src=&quot;media/logo.png&quot; alt=&quot;Awesome Whisper&quot;&gt;
&lt;br&gt;
&lt;/div&gt;
&lt;br&gt;
&lt;p&gt;
&lt;a href=&quot;https://openai.com/research/whisper&quot;&gt;Whisper&lt;/a&gt; is an open-source AI-powered speech recognition system developed by &lt;a href=&quot;https://openai.com&quot;&gt;OpenAI&lt;/a&gt;
&lt;/p&gt;
&lt;br&gt;
&lt;a href=&quot;https://awesome.re&quot;&gt;
&lt;img src=&quot;https://awesome.re/badge-flat2.svg&quot; alt=&quot;Awesome&quot;&gt;
&lt;/a&gt;
&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
&lt;br&gt;</code></pre>
</div>
<h2 id="contents">Contents</h2>
<ul>
<li><a href="#official">Official</a></li>
<li><a href="#model-variants">Model variants</a></li>
<li><a href="#apps">Apps</a></li>
<li><a href="#web-apps">Web apps</a></li>
<li><a href="#cli-tools">CLI tools</a></li>
<li><a href="#playgrounds">Playgrounds</a></li>
<li><a href="#packages">Packages</a></li>
<li><a href="#articles">Articles</a></li>
<li><a href="#videos">Videos</a></li>
<li><a href="#community">Community</a></li>
<li><a href="#third-party-apis">Third-party APIs</a></li>
<li><a href="#related-lists">Related lists</a></li>
</ul>
<h2 id="official">Official</h2>
<ul>
<li><a href="https://openai.com/research/whisper">Introduction</a></li>
<li><a href="https://github.com/openai/whisper">Source code</a></li>
<li><a href="https://cdn.openai.com/papers/whisper.pdf">White
paper</a></li>
</ul>
<h2 id="model-variants">Model variants</h2>
<ul>
<li><a href="https://github.com/ggerganov/whisper.cpp">Whisper.cpp</a> -
Port of Whisper in C++.
<ul>
<li><a href="https://github.com/ggerganov/whisper.cpp#bindings">Bindings
for many languages</a></li>
</ul></li>
<li><a href="https://github.com/m-bain/whisperX">WhisperX</a> - Adds
fast automatic speaker recognition with word-level timestamps and
speaker diarization.</li>
<li><a
href="https://github.com/guillaumekln/faster-whisper">faster-whisper</a>
- Faster reimplementation of Whisper using CTranslate2.</li>
<li><a href="https://github.com/sanchit-gandhi/whisper-jax">Whisper
JAX</a> - JAX implementation of Whisper for up to 70x speed-up on
TPU.</li>
<li><a
href="https://github.com/linto-ai/whisper-timestamped">whisper-timestamped</a>
- Adds word-level timestamps and confidence scores.</li>
<li><a
href="https://github.com/zhuzilin/whisper-openvino">whisper-openvino</a>
- Whisper running on OpenVINO.</li>
<li><a
href="https://github.com/usefulsensors/openai-whisper">whisper.tflite</a>
- Whisper running on TensorFlow Lite.</li>
<li><a href="https://huggingface.co/models?other=whisper">Whisper
variants</a> - Various Whisper variants on Hugging Faces.</li>
<li><a href="https://github.com/YuanGongND/whisper-at">Whisper-AT</a> -
Whisper that can recognize non-speech audio events in addition to
speech.</li>
</ul>
<h2 id="apps">Apps</h2>
<ul>
<li><a href="https://sindresorhus.com/aiko">Aiko</a> - Audio
transcription iOS and macOS app.</li>
<li><a href="https://goodsnooze.gumroad.com/l/macwhisper">MacWhisper</a>
- Audio transcription macOS app. (Freemium)</li>
<li><a href="https://apps.apple.com/app/id6443658039">Whisper Memos</a>
- Audio transcription iOS app. (Freemium)</li>
<li><a href="https://apps.apple.com/app/id1671616134">FourYou</a> -
Audio journal iOS app.</li>
<li><a href="https://apps.apple.com/app/id1659864300">Jojo
Transcribe</a> - Audio transcription macOS app.</li>
<li><a href="https://github.com/chidiwilliams/Buzz">Buzz</a> - Audio
transcription and translation macOS app.</li>
<li><a
href="https://store.getwavery.com/l/whisperscript">WhisperScript</a> -
Audio transcription macOS app. (Freemium · Electron)</li>
<li><a href="https://apps.apple.com/app/id6449008295">Audio Podium</a> -
Audio/video management macOS app.</li>
<li><a href="https://superwhisper.com">superwhisper</a> - Global audio
transcription macOS menu bar app.</li>
<li><a href="https://github.com/mkiol/dsnote">Speech Note</a> - Audio
transcription Linux app.</li>
<li><a href="https://www.fridaygpt.app">FridayGPT</a> - Dictation macOS
app powered by OpenAI API.</li>
<li><a href="https://easywhisper.io">EasyWhisper</a> - Windows and macOS
app for audio transcription and speaker diarization. (Freemium)</li>
<li><a href="https://audionote.app">Audio Note</a> - Real-time audio
transcription on macOS and Windows. (Freemium · Electron)</li>
<li><a href="https://github.com/woheller69/whisperIME">Whisper</a> -
Android app for transcription and translation. (FOSS)</li>
<li><a href="https://github.com/Beingpax/VoiceInk">VoiceInk</a> -
Dictation and transcription macOS app. (FOSS)</li>
</ul>
<h2 id="web-apps">Web apps</h2>
<!-- ### Hosted and self-hosted -->
<h3 id="hosted">Hosted</h3>
<ul>
<li><a href="https://bigwav.app">bigWav</a> - Audio transcription and
annotation tool.</li>
<li><a href="https://freepodcasttranscription.com">Free Podcast
Transcription</a> - Runs locally in your browser.</li>
<li><a href="https://www.gladia.io">Gladia</a> - Transcription with
real-time processing.</li>
</ul>
<h3 id="self-hosted">Self-hosted</h3>
<ul>
<li><a href="https://github.com/abdeladim-s/subsai">Subs AI</a> -
Subtitle generation.</li>
<li><a href="https://github.com/schibsted/WAAS">WaaS</a> - GUI and API
for Whisper.</li>
<li><a href="https://github.com/beyondcode/writeout.ai">writeout.ai</a>
- Laravel app to transcribe and translate audio files.</li>
<li><a href="https://github.com/pas1ko/meeper">Meeper</a> -
Transcriptions, summary and more for meetings and any browser tab.
(Chrome app)</li>
</ul>
<h2 id="cli-tools">CLI tools</h2>
<ul>
<li><a href="https://github.com/m1guelpf/yt-whisper">yt-whisper</a> -
YouTube subtitle generation.</li>
<li><a href="https://github.com/platisd/phonix">phonix</a> - Generate
captions for videos.</li>
<li><a
href="https://github.com/Purfview/whisper-standalone-win">whisper-standalone-win</a>
- Standalone Windows executable for Whisper and Faster Whisper.</li>
<li><a
href="https://github.com/Softcatala/whisper-ctranslate2">whisper-ctranslate2</a>
- Whisper command-line tool based on CTranslate2, compatible with the
original.</li>
<li><a
href="https://github.com/ochen1/insanely-fast-whisper-cli">insanely-fast-whisper-cli</a>
- Achieve transcription speeds near 30x real-time with several
optimizations.</li>
<li><a
href="https://github.com/MahmoudAshraf97/whisper-diarization">whisper-diarization</a>
- Automatic speech recognition with speaker diarization.</li>
</ul>
<h2 id="playgrounds">Playgrounds</h2>
<ul>
<li><a href="https://huggingface.co/spaces/openai/whisper">Hugging
Faces</a> - Whisper demo running on Hugging Faces. (<a
href="https://huggingface.co/spaces/openai/whisper/tree/main">Source</a>)</li>
<li><a href="https://whisperui.monsterapi.ai">Monster API</a> - Whisper
demo running on Monster API. (<a
href="https://github.com/saharmor/whisper-playground">Source</a>)</li>
<li><a href="https://whisper.r3d.red">Web Whisper</a> - Whisper demo by
Pluja. (<a
href="https://codeberg.org/pluja/web-whisper">Source</a>)</li>
<li><a href="https://github.com/ArthurFDLR/whisper-youtube">YouTube
Video Transcription</a> - Running on Colab.</li>
</ul>
<h2 id="packages">Packages</h2>
<h3 id="javascript">JavaScript</h3>
<ul>
<li><a
href="https://github.com/chengsokdara/use-whisper">use-whisper</a> -
React hook.</li>
</ul>
<h2 id="articles">Articles</h2>
<ul>
<li><a
href="https://www.newyorker.com/tech/annals-of-technology/whispers-of-ais-modular-future">Whispers
of A.I.s Modular Future</a> - The future of machine learning lies in
adaptable and accessible open-source speech-transcription programs.</li>
<li><a
href="https://www.assemblyai.com/blog/how-to-run-openais-whisper-speech-recognition-model/">How
to Run Whisper Speech Recognition Model</a> - Explains how to install
and run the model, as well as providing a performance analysis comparing
Whisper to other models.</li>
<li><a
href="https://blog.paperspace.com/whisper-openai-flask-application-deployment/">Create
your own speech to text app using Flask</a> - The tutorial demonstrates
Whispers speech-to-text model, with a demo on running it in a Gradient
Notebook and a guide for setting up a Flask app with Gradient
Deployments.</li>
<li><a
href="https://betterprogramming.pub/openais-whisper-tutorial-42140dd696ee">Convert
Podcasts to Text</a> - Tutorial on the Whisper API with Python for
speech-to-text transcription, showcasing GPUs faster transcription and
advanced technology.</li>
</ul>
<h2 id="videos">Videos</h2>
<ul>
<li><a href="https://www.youtube.com/watch?v=OCBZtgQGt1I">Open AIs
Whisper is Amazing!</a> - Introduction to Whisper.</li>
<li><a href="https://www.youtube.com/watch?v=msj3wuYf3d8">How to do Free
Speech-to-Text Transcription Better Than Google Premium API</a> -
Tutorial.</li>
<li><a href="https://www.youtube.com/watch?v=ywIyc8l1K1Q">Multilingual
AI Speech Recognition Live App</a> - Tutorial.</li>
</ul>
<h2 id="community">Community</h2>
<ul>
<li><a
href="https://github.com/openai/whisper/discussions">Discussions</a></li>
<li><a href="https://discord.com/invite/openai">Discord</a></li>
</ul>
<h2 id="third-party-apis">Third-party APIs</h2>
<p><em>APIs that use Whisper.</em></p>
<ul>
<li><a href="https://www.oneai.com/speech-to-text">Whisper+</a> -
Extension of the Whisper model which adds powerful features such as
speaker identification custom vocabulary, summarization, and chapter
generation.</li>
<li><a href="https://replicate.com/openai/whisper">Replicate</a> - Use
Whisper running on Replicate.</li>
</ul>
<h2 id="related-lists">Related lists</h2>
<ul>
<li><a
href="https://github.com/sindresorhus/awesome-chatgpt">awesome-chatgpt</a>
- ChatGPT resources.</li>
</ul>
<p><a href="https://github.com/sindresorhus/awesome-whisper">whisper.md
Github</a></p>