146 lines
14 KiB
Plaintext
146 lines
14 KiB
Plaintext
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m [49m[39m
|
||
[48;5;235m[38;5;249m [49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m [49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m [49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
[48;5;235m[38;5;249m[49m[39m
|
||
|
||
|
||
[38;2;255;187;0m[4mContents[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mOfficial[0m[38;5;12m (#official)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mModel variants[0m[38;5;12m (#model-variants)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mApps[0m[38;5;12m (#apps)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWeb apps[0m[38;5;12m (#web-apps)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mCLI tools[0m[38;5;12m (#cli-tools)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mPlaygrounds[0m[38;5;12m (#playgrounds)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mPackages[0m[38;5;12m (#packages)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mArticles[0m[38;5;12m (#articles)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mVideos[0m[38;5;12m (#videos)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mCommunity[0m[38;5;12m (#community)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mThird-party APIs[0m[38;5;12m (#third-party-apis)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mRelated lists[0m[38;5;12m (#related-lists)[39m
|
||
|
||
[38;2;255;187;0m[4mOfficial[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mIntroduction[0m[38;5;12m (https://openai.com/research/whisper)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mSource code[0m[38;5;12m (https://github.com/openai/whisper)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWhite paper[0m[38;5;12m (https://cdn.openai.com/papers/whisper.pdf)[39m
|
||
|
||
[38;2;255;187;0m[4mModel variants[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mWhisper.cpp[0m[38;5;12m (https://github.com/ggerganov/whisper.cpp) - Port of Whisper in C++.[39m
|
||
[48;5;235m[38;5;249m- **Bindings for many languages** (https://github.com/ggerganov/whisper.cpp#bindings)[49m[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWhisperX[0m[38;5;12m (https://github.com/m-bain/whisperX) - Adds fast automatic speaker recognition with word-level timestamps and speaker diarization.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mfaster-whisper[0m[38;5;12m (https://github.com/guillaumekln/faster-whisper) - Faster reimplementation of Whisper using CTranslate2.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWhisper JAX[0m[38;5;12m (https://github.com/sanchit-gandhi/whisper-jax) - JAX implementation of Whisper for up to 70x speed-up on TPU.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mwhisper-timestamped[0m[38;5;12m (https://github.com/linto-ai/whisper-timestamped) - Adds word-level timestamps and confidence scores.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mwhisper-openvino[0m[38;5;12m (https://github.com/zhuzilin/whisper-openvino) - Whisper running on OpenVINO.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mwhisper.tflite[0m[38;5;12m (https://github.com/usefulsensors/openai-whisper) - Whisper running on TensorFlow Lite.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWhisper variants[0m[38;5;12m (https://huggingface.co/models?other=whisper) - Various Whisper variants on Hugging Faces.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWhisper-AT[0m[38;5;12m (https://github.com/YuanGongND/whisper-at) - Whisper that can recognize non-speech audio events in addition to speech.[39m
|
||
|
||
[38;2;255;187;0m[4mApps[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mAiko[0m[38;5;12m (https://sindresorhus.com/aiko) - Audio transcription iOS and macOS app.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mMacWhisper[0m[38;5;12m (https://goodsnooze.gumroad.com/l/macwhisper) - Audio transcription macOS app. (Freemium)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWhisper Memos[0m[38;5;12m (https://apps.apple.com/app/id6443658039) - Audio transcription iOS app. (Freemium)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mFourYou[0m[38;5;12m (https://apps.apple.com/app/id1671616134) - Audio journal iOS app.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mJojo Transcribe[0m[38;5;12m (https://apps.apple.com/app/id1659864300) - Audio transcription macOS app.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mBuzz[0m[38;5;12m (https://github.com/chidiwilliams/Buzz) - Audio transcription and translation macOS app.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWhisperScript[0m[38;5;12m (https://store.getwavery.com/l/whisperscript) - Audio transcription macOS app. (Freemium · Electron)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mAudio Podium[0m[38;5;12m (https://apps.apple.com/app/id6449008295) - Audio/video management macOS app.[39m
|
||
[38;5;12m- [39m[38;5;14m[1msuperwhisper[0m[38;5;12m (https://superwhisper.com) - Global audio transcription macOS menu bar app.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mSpeech Note[0m[38;5;12m (https://github.com/mkiol/dsnote) - Audio transcription Linux app.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mFridayGPT[0m[38;5;12m (https://www.fridaygpt.app) - Dictation macOS app powered by OpenAI API.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mEasyWhisper[0m[38;5;12m (https://easywhisper.io) - Windows and macOS app for audio transcription and speaker diarization. (Freemium)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mAudio Note[0m[38;5;12m (https://audionote.app) - Real-time audio transcription on macOS and Windows. (Freemium · Electron)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWhisper[0m[38;5;12m (https://github.com/woheller69/whisperIME) - Android app for transcription and translation. (FOSS)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mVoiceInk[0m[38;5;12m (https://github.com/Beingpax/VoiceInk) - Dictation and transcription macOS app. (FOSS)[39m
|
||
|
||
[38;2;255;187;0m[4mWeb apps[0m
|
||
|
||
|
||
|
||
[38;2;255;187;0m[4mHosted[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mbigWav[0m[38;5;12m (https://bigwav.app) - Audio transcription and annotation tool.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mFree Podcast Transcription[0m[38;5;12m (https://freepodcasttranscription.com) - Runs locally in your browser.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mGladia[0m[38;5;12m (https://www.gladia.io) - Transcription with real-time processing.[39m
|
||
|
||
[38;2;255;187;0m[4mSelf-hosted[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mSubs AI[0m[38;5;12m (https://github.com/abdeladim-s/subsai) - Subtitle generation.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWaaS[0m[38;5;12m (https://github.com/schibsted/WAAS) - GUI and API for Whisper.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mwriteout.ai[0m[38;5;12m (https://github.com/beyondcode/writeout.ai) - Laravel app to transcribe and translate audio files.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mMeeper[0m[38;5;12m (https://github.com/pas1ko/meeper) - Transcriptions, summary and more for meetings and any browser tab. (Chrome app)[39m
|
||
|
||
[38;2;255;187;0m[4mCLI tools[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1myt-whisper[0m[38;5;12m (https://github.com/m1guelpf/yt-whisper) - YouTube subtitle generation.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mphonix[0m[38;5;12m (https://github.com/platisd/phonix) - Generate captions for videos.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mwhisper-standalone-win[0m[38;5;12m (https://github.com/Purfview/whisper-standalone-win) - Standalone Windows executable for Whisper and Faster Whisper.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mwhisper-ctranslate2[0m[38;5;12m (https://github.com/Softcatala/whisper-ctranslate2) - Whisper command-line tool based on CTranslate2, compatible with the original.[39m
|
||
[38;5;12m- [39m[38;5;14m[1minsanely-fast-whisper-cli[0m[38;5;12m (https://github.com/ochen1/insanely-fast-whisper-cli) - Achieve transcription speeds near 30x real-time with several optimizations.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mwhisper-diarization[0m[38;5;12m (https://github.com/MahmoudAshraf97/whisper-diarization) - Automatic speech recognition with speaker diarization.[39m
|
||
|
||
[38;2;255;187;0m[4mPlaygrounds[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mHugging Faces[0m[38;5;12m (https://huggingface.co/spaces/openai/whisper) - Whisper demo running on Hugging Faces. ([39m[38;5;14m[1mSource[0m[38;5;12m (https://huggingface.co/spaces/openai/whisper/tree/main))[39m
|
||
[38;5;12m- [39m[38;5;14m[1mMonster API[0m[38;5;12m (https://whisperui.monsterapi.ai) - Whisper demo running on Monster API. ([39m[38;5;14m[1mSource[0m[38;5;12m (https://github.com/saharmor/whisper-playground))[39m
|
||
[38;5;12m- [39m[38;5;14m[1mWeb Whisper[0m[38;5;12m (https://whisper.r3d.red) - Whisper demo by Pluja. ([39m[38;5;14m[1mSource[0m[38;5;12m (https://codeberg.org/pluja/web-whisper))[39m
|
||
[38;5;12m- [39m[38;5;14m[1mYouTube Video Transcription[0m[38;5;12m (https://github.com/ArthurFDLR/whisper-youtube) - Running on Colab.[39m
|
||
|
||
[38;2;255;187;0m[4mPackages[0m
|
||
|
||
[38;2;255;187;0m[4mJavaScript[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1muse-whisper[0m[38;5;12m (https://github.com/chengsokdara/use-whisper) - React hook.[39m
|
||
|
||
[38;2;255;187;0m[4mArticles[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mWhispers of A.I.'s Modular Future[0m[38;5;12m (https://www.newyorker.com/tech/annals-of-technology/whispers-of-ais-modular-future) - The future of machine learning lies in adaptable and accessible open-source speech-transcription programs.[39m
|
||
[38;5;12m-[39m[38;5;12m [39m[38;5;14m[1mHow[0m[38;5;14m[1m [0m[38;5;14m[1mto[0m[38;5;14m[1m [0m[38;5;14m[1mRun[0m[38;5;14m[1m [0m[38;5;14m[1mWhisper[0m[38;5;14m[1m [0m[38;5;14m[1mSpeech[0m[38;5;14m[1m [0m[38;5;14m[1mRecognition[0m[38;5;14m[1m [0m[38;5;14m[1mModel[0m[38;5;12m [39m[38;5;12m(https://www.assemblyai.com/blog/how-to-run-openais-whisper-speech-recognition-model/)[39m[38;5;12m [39m[38;5;12m-[39m[38;5;12m [39m[38;5;12mExplains[39m[38;5;12m [39m[38;5;12mhow[39m[38;5;12m [39m[38;5;12mto[39m[38;5;12m [39m[38;5;12minstall[39m[38;5;12m [39m[38;5;12mand[39m[38;5;12m [39m[38;5;12mrun[39m[38;5;12m [39m[38;5;12mthe[39m[38;5;12m [39m[38;5;12mmodel,[39m[38;5;12m [39m[38;5;12mas[39m[38;5;12m [39m[38;5;12mwell[39m[38;5;12m [39m[38;5;12mas[39m[38;5;12m [39m[38;5;12mproviding[39m[38;5;12m [39m[38;5;12ma[39m[38;5;12m [39m[38;5;12mperformance[39m[38;5;12m [39m[38;5;12manalysis[39m[38;5;12m [39m[38;5;12mcomparing[39m[38;5;12m [39m[38;5;12mWhisper[39m[38;5;12m [39m[38;5;12mto[39m[38;5;12m [39m
|
||
[38;5;12mother[39m[38;5;12m [39m[38;5;12mmodels.[39m
|
||
[38;5;12m-[39m[38;5;12m [39m[38;5;14m[1mCreate[0m[38;5;14m[1m [0m[38;5;14m[1myour[0m[38;5;14m[1m [0m[38;5;14m[1mown[0m[38;5;14m[1m [0m[38;5;14m[1mspeech[0m[38;5;14m[1m [0m[38;5;14m[1mto[0m[38;5;14m[1m [0m[38;5;14m[1mtext[0m[38;5;14m[1m [0m[38;5;14m[1mapp[0m[38;5;14m[1m [0m[38;5;14m[1musing[0m[38;5;14m[1m [0m[38;5;14m[1mFlask[0m[38;5;12m [39m[38;5;12m(https://blog.paperspace.com/whisper-openai-flask-application-deployment/)[39m[38;5;12m [39m[38;5;12m-[39m[38;5;12m [39m[38;5;12mThe[39m[38;5;12m [39m[38;5;12mtutorial[39m[38;5;12m [39m[38;5;12mdemonstrates[39m[38;5;12m [39m[38;5;12mWhisper's[39m[38;5;12m [39m[38;5;12mspeech-to-text[39m[38;5;12m [39m[38;5;12mmodel,[39m[38;5;12m [39m[38;5;12mwith[39m[38;5;12m [39m[38;5;12ma[39m[38;5;12m [39m[38;5;12mdemo[39m[38;5;12m [39m[38;5;12mon[39m[38;5;12m [39m[38;5;12mrunning[39m[38;5;12m [39m[38;5;12mit[39m[38;5;12m [39m[38;5;12min[39m[38;5;12m [39m[38;5;12ma[39m[38;5;12m [39m[38;5;12mGradient[39m[38;5;12m [39m[38;5;12mNotebook[39m[38;5;12m [39m[38;5;12mand[39m[38;5;12m [39m[38;5;12ma[39m[38;5;12m [39m[38;5;12mguide[39m[38;5;12m [39m
|
||
[38;5;12mfor[39m[38;5;12m [39m[38;5;12msetting[39m[38;5;12m [39m[38;5;12mup[39m[38;5;12m [39m[38;5;12ma[39m[38;5;12m [39m[38;5;12mFlask[39m[38;5;12m [39m[38;5;12mapp[39m[38;5;12m [39m[38;5;12mwith[39m[38;5;12m [39m[38;5;12mGradient[39m[38;5;12m [39m[38;5;12mDeployments.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mConvert Podcasts to Text[0m[38;5;12m (https://betterprogramming.pub/openais-whisper-tutorial-42140dd696ee) - Tutorial on the Whisper API with Python for speech-to-text transcription, showcasing GPU's faster transcription and advanced technology.[39m
|
||
|
||
[38;2;255;187;0m[4mVideos[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mOpen AI's Whisper is Amazing![0m[38;5;12m (https://www.youtube.com/watch?v=OCBZtgQGt1I) - Introduction to Whisper.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mHow to do Free Speech-to-Text Transcription Better Than Google Premium API[0m[38;5;12m (https://www.youtube.com/watch?v=msj3wuYf3d8) - Tutorial.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mMultilingual AI Speech Recognition Live App[0m[38;5;12m (https://www.youtube.com/watch?v=ywIyc8l1K1Q) - Tutorial.[39m
|
||
|
||
[38;2;255;187;0m[4mCommunity[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mDiscussions[0m[38;5;12m (https://github.com/openai/whisper/discussions)[39m
|
||
[38;5;12m- [39m[38;5;14m[1mDiscord[0m[38;5;12m (https://discord.com/invite/openai)[39m
|
||
|
||
[38;2;255;187;0m[4mThird-party APIs[0m
|
||
|
||
[48;2;30;30;40m[38;5;13m[3mAPIs that use Whisper.[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mWhisper+[0m[38;5;12m (https://www.oneai.com/speech-to-text) - Extension of the Whisper model which adds powerful features such as speaker identification custom vocabulary, summarization, and chapter generation.[39m
|
||
[38;5;12m- [39m[38;5;14m[1mReplicate[0m[38;5;12m (https://replicate.com/openai/whisper) - Use Whisper running on Replicate.[39m
|
||
|
||
[38;2;255;187;0m[4mRelated lists[0m
|
||
|
||
[38;5;12m- [39m[38;5;14m[1mawesome-chatgpt[0m[38;5;12m (https://github.com/sindresorhus/awesome-chatgpt) - ChatGPT resources.[39m
|
||
|
||
[38;5;12mwhisper Github: https://github.com/sindresorhus/awesome-whisper[39m
|