update lists

This commit is contained in:
2025-07-18 22:22:32 +02:00
parent 55bed3b4a1
commit 5916c5c074
3078 changed files with 331679 additions and 357255 deletions

View File

@@ -1,4 +1,4 @@
 Python for Scientific Audio
 Python for Scientific Audio
!Awesome (https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg) (https://github.com/sindresorhus/awesome) !Build Status 
(https://github.com/faroit/awesome-python-scientific-audio/workflows/CI/badge.svg) (https://github.com/faroit/awesome-python-scientific-audio/actions?query=workflow%3ACI+branch%3Amaster+event%3Apush)
@@ -59,13 +59,13 @@
⟡ pydub (http://pydub.com) :octocat: (https://github.com/jiaaro/pydub) :package: (https://pypi.python.org/pypi/mdct) - Manipulate audio with a simple and easy high level interface.
⟡ pytftb (http://tftb.nongnu.org) :octocat: (https://github.com/scikit-signal/pytftb) - Implementation of the MATLAB Time-Frequency Toolbox.
⟡ pyroomacoustics (https://github.com/LCAV/pyroomacoustics) :octocat: (https://github.com/LCAV/pyroomacoustics) :package: (https://pypi.python.org/pypi/pyroomacoustics) - Room Acoustics Simulation (RIR generator)
⟡ PyRubberband (https://github.com/bmcfee/pyrubberband) :octocat: (https://github.com/bmcfee/pyrubberband) :package: (https://pypi.python.org/pypi/pyrubberband/) - Wrapper for rubberband (http://breakfastquay.com/rubberband/) to do 
pitch-shifting and time-stretching.
⟡ PyRubberband (https://github.com/bmcfee/pyrubberband) :octocat: (https://github.com/bmcfee/pyrubberband) :package: (https://pypi.python.org/pypi/pyrubberband/) - Wrapper for rubberband (http://breakfastquay.com/rubberband/) to do pitch-shifting
and time-stretching.
⟡ PyWavelets (http://pywavelets.readthedocs.io) :octocat: (https://github.com/PyWavelets/pywt) :package: (https://pypi.python.org/pypi/PyWavelets) - Discrete Wavelet Transform in Python.
⟡ Resampy (http://resampy.readthedocs.io) :octocat: (https://github.com/bmcfee/resampy) :package: (https://pypi.python.org/pypi/resampy) - Sample rate conversion.
⟡ SFS-Python (http://www.sfstoolbox.org) :octocat: (https://github.com/sfstoolbox/sfs-python) :package: (https://pypi.python.org/pypi/sfs/) - Sound Field Synthesis Toolbox.
⟡ sound_field_analysis (https://appliedacousticschalmers.github.io/sound_field_analysis-py/) :octocat: (https://github.com/AppliedAcousticsChalmers/sound_field_analysis-py) :package: (https://pypi.org/project/sound-field-analysis/) - 
Analyze, visualize and process sound field data recorded by spherical microphone arrays.
⟡ sound_field_analysis (https://appliedacousticschalmers.github.io/sound_field_analysis-py/) :octocat: (https://github.com/AppliedAcousticsChalmers/sound_field_analysis-py) :package: (https://pypi.org/project/sound-field-analysis/) - Analyze, 
visualize and process sound field data recorded by spherical microphone arrays.
⟡ STFT (http://stft.readthedocs.io) :octocat: (https://github.com/nils-werner/stft) :package: (https://pypi.python.org/pypi/stft) - Standalone package for Short-Time Fourier Transform.
Feature extraction
@@ -74,8 +74,7 @@
⟡ audioFlux (https://github.com/libAudioFlux/audioFlux) :octocat: (https://github.com/libAudioFlux/audioFlux) :package: (https://pypi.python.org/pypi/audioflux) - A library for audio and music analysis, feature extraction.
⟡ audiolazy (https://github.com/danilobellini/audiolazy) :octocat: (https://github.com/danilobellini/audiolazy) :package: (https://pypi.python.org/pypi/audiolazy/) - Realtime Audio Processing lib, general purpose.
⟡ essentia (http://essentia.upf.edu) :octocat: (https://github.com/MTG/essentia) - Music related low level and high level feature extractor, C++ based, includes Python bindings.
⟡ python_speech_features (https://github.com/jameslyons/python_speech_features) :octocat: (https://github.com/jameslyons/python_speech_features) :package: (https://pypi.python.org/pypi/python_speech_features) - Common speech features 
for ASR.
⟡ python_speech_features (https://github.com/jameslyons/python_speech_features) :octocat: (https://github.com/jameslyons/python_speech_features) :package: (https://pypi.python.org/pypi/python_speech_features) - Common speech features for ASR.
⟡ pyYAAFE (https://github.com/Yaafe/Yaafe) :octocat: (https://github.com/Yaafe/Yaafe) - Python bindings for YAAFE feature extractor.
⟡ speechpy (https://github.com/astorfi/speechpy) :octocat: (https://github.com/astorfi/speechpy) :package: (https://pypi.python.org/pypi/speechpy) - Library for Speech Processing and Recognition, mostly feature extraction for now.
⟡ spafe (https://github.com/SuperKogito/spafe) :octocat: (https://github.com/SuperKogito/spafe) :package: (https://pypi.org/project/spafe/) - Python library for features extraction from audio files.
@@ -91,8 +90,8 @@
⟡ aeneas (https://www.readbeyond.it/aeneas/) :octocat: (https://github.com/readbeyond/aeneas/) :package: (https://pypi.python.org/pypi/aeneas/) - Forced aligner, based on MFCC+DTW, 35+ languages.
⟡ deepspeech (https://github.com/mozilla/DeepSpeech) :octocat: (https://github.com/mozilla/DeepSpeech) :package: (https://pypi.org/project/deepspeech/) - Pretrained automatic speech recognition.
⟡ gentle (https://github.com/lowerquality/gentle) :octocat: (https://github.com/lowerquality/gentle) - Forced-aligner built on Kaldi.
⟡ Parselmouth (https://github.com/YannickJadoul/Parselmouth) :octocat: (https://github.com/YannickJadoul/Parselmouth) :package: (https://pypi.org/project/praat-parselmouth/) - Python interface to the Praat (http://www.praat.org) 
phonetics and speech analysis, synthesis, and manipulation software.
⟡ Parselmouth (https://github.com/YannickJadoul/Parselmouth) :octocat: (https://github.com/YannickJadoul/Parselmouth) :package: (https://pypi.org/project/praat-parselmouth/) - Python interface to the Praat (http://www.praat.org) phonetics and 
speech analysis, synthesis, and manipulation software.
⟡ persephone (https://persephone.readthedocs.io/en/latest/) :octocat: (https://github.com/persephone-tools/persephone) :package: (https://pypi.org/project/persephone/) - Automatic phoneme transcription tool.
⟡ pyannote.audio (https://github.com/pyannote/pyannote-audio) :octocat: (https://github.com/pyannote/pyannote-audio) :package: (https://pypi.org/project/pyannote-audio/) - Neural building blocks for speaker diarization.
⟡ pyAudioAnalysis (https://github.com/tyiannak/pyAudioAnalysis)² :octocat: (https://github.com/tyiannak/pyAudioAnalysis) :package: (https://pypi.python.org/pypi/pyAudioAnalysis/) - Feature Extraction, Classification, Diarization.
@@ -100,11 +99,9 @@
⟡ pypesq (https://github.com/vBaiCai/python-pesq) :octocat: (https://github.com/vBaiCai/python-pesq) - Wrapper for the PESQ score calculation.
⟡ pystoi (https://github.com/mpariente/pystoi) :octocat: (https://github.com/mpariente/pystoi) :package: (https://pypi.org/project/pystoi) - Short Term Objective Intelligibility measure (STOI).
⟡ PyWorldVocoder (https://github.com/JeremyCCHsu/Python-Wrapper-for-World-Vocoder) :octocat: (https://github.com/JeremyCCHsu/Python-Wrapper-for-World-Vocoder) - Wrapper for Morise's World Vocoder.
⟡ Montreal Forced Aligner (https://montrealcorpustools.github.io/Montreal-Forced-Aligner/) :octocat: (https://github.com/MontrealCorpusTools/Montreal-Forced-Aligner) - Forced aligner, based on Kaldi (HMM), English (others can be 
trained).
⟡ Montreal Forced Aligner (https://montrealcorpustools.github.io/Montreal-Forced-Aligner/) :octocat: (https://github.com/MontrealCorpusTools/Montreal-Forced-Aligner) - Forced aligner, based on Kaldi (HMM), English (others can be trained).
⟡ SIDEKIT (http://lium.univ-lemans.fr/sidekit/) :package: (https://pypi.python.org/pypi/SIDEKIT/) - Speaker and Language recognition.
⟡ SpeechRecognition (https://github.com/Uberi/speech_recognition) :octocat: (https://github.com/Uberi/speech_recognition) :package: (https://pypi.python.org/pypi/SpeechRecognition/) - Wrapper for several ASR engines and APIs, online 
and offline.
⟡ SpeechRecognition (https://github.com/Uberi/speech_recognition) :octocat: (https://github.com/Uberi/speech_recognition) :package: (https://pypi.python.org/pypi/SpeechRecognition/) - Wrapper for several ASR engines and APIs, online and offline.
Environmental Sounds
@@ -122,8 +119,8 @@
⟡ commonfate (https://github.com/aliutkus/commonfate) :octocat: (https://github.com/aliutkus/commonfate) :package: (https://pypi.python.org/pypi/commonfate) - Common Fate Model and Transform.
⟡ NTFLib (https://github.com/stitchfix/NTFLib) :octocat: (https://github.com/stitchfix/NTFLib) - Sparse Beta-Divergence Tensor Factorization.
⟡ NUSSL (https://interactiveaudiolab.github.io/project/nussl.html) :octocat: (https://github.com/interactiveaudiolab/nussl) :package: (https://pypi.python.org/pypi/nussl) - Holistic source separation framework including DSP methods and 
deep learning methods.
⟡ NUSSL (https://interactiveaudiolab.github.io/project/nussl.html) :octocat: (https://github.com/interactiveaudiolab/nussl) :package: (https://pypi.python.org/pypi/nussl) - Holistic source separation framework including DSP methods and deep 
learning methods.
⟡ NIMFA (http://nimfa.biolab.si) :octocat: (https://github.com/marinkaz/nimfa) :package: (https://pypi.python.org/pypi/nimfa) - Several flavors of non-negative-matrix factorization.
Music Information Retrieval
@@ -152,8 +149,7 @@
⟡ Jupylet (https://github.com/nir/jupylet) :octocat: (https://github.com/nir/jupylet) - Subtractive, additive, FM, and sample-based sound synthesis.
⟡ PYO (http://ajaxsoundstudio.com/software/pyo/) :octocat: (https://github.com/belangeo/pyo) - Realtime audio dsp engine.
⟡ python-sounddevice (https://github.com/spatialaudio/python-sounddevice) :octocat: (http://python-sounddevice.readthedocs.io) :package: (https://pypi.python.org/pypi/sounddevice) - PortAudio wrapper providing realtime audio I/O with 
NumPy.
⟡ python-sounddevice (https://github.com/spatialaudio/python-sounddevice) :octocat: (http://python-sounddevice.readthedocs.io) :package: (https://pypi.python.org/pypi/sounddevice) - PortAudio wrapper providing realtime audio I/O with NumPy.
⟡ ReTiSAR (https://github.com/AppliedAcousticsChalmers/ReTiSAR) :octocat: (https://github.com/AppliedAcousticsChalmers/ReTiSAR) - Binarual rendering of streamed or IR-based high-order spherical microphone array signals.
Web Audio
@@ -165,8 +161,7 @@
⟡ beets (http://beets.io/) :octocat: (https://github.com/beetbox/beets) :package: (https://pypi.python.org/pypi/beets) - Music library manager and MusicBrainz (https://musicbrainz.org/) tagger.
⟡ musdb (http://dsdtools.readthedocs.io) :octocat: (https://github.com/sigsep/sigsep-mus-db) :package: (https://pypi.python.org/pypi/musdb) - Parse and process the MUSDB18 dataset.
⟡ medleydb (http://medleydb.readthedocs.io) :octocat: (https://github.com/marl/medleydb) - Parse medleydb (http://medleydb.weebly.com/) audio + annotations.
⟡ Soundcloud API (https://github.com/soundcloud/soundcloud-python) :octocat: (https://github.com/soundcloud/soundcloud-python) :package: (https://pypi.python.org/pypi/soundcloud) - Wrapper for Soundcloud API 
(https://developers.soundcloud.com/).
⟡ Soundcloud API (https://github.com/soundcloud/soundcloud-python) :octocat: (https://github.com/soundcloud/soundcloud-python) :package: (https://pypi.python.org/pypi/soundcloud) - Wrapper for Soundcloud API (https://developers.soundcloud.com/).
⟡ Youtube-Downloader (http://rg3.github.io/youtube-dl/) :octocat: (https://github.com/rg3/youtube-dl) :package: (https://pypi.python.org/pypi/youtube_dl) - Download youtube videos (and the audio).
⟡ audiomate (https://github.com/ynop/audiomate) :octocat: (https://github.com/ynop/audiomate) :package: (https://pypi.python.org/pypi/audiomate/) - Loading different types of audio datasets.
⟡ mirdata (https://mirdata.readthedocs.io/en/latest/) :octocat: (https://github.com/mir-dataset-loaders/mirdata) :package: (https://pypi.python.org/pypi/mirdata) - Common loaders for Music Information Retrieval (MIR) datasets.
@@ -192,10 +187,10 @@
Scientific Papers
⟡ Python for audio signal processing (http://eprints.maynoothuniversity.ie/4115/1/40.pdf) - John C. Glover, Victor Lazzarini and Joseph Timoney, Linux Audio Conference 2011.
⟡ librosa: Audio and Music Signal Analysis in Python (http://conference.scipy.org/proceedings/scipy2015/pdfs/brian_mcfee.pdf), Video (https://www.youtube.com/watch?v=MhOdbtPhbLU) - Brian McFee, Colin Raffel, Dawen Liang, Daniel P.W. 
Ellis, Matt McVicar, Eric Battenberg, Oriol Nieto, Scipy 2015.
⟡ pyannote.audio: neural building blocks for speaker diarization (https://arxiv.org/abs/1911.01255), Video (https://www.youtube.com/watch?v=37R_R82lfwA) - Hervé Bredin, Ruiqing Yin, Juan Manuel Coria, Gregory Gelly, Pavel Korshunov, 
Marvin Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz, Marie-Philippe Gill, ICASSP 2020.
⟡ librosa: Audio and Music Signal Analysis in Python (http://conference.scipy.org/proceedings/scipy2015/pdfs/brian_mcfee.pdf), Video (https://www.youtube.com/watch?v=MhOdbtPhbLU) - Brian McFee, Colin Raffel, Dawen Liang, Daniel P.W. Ellis, Matt 
McVicar, Eric Battenberg, Oriol Nieto, Scipy 2015.
⟡ pyannote.audio: neural building blocks for speaker diarization (https://arxiv.org/abs/1911.01255), Video (https://www.youtube.com/watch?v=37R_R82lfwA) - Hervé Bredin, Ruiqing Yin, Juan Manuel Coria, Gregory Gelly, Pavel Korshunov, Marvin 
Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz, Marie-Philippe Gill, ICASSP 2020.
Other Resources
@@ -217,3 +212,5 @@
License
!License: CC BY 4.0 (https://img.shields.io/badge/License-CC%20BY%204.0-lightgrey.svg) (https://creativecommons.org/licenses/by/4.0/)
pythonscientificaudio Github: https://github.com/faroit/awesome-python-scientific-audio