
Python Audio Libraries

Curated list of the most awesome Python Audio Libraries.

Python offers a wide range of libraries for manipulating, analyzing, and generating audio signals. Whether you are a musician, a data scientist, or a developer working on a multimedia project, the libraries below cover tasks ranging from feature extraction and data augmentation to tagging, synthesis, and device control. Each entry gives the library's name followed by a short description of what it does.

  • pyAudioAnalysis - Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications.
  • librosa - Python library for audio and music analysis.
  • audiomentations - A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
  • madmom - Python audio and music signal processing library.
  • python-acoustics - A Python library aimed at acousticians.
  • amodem - Audio MODEM Communication Library in Python.
  • audioread - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python.
  • pycaw - Python Core Audio Windows Library.
  • SoundCard - A Pure-Python Real-Time Audio Library.
  • pytaglib - Python audio tagging library.
  • python-soundfile - SoundFile is an audio library based on libsndfile, CFFI, and NumPy.
  • audiomate - Python library for handling audio datasets.
  • polymath - Convert any music library into a music production sample-library with ML.
  • nussl - A flexible source separation library in Python.
  • nonoCAPTCHA - An asynchronized Python library to automate solving ReCAPTCHA v2 using audio.
  • surfboard - Novoic's audio feature extraction library.
  • muda - A library for augmenting annotated audio data.
  • aeneas - aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment).
  • python-fsb5 - Library and tool to extract audio from FSB5 (FMOD Sample Bank) files.
  • mirdata - Python library for working with Music Information Retrieval datasets.
  • audiotsm - A python library for real-time audio time-scale modification procedures.
  • PythonAudioEffects - A Python library that can apply Darth Vader, echo, radio, robotic, and ghost effects to audio samples.
  • pydiogment - 📣 Python library for audio augmentation.
  • PyTSMod - An open-source Python library for audio time-scale modification.
  • soundata - Python library for downloading, loading & working with sound datasets.
  • radiotool - a python library for manipulating audio files.
  • zignal - Python audio signal processing library.
  • Musical-Note-detection - Python code for detecting musical notes in an audio file using the NumPy and SciPy libraries.
  • audio-metadata - A library for reading and, in the future, writing audio metadata. https://audio-metadata.readthedocs.io/
  • rgain3 - A Python 3 compatible fork of rgain -- ReplayGain tools and Python library.
  • Adafruit_CircuitPython_Waveform - CircuitPython library to generate single wavelength waveforms.
  • praatIO - A Python library for working with Praat, TextGrids, time-aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc.).
  • wavencoder - WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
  • sample-diffusion - A Python library and CLI for generating audio samples using Harmonai Dance Diffusion models.
  • sound-machine - Python library for musical audio synthesis.
  • AudioScraper - Python script that pulls out the audio in the RTP layer of a packet capture. This particular script makes use of the PyShark library. It also assumes the pcap is being read in as a file (not a live capture).
  • pyAudioKits - Powerful Python audio workflow support based on librosa and other libraries.
  • Resound - Python audio clip fingerprinting library.
  • dvtag - A tool for tagging your doujin voice library. Supports MP3/FLAC.
  • PythonMusicPlayer - Simple Python music player implemented with different GUI and audio libraries.
  • pyrekordbox - Unofficial Python package for interacting with the library of Pioneer's Rekordbox DJ software.
  • apkit - Audio Processing Kit -- a python library.
  • audiolabel - Python library for reading and writing label files for phonetic analysis (Praat, ESPS, Wavesurfer).
  • discogstagger - Console based audio-file metadata tagger that uses the Discogs.com API v2 (JSON based). Relies on the Mutagen and discogs-client libraries. Currently supports FLAC and MP3 file types.
  • micmon - A Python library and set of scripts to create labelled audio datasets from raw audio files and use them to train sound detection models.
  • libf0 - A Python Library for Fundamental Frequency Estimation in Music Recordings.
  • PMLib - A physical modelling library for SuperCollider.
  • teal - Library of TensorFlow layers for audio data processing and data augmentation.
  • tones - Pure-python library for generating audio tones, with support for polyphony, pitch-bending and vibrato.
  • AudioEndpointControl - A library to access and control audio devices (soundcard speakers/mics) written in Python for Windows Vista and later.
  • thunder-speech - A hackable speech recognition library.
  • audioperm - A Python library for generating different permutations of audible segments from audio files.
  • pynuvo - Python Library for Nuvo Audio Systems (Serial).
  • pi-topPULSE - pi-topPULSE Raspberry Pi HAT: automatic device initialisation, Python library for 7x7 LED array & microphone, and I²S audio configuration.
  • fast-align-audio - A fast python library for aligning similar audio snippets passed in as NumPy arrays.
  • vtc-py - A SMPTE video timecode library for Python.
  • audioanalysis - A Python library focused on classification of audio data, especially that gathered from research in neuroscience on animal vocalization.
  • pythonTaglib - Python audio tagging library.
  • pyrilla - Python bindings to gorilla-audio library.
  • audio_python - A Python script using gTTS library to create Text-To-Speech mp3 - https://letslearnabout.net/python/creating-text-to-speech-with-python-and-gtts/
  • audio-plot-lib - This library provides graph sonification functions and has been developed for a project named "Data science and machine learning resources for screen reader users". Please refer to the project page for more details.
  • Audiobook-from-pdf-Python - Learn how to create an audiobook from any PDF using Python. It uses two libraries: pyttsx3, a text-to-speech conversion library that works offline and is compatible with both Python 2 and 3 (pip install pyttsx3), and PyPDF2, a pure-Python PDF toolkit capable of extracting document information, splitting and merging documents page by page, cropping pages, merging multiple pages into a single page, and encrypting and decrypting PDF files (pip install PyPDF2).
  • AudioBook - AudioBook that reads a PDF file using Python libraries pyttsx3 & PyPDF2.
  • AudioWaterMark - An audio watermark library written in Python.
  • audio-plot - Python library that converts a line graph to sound and returns an object that can be played in Jupyter notebook or Google Colab. Values are represented by pitches, and the timeline is represented by left and right pans. It was created to make data science fun for the visually impaired.
  • PythonAudiobookGenerator - Python Script to generate MP3 files from Articles and PDF Books using Google Text To Speech Library.
  • timething - Timething is a library for aligning text transcripts with their audio recordings.
  • tamcolors - tamcolors is a terminal game library which supports multiplayer and audio. tamcolors gives a buffer which lets the user set the character, foreground color and background color, and which can draw at a stable FPS of 25 on all supported consoles.
  • Adafruit_CircuitPython_BD3491FS - CircuitPython library for the Rohm BD3491FS Audio Processor.
  • ml-training-airflow-mlflow-example - An example of setting up local audio ML training pipeline on Airflow with MLFlow experiment tracking and custom python library.
  • Audio_Libraries - Audio_Libraries with python.
  • pyrasta - A library for audio processing.
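
The audio-plot entry above describes sonification in which values are represented by pitches and the timeline by left and right pans. The following is a minimal standard-library sketch of that general idea only; it is not audio-plot's actual implementation or API. The function name `sonify`, its parameters, and the linear value-to-frequency mapping are all assumptions made for illustration.

```python
import math


def sonify(values, sample_rate=8000, note_seconds=0.125,
           low_hz=220.0, high_hz=880.0):
    """Map each value to a pitch and pan the notes from left to right.

    Hypothetical helper (not part of any listed library). Returns a list
    of (left, right) float sample pairs, each in the range [-1.0, 1.0].
    """
    if not values:
        return []
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0  # avoid division by zero for constant input
    samples_per_note = int(sample_rate * note_seconds)
    frames = []
    for i, value in enumerate(values):
        # Assumed mapping: linear from data value to frequency.
        freq = low_hz + (value - lo) / span * (high_hz - low_hz)
        # Pan position: 0.0 is fully left, 1.0 is fully right.
        pan = i / (len(values) - 1) if len(values) > 1 else 0.5
        for n in range(samples_per_note):
            s = math.sin(2 * math.pi * freq * n / sample_rate)
            frames.append((s * (1.0 - pan), s * pan))
    return frames


frames = sonify([1, 2, 3])
print(len(frames))
```

The resulting stereo frames could then be scaled to 16-bit integers and written out with Python's built-in `wave` module, or handed to one of the playback libraries listed above.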

Conclusion

In conclusion, Python provides a rich ecosystem of audio libraries that cater to different needs and requirements. From simple audio playback to advanced signal processing and synthesis, these libraries empower developers to work with audio in a flexible and efficient manner. Whether you are a beginner or an experienced programmer, exploring these libraries can open up new avenues for creativity and innovation in the world of audio. So dive in, experiment with these tools, and unleash your imagination to create amazing audio applications with Python.