pyACA

Python scripts accompanying the book "An Introduction to Audio Content Analysis". The source code shows example implementations of basic approaches, features, and algorithms for music audio content analysis.

All implementations are also available in:

functionality

The top-level functions are (alphabetical):

computeBeatHisto: calculates a simple beat histogram

computeChords: simple chord recognition

computeFeature: calculates instantaneous features

computeFingerprint: audio fingerprint extraction

computeKey: calculates a simple key estimate

computeMelSpectrogram: computes a mel spectrogram

computeNoveltyFunction: simple onset detection

computePitch: calculates a fundamental frequency estimate

computeSpectrogram: computes a magnitude spectrogram

The names of the additional functions follow the following conventions:

Feature*: instantaneous features

Pitch*: pitch tracking approach

Novelty*: novelty function computation

Tool*: additional helper functions and basic algorithms such as

Blocking of audio into overlapping blocks

Pre-processing audio

Conversion (freq2bark, freq2mel, freq2midi, mel2freq, midi2freq)

Filterbank (Gammatone)

Gaussian Mixture Model

Principal Component Analysis

Feature Selection

Dynamic Time Warping

K-Means Clustering

K Nearest Neighbor classification

Non-Negative Matrix Factorization

Viterbi algorithm

documentation

The latest full documentation of this package can be found at https://alexanderlerch.github.io/pyACA.

design principles

Please note that the provided code examples are only intended to showcase algorithmic principles – they are not entirely suitable for practical usage without parameter optimization and additional algorithmic tuning. Rather, they intend to show how to implement audio analysis solutions and to facilitate algorithmic understanding to enable the reader to design and implement their own analysis approaches.

minimal dependencies

The required dependencies are reduced to a minimum, more specifically to only numpy and scipy, for the following reasons:

accessibility, i.e., clear algorithmic implementation from scratch without obfuscation by using 3rd party implementations,
maintainability through independence of 3rd party code. This design choice brings, however, some limitations; for instance, reading of non-RIFF audio files is not supported and the machine learning models are very simple.

readability

Consistent variable naming and formatting, as well as the choice for simple implementations allow for easier parsing. The readability of the source code will sometimes come at the cost of lower performance.

cross-language comparability

All code is matched exactly with Matlab implementations and the equations in the book. This also means that the python code might violate typical python style conventions in order to be consistent.

getting started

installation

pip install pyACA

code examples

example 1: computation and plot of the Spectral Centroid

import pyACA
import matplotlib.pyplot as plt 

# file to analyze
cPath = "c:/temp/test.wav"

# extract feature
[v, t] = pyACA.computeFeatureCl(cPath, "SpectralCentroid")

# plot feature output
plt.plot(t,np.squeeze(v))

example 2: Computation of two features (here: Spectral Centroid and Spectral Flux)

import pyACA

# read audio file
cPath = "c:/temp/test.wav"
[f_s, afAudioData] = pyACA.ToolReadAudio(cPath)

# compute feature
[vsc, t] = pyACA.computeFeature("SpectralCentroid", afAudioData, f_s)
[vsf, t] = pyACA.computeFeature("SpectralFlux", afAudioData, f_s)

Name		Name	Last commit message	Last commit date
Latest commit History 206 Commits
.github		.github
.idea		.idea
.vs/pyACA		.vs/pyACA
pyACA		pyACA
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
citation.cff		citation.cff
doxy.config		doxy.config
pyACA.pyproj		pyACA.pyproj
pyACA.sln		pyACA.sln
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

License

alexanderlerch/pyACA

Folders and files

Latest commit

History

Repository files navigation

pyACA

functionality

documentation

design principles

minimal dependencies

readability

cross-language comparability

related repositories and links

getting started

installation

code examples

About

Topics

Resources

License

Stars

Watchers

Forks

Sponsor this project

Languages