• Stars
    star
    5,775
  • Rank 7,036 (Top 0.2 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created over 10 years ago
  • Updated 9 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

A Python library for audio feature extraction, classification, segmentation and applications

This is general info. Click here for the complete wiki and here for a more generic intro to audio data handling

News

  • [2022-01-01] If you are not interested in training audio models from your own data, you can check the Deep Audio API, were you can directly send audio data and receive predictions with regards to the respective audio content (speech vs silence, musical genre, speaker gender, etc).
  • [2021-08-06] deep-audio-features deep audio classification and feature extraction using CNNs and Pytorch
  • Check out paura a Python script for realtime recording and analysis of audio data

General

pyAudioAnalysis is a Python library covering a wide range of audio analysis tasks. Through pyAudioAnalysis you can:

  • Extract audio features and representations (e.g. mfccs, spectrogram, chromagram)
  • Train, parameter tune and evaluate classifiers of audio segments
  • Classify unknown sounds
  • Detect audio events and exclude silence periods from long recordings
  • Perform supervised segmentation (joint segmentation - classification)
  • Perform unsupervised segmentation (e.g. speaker diarization) and extract audio thumbnails
  • Train and use audio regression models (example application: emotion recognition)
  • Apply dimensionality reduction to visualize audio data and content similarities

Installation

  • Clone the source of this library: git clone https://github.com/tyiannak/pyAudioAnalysis.git
  • Install dependencies: pip install -r ./requirements.txt
  • Install using pip: pip install -e .

An audio classification example

More examples and detailed tutorials can be found at the wiki

pyAudioAnalysis provides easy-to-call wrappers to execute audio analysis tasks. Eg, this code first trains an audio segment classifier, given a set of WAV files stored in folders (each folder representing a different class) and then the trained classifier is used to classify an unknown audio WAV file

from pyAudioAnalysis import audioTrainTest as aT
aT.extract_features_and_train(["classifierData/music","classifierData/speech"], 1.0, 1.0, aT.shortTermWindow, aT.shortTermStep, "svm", "svmSMtemp", False)
aT.file_classification("data/doremi.wav", "svmSMtemp","svm")

Result: (0.0, array([ 0.90156761, 0.09843239]), ['music', 'speech'])

In addition, command-line support is provided for all functionalities. E.g. the following command extracts the spectrogram of an audio signal stored in a WAV file: python audioAnalysis.py fileSpectrogram -i data/doremi.wav

Further reading

Apart from this README file, to bettern understand how to use this library one should read the following:

@article{giannakopoulos2015pyaudioanalysis,
  title={pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis},
  author={Giannakopoulos, Theodoros},
  journal={PloS one},
  volume={10},
  number={12},
  year={2015},
  publisher={Public Library of Science}
}

For Matlab-related audio analysis material check this book.

Author

Theodoros Giannakopoulos, Principal Researcher of Multimodal Machine Learning at the Multimedia Analysis Group of the Computational Intelligence Lab (MagCIL) of the Institute of Informatics and Telecommunications, of the National Center for Scientific Research "Demokritos"

More Repositories

1

paura

Python AUdio Recording and Analysis (paura)
Python
217
star
2

pyImageClassification

Image Feature Extraction and Classification Using Python
Python
107
star
3

multimodalAnalysis

Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science in NCSR Demokritos
Jupyter Notebook
95
star
4

deep_audio_features

Pytorch implementation of deep audio embedding calculation
Python
93
star
5

color_your_music_mood

A realtime demo for generating colors based on musical moods
Python
43
star
6

basic_audio_analysis

Jupyter Notebook
34
star
7

multimodal_movie_analysis

A Python Library for Multimodal Analysis of Movies and Content-based Movie Recommendation
Python
25
star
8

basic_audio_handling

A set of examples for basic audio data handling
Jupyter Notebook
13
star
9

amvoc

A Python Tool for Analysis of Mouse Vocal Communication
Python
13
star
10

pyTextClassification

Training and using classifiers for textual documents
Python
13
star
11

AUROS

A ROS framework for Audio Analysis
C++
12
star
12

ml-python

Python examples for the Machine Learning Course at MSc of AI at NCSR Demokritos
Jupyter Notebook
11
star
13

readys

A Speech Analytics Python Tool for Speaking Assessment
Python
10
star
14

python-data-science

Introduction to Python for Data Science
Jupyter Notebook
9
star
15

recognizeFitExercise

Classification of fitness exercises based on accelerometer and camera information
Python
9
star
16

pyVisualizeMp3Tags

Generate visualisations and reports on collections of mp3 files using python
Python
8
star
17

soundscape_quality

Data handling and baseline approach for soundscape quality estimation
Python
6
star
18

pyScholar

Python Library to Analyse and Visualise Google Scholar Metadata
Python
6
star
19

pySLRF

Python algorithms for scanning laser range finder data
Python
5
star
20

pyOpenAireTextClassifier

An open source set of supervised learning procedures for scientific text classification
Python
3
star
21

inf_teiste_data_structures_lab

C
2
star
22

dl-python

Python material for deep learning
Python
2
star
23

segment_sound

A Python script to break an audio signal into audio segments using silence removal
Python
2
star
24

covid19_predict

A simple Python predictor for covid19 data
Python
2
star
25

ml-stocks

Python
2
star
26

cv_track_face

Simple face tracking using opencv and python
Python
1
star
27

Multimodal-User-Monitoring

Python
1
star
28

music-metadata-analysis

A python lib for analysing data from the Spotify API
Python
1
star
29

tyiannak.github.io

HTML
1
star
30

inf_teiste_info_theory_lab

Information Theory Library and Examples
Python
1
star