luuil/Tensorflow-Audio-Classification

Stars
125
Rank 284,730 (Top 6 %)
Language
Python
License
Apache License 2.0
Created over 6 years ago
Updated almost 3 years ago

luuil/Tensorflow-Audio-Classification

luuil

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Audio classification with VGGish as feature extractor in TensorFlow

Audio Classification

Classify the audios. In this repo, I train a model on UrbanSound8K dataset, and achieve about 80% accuracy on test dataset.

There is a pre-trained model in urban_sound_train, trained epoch is 1000

Usage

audio_train.py: Train audio model from scratch or restore from checkpoint.
audio_params.py: Configuration for training a model.
audio_inference_demo.py: Demo for test the trained model.
./audio/*: Dependencies of training, model and datasets.
./vggish/*: Dependencies of VGGish for feature extracting.

Env setup

Conda are recommended, just need one line: conda env create -f conda.env.yml

Train & Test

Config parameters: audio_params.py.
Train the model: python audio_train.py. (It will create tfrecords automaticly if not exists)
Check the training process from tensorboard: tensorboard --logdir=./data/tensorboard
Test the model: python audio_inference_demo.py.

Tools

Dataset

urban sound dataset

Ref. Blogs

MLFSPS

Feature Selection Prototype System in Multi-Label Classification

Merging-Models-for-TensorFlow-Serving

Merging Models for TensorFlow Serving HOT UPDATING

OCR-Synthesizer

Synthesizer for OCR training

Tools

Our Little Tools

google-ml-crash-course

from https://developers.google.com/machine-learning/crash-course/

Jupyter Notebook

cmake-examples

MLLCWFS

Source code for paper "A Label Correlation Based Weighting Feature Selection Approach for Multi-label Data"