• Stars
    star
    109
  • Rank 317,187 (Top 7 %)
  • Language
    Python
  • License
    MIT License
  • Created over 7 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.

2020-08-12 Tips:

Please use the code in speech_utils.py and feature_extractor.py. The rest files are used when I developed this project.

2018-05-27 Update Feature Extractors and Utils

All implemented feature extractors have been written in the file ‘feature_extractor.py’. Please use this file for the newest version.

Speech Feature Extractors

  1. Features include: MFCC, GFCC, gammatone filterbank, Power Spectrum, Log-Power Spectrum, Amplitude Modulation Spectrum(AMS, two version), Short-Time-Fourier-Transfer Spectrum.
  2. Utils include: Ideal Binary Mask, Ideal Ratio Mask, Speech synthesis method, Mixer by dB
  3. Normalizer include: zero-to-one normalizer, unit-vector normalizer.