Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

Rust

Shell

C#

Perl

Crystal

Zig

Objective-C

Swift

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Kotlin

JavaScript

Elixir

TypeScript

Rust

Dart

Python

F#

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇧🇮 Burundi

🇮🇩 Indonesia

🇲🇿 Mozambique

🇫🇯 Fiji

🇸🇩 Sudan

🇷🇪 Réunion

🇸🇪 Sweden

🇨🇳 China

All Countries Compare Countries

ZhihaoDU/speech_feature_extractor

Stars
109
Rank 317,187 (Top 7 %)
Language
Python
License
MIT License
Created over 7 years ago
Updated over 1 year ago

ZhihaoDU

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.

2020-08-12 Tips:

Please use the code in speech_utils.py and feature_extractor.py. The rest files are used when I developed this project.

2018-05-27 Update Feature Extractors and Utils

All implemented feature extractors have been written in the file ‘feature_extractor.py’. Please use this file for the newest version.

Speech Feature Extractors

Features include: MFCC, GFCC, gammatone filterbank, Power Spectrum, Log-Power Spectrum, Amplitude Modulation Spectrum(AMS, two version), Short-Time-Fourier-Transfer Spectrum.
Utils include: Ideal Binary Mask, Ideal Ratio Mask, Speech synthesis method, Mixer by dB
Normalizer include: zero-to-one normalizer, unit-vector normalizer.

du2022sond

Speaker overlap-aware Neural Diarization

du2020dan

The implementation of our paper "Double adversarial networks for monaural speech enhancement" accepted by INTERSPEECH 2020.

Python

food_is_unstopped

Food is unstopped!!!! GO!

Python

du2020kws

This a small footprint robust KWS system which is based on the multi-conditional training, retraining and joint-training. This system includes a small-footprint KWS system and a small-footprint speech enhancement model. We also investigate a compress method for CNN and LSTM.

Python

zhihaodu.github.io