deep_learning_for_speech_enhancement_keras_python
deep learning based speech enhancement using keras python
Authors: YONG XU & QIUQIANG KONG
Goal:
Make the GPU-C++ code project convert to python code which is much easier for the community to follow and use. The training and decoding code will be unified into the python code. Keras will be used as the toolkit.
Invitation:
I want to invite you to be one of the contributors of this project, please contact me if you have interest. [email protected]
My final goal is to build a universal & robust deep learning based speech enhancement front end. And aslo try to adapt it to really serve for the speech recognition back-end.
Ref:
The original GPU-C++ code: https://github.com/yongxuUSTC/DNN-for-speech-enhancement
Please cite the following papers if you use this code:
[1] A Regression Approach to Speech Enhancement Based on Deep Neural Networks. Yong Xu, Jun Du,Li-Rong Dai and Chin-Hui Lee, IEEE/ACM Transactions on Audio,Speech, and Language Processing,P.7-19,Vol.23,No.1, 2015 (2018 IEEE SPS Best paper award, citations > 600)
[2] An Experimental Study on Speech Enhancement Based on Deep Neural Networks. Yong Xu, Jun Du, Li-Rong Dai and Chin-Hui Lee,IEEE signal processing letters, p. 65-68,vol.21,no. 1,January 2014 (citations > 550)
[3] Multi-Objective Learning and Mask-Based Post-Processing for Deep Neural Network Based Speech Enhancement, Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee, Interspeech2015
Some DNN based speech enhancemen demos:
http://staff.ustc.edu.cn/~jundu/The%20team/yongxu/demo/SE_DNN_taslp.html
http://staff.ustc.edu.cn/~jundu/The%20team/yongxu/demo/IS15.html