• Stars
    star
    139
  • Rank 261,482 (Top 6 %)
  • Language
    Python
  • Created over 7 years ago
  • Updated over 7 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

speech recognition based on tensorflow 1.0.0

Speech-to-Text-WaveNet : End-to-end sentence level Chinese speech recognition using DeepMind's WaveNet

A tensorflow implementation for Chinese speech recognition based on DeepMind's WaveNet: A Generative Model for Raw Audio. (Hereafter the Paper)

Version

Current Version : 0.0.1

Dependencies

  1. python == 3.5
  2. tensorflow == 1.0.0
  3. librosa == 0.5.0

Dataset

清华30小时中文数据集

Directories

  1. cache: save data featrue and word dictionary
  2. data: wav files and related labels
  3. model: save the models

Network model

  1. Data random shuffle per epoch
  2. Xavier initialization
  3. Adam optimization algorithms
  4. Batch Normalization

Train the network

python3 train.py

Test the network

python3 test.py

Other resources

  1. TensorFlow练习15: 中文语音识别
  2. ibab's WaveNet(speech synthesis) tensorflow implementationt
  3. buriburisuri's WaveNet(English speech recognition) tensorflow and sugartensor implementationt