xcmyz/FastSpeech

Stars
856
Rank 53,268 (Top 2 %)
Language
Python
License
MIT License
Created over 5 years ago
Updated over 1 year ago

xcmyz/FastSpeech

xcmyz

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

The Implementation of FastSpeech based on pytorch.

FastSpeech-Pytorch

The Implementation of FastSpeech Based on Pytorch.

Update (2020/07/20)

Optimize the training process.
Optimize the implementation of length regulator.
Use the same hyper parameter as FastSpeech2.
The measures of the 1, 2 and 3 make the training process 3 times faster than before.
Better speech quality.

Model

My Blog

Prepare Dataset

Download and extract LJSpeech dataset.
Put LJSpeech dataset in data.
Unzip alignments.zip.
Put Nvidia pretrained waveglow model in the waveglow/pretrained_model and rename as waveglow_256channels.pt;
Run python3 preprocess.py.

Training

Run python3 train.py.

Evaluation

Run python3 eval.py.

Notes

In the paper of FastSpeech, authors use pre-trained Transformer-TTS model to provide the target of alignment. I didn't have a well-trained Transformer-TTS model so I use Tacotron2 instead.
I use the same hyper-parameter as FastSpeech2.
The examples of audio are in sample.
pretrained model.

Reference

Repository

Paper

FastVocoder

Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

Transformer-TTS

TTS model based on Transformer.

FastSpeech2

The Implementation of FastSpeech2 Based on Pytorch.

CLONE

ConvTasNet4BasisMelGAN

This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.

Tacotron2-Pytorch

follow NVIDIA, simplify it and support data parallel.

Lifelong-Learning-Tacotron2

MultiSpeaker Tacotron2 using LifeLong Learning.

Hackathon-EnglishLearning

Voice Scoring System.

tacotron2.xcmyz

new version of tacotron2 (old version: https://github.com/xcmyz/Tacotron2-Pytorch)

LM-Tacotron2

Tacotron2 Combine with Language Model (BERT).

SpeakerVerification

Speaker Verification (GE2E Loss)

Gobang-AI

A C++ Implementation of Gobang AI.

Forced-Alignment

using montreal-forced-aligner.

bert-race

BERT/ALBERT based model for RACE dataset, support multi-worker, multi-GPU, FP16 and bind CPU.

Calculator

A Calculator implemented in Python.

FaceDetection

VAE-Tacotron

A Pytorch Implementation of Tacotron Combined with VAE

xcmyz

Polynomial-Calculator

基于Python实现的带有图形界面的多项式计算器

AVX-programming

CPU acceleration using AVX (Advanced Vector Extensions)

ExpressionTransformation

prefix expression, infix expression, postfix expression.