• Stars
    star
    367
  • Rank 116,257 (Top 3 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created over 2 years ago
  • Updated 6 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Production First and Production Ready End-to-End Text-to-Speech Toolkit

WeTTS

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Note: This project is at its early statge now. Its design and implementation are subjected to change.

Install

We suggest installing WeTTS with Anaconda or Miniconda. Clone this repo:

git clone https://github.com/wenet-e2e/wetts.git

Create environment:

conda create -n wetts python=3.8 -y
conda activate wetts
pip install -r requirements.txt
conda install -n wetts pytorch=1.11 torchaudio cudatoolkit=10.2 -c pytorch -c conda-forge -y

Please note you should use cudatoolkit=11.3 for CUDA 11.3.

Roadmap

We mainly focus on end to end, production, and on-device TTS. We are going to use:

Dataset

We plan to support a variaty of open source TTS datasets, include but not limited to:

  • Baker, Chinese Standard Mandarin Speech corpus open sourced by Data Baker.
  • AISHELL-3, a large-scale and high-fidelity multi-speaker Mandarin speech corpus.
  • Opencpop, Mandarin singing voice synthesis (SVS) corpus open sourced by Netease Fuxi.

Pretrained Models

Dataset Language Checkpoint Model Runtime Model
Baker CN BERT BERT
Baker CN VITS VITS

Runtime

We plan to support a variaty of hardwares and platforms, including:

  • x86
  • Android
  • Raspberry Pi
  • Other on-device platforms
./build/bin/tts_main \
  --tagger baker_bert_onnx/zh_tn_tagger.fst \
  --verbalizer baker_bert_onnx/zh_tn_verbalizer.fst \
  --vocab baker_bert_onnx/vocab.txt \
  --char2pinyin baker_bert_onnx/pinyin_dict.txt \
  --pinyin2id baker_bert_onnx/polyphone_phone.txt \
  --pinyin2phones baker_bert_onnx/lexicon.txt \
  --g2p_prosody_model baker_bert_onnx/19.onnx \
  --speaker2id baker_vits_v1_onnx/speaker.txt \
  --sname baker \
  --phone2id baker_vits_v1_onnx/phones.txt \
  --vits_model baker_vits_v1_onnx/G_250000.onnx \
  --text "你好,我是小明。" \
  --wav_path audio.wav

Discussion & Communication

For Chinese users, you can aslo scan the QR code on the left to follow our offical account of WeNet. We created a WeChat group for better discussion and quicker response. Please scan the personal QR code on the right, and the guy is responsible for inviting you to the chat group.

Or you can directly discuss on Github Issues.

Acknowledgement

  1. We borrow a lot of code from vits for VITS implementation.
  2. We refer PaddleSpeech for pinyin lexicon generation.

More Repositories

1

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit
Python
4,073
star
2

speech-synthesis-paper

List of speech synthesis papers.
989
star
3

wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Python
690
star
4

WenetSpeech

A 10000+ hours dataset for Chinese speech recognition
Shell
488
star
5

wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit
Python
444
star
6

WeTextProcessing

Text Normalization & Inverse Text Normalization
Python
443
star
7

speech-recognition-papers

Towards hot directions in industrial end to end speech recognition
325
star
8

opencpop

Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis
207
star
9

wenet-kws

Production First and Production Ready End-to-End Keyword Spotting Toolkit
Python
142
star
10

west

We Speech Transcript based on LLM, in 300 lines of code.
Python
109
star
11

wesep

Target Speaker Extraction Toolkit
Python
80
star
12

wesignal

Production first, nn-based on-device signal processing toolkit.
63
star
13

WeTextProcessing.deprecated

C++
61
star
14

wesubtitle

用 OCR 提取视频硬字幕
Python
54
star
15

llm-papers

List of Large Lanugage Model Papers
51
star
16

wecut

video cut powered by AI
25
star
17

WeSpeech-AI

Open Source Speech/Text Data on AI
18
star
18

nn-singal-processing-papers

List of NN based singal processing papers
17
star
19

wenet_in_action_homework

WeNet 实战课程作业
Python
16
star
20

wenet-e2e.github.io

WeNet Community
CSS
1
star
21

wenet-contributors

Contributors of WeNet, including individual and companies.
1
star