• Stars
    star
    989
  • Rank 46,300 (Top 1.0 %)
  • Language
  • License
    MIT License
  • Created over 4 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

List of speech synthesis papers.

Speech Synthesis Paper

List of speech synthesis papers (-> more papers <-). Welcome to recommend more awesome papers ๐Ÿ˜€.

Repositories for collecting awesome speech paper:

What is the meaning of 'โ˜…'? I add 'โ˜…' to the papers which number of citations is over 50 (only in Acoustic Model, Vocoder and TTS towards Stylization). Beginner can read these paper first to get basic knowledge of Deep-Learning-based TTS model (#1).

Content

TTS Frontend

Acoustic Model

Autoregressive Model

Non-Autoregressive Model

Alignment Study

Data Efficiency

Vocoder

Autoregressive Model

Non-Autoregressive Model

Others

TTS towards Stylization

Expressive TTS

MultiSpeaker TTS

New Perspective on TTS

Voice Conversion

ASR & TTS Based

VAE & Auto-Encoder Based

GAN Based

Singing

Singing Voice Synthesis

Singing Voice Conversion

More Repositories

1

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit
Python
4,073
star
2

wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Python
690
star
3

WenetSpeech

A 10000+ hours dataset for Chinese speech recognition
Shell
488
star
4

wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit
Python
444
star
5

WeTextProcessing

Text Normalization & Inverse Text Normalization
Python
443
star
6

wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit
Python
367
star
7

speech-recognition-papers

Towards hot directions in industrial end to end speech recognition
325
star
8

opencpop

Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis
207
star
9

wenet-kws

Production First and Production Ready End-to-End Keyword Spotting Toolkit
Python
142
star
10

west

We Speech Transcript based on LLM, in 300 lines of code.
Python
109
star
11

wesep

Target Speaker Extraction Toolkit
Python
80
star
12

wesignal

Production first, nn-based on-device signal processing toolkit.
63
star
13

WeTextProcessing.deprecated

C++
61
star
14

wesubtitle

็”จ OCR ๆๅ–่ง†้ข‘็กฌๅญ—ๅน•
Python
54
star
15

llm-papers

List of Large Lanugage Model Papers
51
star
16

wecut

video cut powered by AI
25
star
17

WeSpeech-AI

Open Source Speech/Text Data on AI
18
star
18

nn-singal-processing-papers

List of NN based singal processing papers
17
star
19

wenet_in_action_homework

WeNet ๅฎžๆˆ˜่ฏพ็จ‹ไฝœไธš
Python
16
star
20

wenet-e2e.github.io

WeNet Community
CSS
1
star
21

wenet-contributors

Contributors of WeNet, including individual and companies.
1
star