  • Stars: 115
  • Rank: 305,916 (Top 7 %)
  • Language: Python
  • License: MIT License
  • Created over 2 years ago
  • Updated over 2 years ago

Repository Details

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Unofficial implementation of Avocodo: Generative Adversarial Network for Artifact-free Vocoder.

Disclaimer: This repo currently works only with config_v1.json. It was built for experimentation, not for production use.

  • For best quality speech synthesis please visit deepsync.co

Training:

python train.py --config config_v1.json

Notes:

  • Avocodo uses the same generator as HiFi-GAN V1 and V2 but different discriminators, which are intended to model both low and high frequencies better.
  • PQMF is crucial for both discriminators.
  • Losses are similar to HiFi-GAN's.
  • Performance and speed are both roughly similar to HiFi-GAN's.
  • Avocodo is far better than HiFi-GAN at synthesizing speech from unseen speakers.
  • Avocodo training is around 20 % faster than HiFi-GAN training, and it needs much less training to produce excellent audio quality.
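
Since the notes say the losses are similar to HiFi-GAN's, the following is a minimal sketch of the loss terms from the HiFi-GAN paper: least-squares adversarial losses, a feature-matching loss, and a mel-spectrogram L1 loss. It is not taken from this repo. The function names are hypothetical, plain Python lists stand in for tensors, and the weights 2 and 45 are the HiFi-GAN paper's values, which this implementation may or may not use.

```python
# Sketch of HiFi-GAN-style losses (assumed formulation, not this repo's code).
# Plain lists of floats stand in for tensors to keep the example dependency-free.

def mean(values):
    """Arithmetic mean of a non-empty list of floats."""
    return sum(values) / len(values)

def discriminator_loss(real_scores, fake_scores):
    """Least-squares GAN loss: push real scores toward 1 and fake scores toward 0."""
    real_term = mean([(1.0 - r) ** 2 for r in real_scores])
    fake_term = mean([f ** 2 for f in fake_scores])
    return real_term + fake_term

def generator_adversarial_loss(fake_scores):
    """Least-squares GAN loss for the generator: push fake scores toward 1."""
    return mean([(1.0 - f) ** 2 for f in fake_scores])

def feature_matching_loss(real_feats, fake_feats):
    """Mean L1 distance between discriminator feature maps, summed over layers."""
    return sum(
        mean([abs(r - f) for r, f in zip(real_layer, fake_layer)])
        for real_layer, fake_layer in zip(real_feats, fake_feats)
    )

def mel_loss(real_mel, fake_mel):
    """Mean L1 distance between real and generated mel-spectrogram values."""
    return mean([abs(r - f) for r, f in zip(real_mel, fake_mel)])

def generator_loss(fake_scores, real_feats, fake_feats, real_mel, fake_mel,
                   lambda_fm=2.0, lambda_mel=45.0):
    """Total generator loss; the default weights are the HiFi-GAN paper's values."""
    return (
        generator_adversarial_loss(fake_scores)
        + lambda_fm * feature_matching_loss(real_feats, fake_feats)
        + lambda_mel * mel_loss(real_mel, fake_mel)
    )
```

In a real training loop these terms would be computed per sub-discriminator and summed, with the scores and feature maps coming from the discriminators' forward passes.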

Citations:

@misc{https://doi.org/10.48550/arxiv.2206.13404,
  doi = {10.48550/ARXIV.2206.13404},
  url = {https://arxiv.org/abs/2206.13404},
  author = {Bak, Taejun and Lee, Junmo and Bae, Hanbin and Yang, Jinhyeok and Bae, Jae-Sung and Joo, Young-Sun},
  keywords = {Audio and Speech Processing (eess.AS), Artificial Intelligence (cs.AI), Sound (cs.SD), FOS: Electrical engineering, electronic engineering, information engineering, FOS: Computer and information sciences},
  title = {Avocodo: Generative Adversarial Network for Artifact-free Vocoder},
  publisher = {arXiv},
  year = {2022},
  copyright = {arXiv.org perpetual, non-exclusive license}
}

More Repositories

1. ViViT-pytorch (Python, 500 stars): Implementation of ViViT: A Video Vision Transformer
2. ResUnet (Python, 444 stars): Pytorch implementation of ResUnet and ResUnet ++
3. VocGAN (Python, 317 stars): VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
4. FNet-pytorch (Python, 249 stars): Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
5. convolution-vision-transformers (Python, 217 stars): PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers
6. iSTFTNet-pytorch (Python, 214 stars): iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
7. MLP-Mixer-pytorch (Python, 207 stars): Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision
8. hifigan-denoiser (Python, 198 stars): HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
9. CrossViT-pytorch (Python, 180 stars): Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
10. AdaSpeech (Jupyter Notebook, 156 stars): AdaSpeech: Adaptive Text to Speech for Custom Voice
11. HiFiplusplus-pytorch (Python, 148 stars): HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
12. SoundStorm-pytorch (Python, 116 stars): Google's SoundStorm: Efficient Parallel Audio Generation
13. Fre-GAN-pytorch (Python, 101 stars): Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
14. CeiT-pytorch (Python, 99 stars): Implementation of Convolutional enhanced image Transformer
15. vae_tacotron2 (Python, 85 stars): VAE Tacotron 2, an alternative of GST Tacotron
16. TalkNet2-pytorch (Python, 85 stars): TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.
17. TFGAN (Python, 84 stars): TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
18. LightSpeech (Python, 80 stars): LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
19. HiFi-GAN (Python, 79 stars): HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
20. NaturalSpeech2 (Python, 70 stars)
21. UnivNet-pytorch (Python, 69 stars): UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
22. AdaSpeech2 (Jupyter Notebook, 69 stars): AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
23. AudioMAE-pytorch (Python, 61 stars): Unofficial PyTorch implementation of Masked Autoencoders that Listen
24. melgan (Jupyter Notebook, 59 stars): MelGAN implementation with Multi-Band and Full Band supports...
25. Liveness-Detection (Python, 52 stars): Liveness Detection for human face
26. gmvae_tacotron (Python, 52 stars): Gaussian Mixture VAE Tacotron
27. iSTFT-Avocodo-pytorch (Python, 50 stars): Ultrafast GAN based Vocoder for Text to Speech
28. Phone-Level-Mixture-Density-Network-for-TTS (Jupyter Notebook, 45 stars): Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
29. LSTM-Time-Series-Analysis (Jupyter Notebook, 44 stars): Using LSTM network for time series forecasting
30. NU-Wave-pytorch (Python, 37 stars): NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
31. ResMLP-pytorch (Python, 36 stars): ResMLP: Feedforward networks for image classification with data-efficient training
32. PPSpeech (Python, 35 stars): PPSpeech: Phrase based Parallel End-to-End TTS System
33. Zero-Shot-TTS (Python, 34 stars): Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
34. NU-Wave2-pytorch (Python, 24 stars): NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
35. SiT-pytorch (Python, 19 stars): SiT: Self-supervised vision Transformer
36. rectified-linear-attention (Python, 17 stars): Sparse Attention with Linear Units
37. CoaT-pytorch (Python, 16 stars): CoaT: Co-Scale Conv-Attentional Image Transformers
38. Movie-Recommender-System (Python, 13 stars)
39. LocalViT-pytorch (Python, 9 stars): LocalViT: Bringing Locality to Vision Transformers
40. Bidirectional-LEM-pytorch (Python, 9 stars): Pytorch Implementation of Bidirectional Long Expressive Memory
41. compact-convolution-transformer (Python, 8 stars): Compact Convolution Transformers
42. WaveFlow (Python, 4 stars): WaveFlow : A Compact Flow-based Model for Raw Audio
43. McKinsey-Hiring-Hack-Challenge (Jupyter Notebook, 4 stars): My solution for Online McKinsey Hiring Hack Challenge hosted by Analytics Vidhya.
44. Word2Vec (Jupyter Notebook, 3 stars): Word2Vec tutorial using tensorflow
45. IMDB-Movie-Review-sentiment-Analysis (Jupyter Notebook, 3 stars)
46. Meme-recognizer (Jupyter Notebook, 3 stars): Recognize the given image is Meme or not
47. Introduction-to-Tensorflow (Jupyter Notebook, 2 stars): Tensorflow tutorial from scratch
48. fastspeech2_samples (2 stars)
49. MyApplication (Java, 2 stars): Android application in which audio and image play simultaneously
50. Loan-Prediction-Challenge (Jupyter Notebook, 2 stars)
51. CNN-Visualization (Jupyter Notebook, 2 stars)
52. Twins-SVT-pytorch (2 stars): Twins: Revisiting the Design of Spatial Attention in Vision Transformers
53. Image-classifier-for-all (Jupyter Notebook, 2 stars): Universal Image classifier
54. PropertySetUp (1 star)
55. Document-Classifier (Jupyter Notebook, 1 star): Classify documents using Machine learning
56. Data-Analysis (Jupyter Notebook, 1 star)
57. Avito-Duplicate-ads (Jupyter Notebook, 1 star)
58. Natural-Language-Processing (Jupyter Notebook, 1 star)
59. Keras (Jupyter Notebook, 1 star): Predictive analysis using Keras a powerful Neural network library run over theano for python
60. Data-Mining-Algos (Python, 1 star): Famous Data Mining Algos written in python using scikit-learn library
61. Identify-Question-Type (Jupyter Notebook, 1 star): Given a question, the aim is to identify the category it belongs to. The four categories to handle for this assignment are : Who, What, When, Affirmation(yes/no). Label any sentence that does not fall in any of the above four as "Unknown" type.
62. Inception-Transformer-pytorch (1 star): iFormer: Inception Transformer
63. Email-Classification-Statement-Contract (Python, 1 star): classify emails into statements and contracts
64. Movie-Recommendation-System (Jupyter Notebook, 1 star): Hybrid Movie recommendation system
65. SystemInfo (Jupyter Notebook, 1 star)
66. rishikksh20.github.io (HTML, 1 star): My Github Blog
67. LSTM_syntheic_gradient (Jupyter Notebook, 1 star)