Yamagishi and Echizen Laboratories, National Institute of Informatics (@nii-yamagishilab)

Top repositories

1

project-NN-Pytorch-scripts

see README
Python
275
star
2

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020
Python
262
star
3

Capsule-Forensics-v2

Implementation of the Capsule-Forensics-v2
Python
114
star
4

self-attention-tacotron

An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960
Python
113
star
5

tacotron2

An implementation of Tacotron and Tacotron2
Python
81
star
6

project-CURRENNT-public

CURRENNNT codes and scripts
Cuda
76
star
7

ClassNSeg

Implementation and demonstration of the paper: Multi-task Learning for Detecting and Segmenting Manipulated Facial Images and Videos
Python
75
star
8

ZMM-TTS

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
C
72
star
9

project-CURRENNT-scripts

This repository contains the scripts to use CURRENNT
Python
64
star
10

Extended_VQVAE

Python
59
star
11

mos-finetune-ssl

Python
59
star
12

Intelligibility-MetricGAN

Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning"
Python
51
star
13

VCC2020-database

49
star
14

Attention_Backend_for_ASV

Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances
Python
45
star
15

TSNetVocoder

Python
42
star
16

Capsule-Forensics

Old implementation and demonstration of the Capsule-Forensics. The Capsule-Forensics-v2 has been released here: https://github.com/nii-yamagishilab/capsule-forensics-v2
Python
31
star
17

vctk-silence-labels

19
star
18

midi-to-audio

Project for MIDI to Audio Synthesis
Shell
17
star
19

PartialSpoof

Jupyter Notebook
17
star
20

NELE-GAN

Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement
Python
16
star
21

speaker_sex_attribute_privacy

Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
Python
14
star
22

SSL-SAS

Language independent SSL-based Speaker Anonymization system
Python
11
star
23

ssnt-tts

An implementation of SSNT-TTS.
Python
6
star
24

mla

A Multi-Level Attention Model for Evidence-Based Fact Checking
Python
4
star
25

downloader-DR-VCTK-complete

downloader to obtain the complete DR-VCTK dataset (250GB)
Python
4
star
26

Modular-CNN-for-CGIs-PIs-discrimination

Python
2
star
27

ewc

Python
2
star
28

fashion_adv

Fashion-Guided Adversarial Attack on Person Segmentation
Python
2
star
29

partial_rank_similarity

Jupyter Notebook
2
star
30

VCC2020-listeningtest

1
star
31

Generalization_of_CMs_regularizations

The source code for the paper Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms (interspeech2023)
Python
1
star
32

xfever

Shell
1
star