• Stars
    star
    3
  • Rank 3,944,605 (Top 79 %)
  • Language
    Python
  • Created 3 months ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

More Repositories

1

SpeechTasks

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.
72
star
2

MaskSpec

The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
Python
37
star
3

DuTa-VC

Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Python
32
star
4

SpecAugment-plus

A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
Python
30
star
5

Automatic_Speech_Annotator

Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognition
Python
27
star
6

nnAudio2

Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available on GPU.
Python
20
star
7

DCASE-2020-Task1A-Code

A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.
Python
19
star
8

Fast-GeCo

Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction
15
star
9

AT-GCN

Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
Python
14
star
10

LibriLightMix-WHAMR

Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
Python
14
star
11

GL-AT

Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.
Python
13
star
12

Aty-TTS

Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
Python
9
star
13

DCASE2021_Task6_PKU

This is the code of PKU team for DCASE 2021 Task 6.
Python
8
star
14

LibriLightMix-WHAM

Python scripts to create noisy mixture audio with Libri-Light and WHAM
Python
8
star
15

FPNet

A signal segmentation method of CNN for audio event classification
Python
7
star
16

CNN-model-and-visualization

A CNN model (RseNet) for image classification( CIFAR-10), including filter and output of layers visualization.
Python
6
star
17

Speech-paper-crawl

My Python scripts for crawling paper related on speech processing.
Python
5
star
18

Du-N2DVC-Demo

HTML
3
star
19

SCNN

SincConv layer using in AED and ASC
Python
3
star
20

project2021

PKU team for 2021 project 'Guangchangwu detection'.
Python
3
star
21

DCASE2020-Task6-PKU

A Pytorch implementation of the DCASE2020 Task6 by PKU team : Automated Audio Captioning With Temporal Attention
Python
2
star
22

Babycry-sound-detection

PyTorch implementations of neural network models for Babycry sound detection, including training process and test demo. Based on DCASE2017 Task2: Detection of rare sound events.
Python
2
star
23

CommonVoice

Python
2
star
24

Aty-TTS-Demo

HTML
1
star
25

helinwang

JavaScript
1
star
26

Pytorch-audio_feature

Audio feature extraction in Pytorch module.
Python
1
star
27

dcase2019_1D

Dcase2019 Task1a using audio feature module.
Python
1
star
28

SSR-Speech-Demo

https://wanghelin1997.github.io/SSR-Speech-Demo/
JavaScript
1
star