• Stars
    star
    8
  • Rank 2,088,395 (Top 42 %)
  • Language
    Python
  • Created over 3 years ago
  • Updated over 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

This is the code of PKU team for DCASE 2021 Task 6.

More Repositories

1

SpeechTasks

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.
72
star
2

MaskSpec

The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
Python
37
star
3

DuTa-VC

Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Python
32
star
4

SpecAugment-plus

A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
Python
30
star
5

Automatic_Speech_Annotator

Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognition
Python
27
star
6

nnAudio2

Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available on GPU.
Python
20
star
7

DCASE-2020-Task1A-Code

A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.
Python
19
star
8

Fast-GeCo

Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction
15
star
9

AT-GCN

Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
Python
14
star
10

LibriLightMix-WHAMR

Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
Python
14
star
11

GL-AT

Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.
Python
13
star
12

Aty-TTS

Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
Python
9
star
13

LibriLightMix-WHAM

Python scripts to create noisy mixture audio with Libri-Light and WHAM
Python
8
star
14

FPNet

A signal segmentation method of CNN for audio event classification
Python
7
star
15

CNN-model-and-visualization

A CNN model (RseNet) for image classification( CIFAR-10), including filter and output of layers visualization.
Python
6
star
16

Speech-paper-crawl

My Python scripts for crawling paper related on speech processing.
Python
5
star
17

Du-N2DVC-Demo

HTML
3
star
18

SCNN

SincConv layer using in AED and ASC
Python
3
star
19

Speech-Captioning-Dataset

Python
3
star
20

project2021

PKU team for 2021 project 'Guangchangwu detection'.
Python
3
star
21

DCASE2020-Task6-PKU

A Pytorch implementation of the DCASE2020 Task6 by PKU team : Automated Audio Captioning With Temporal Attention
Python
2
star
22

Babycry-sound-detection

PyTorch implementations of neural network models for Babycry sound detection, including training process and test demo. Based on DCASE2017 Task2: Detection of rare sound events.
Python
2
star
23

CommonVoice

Python
2
star
24

Aty-TTS-Demo

HTML
1
star
25

helinwang

JavaScript
1
star
26

Pytorch-audio_feature

Audio feature extraction in Pytorch module.
Python
1
star
27

dcase2019_1D

Dcase2019 Task1a using audio feature module.
Python
1
star
28

SSR-Speech-Demo

https://wanghelin1997.github.io/SSR-Speech-Demo/
JavaScript
1
star