WangHelin1997/DCASE2021_Task6_PKU

Stars: 8
Rank: 2,088,395 (Top 42 %)
Language: Python
Created over 3 years ago
Updated over 2 years ago

WangHelin1997

This is the PKU team's code for DCASE 2021 Task 6.

SpeechTasks

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.

MaskSpec

PyTorch implementation of the paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training

DuTa-VC

Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model

SpecAugment-plus

A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification

Automatic_Speech_Annotator

Automatic speech annotator that processes speech with voice activity detection, overlapping speech detection, speaker diarization, and automatic speech recognition

nnAudio2

Audio processing using PyTorch 1D convolutional networks (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available on GPU.

DCASE-2020-Task1A-Code

A PyTorch implementation of the paper: Acoustic Scene Classification with Multiple Decision Schemes.

Fast-GeCo

Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction

AT-GCN

PyTorch implementation of the paper: Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network

LibriLightMix-WHAMR

Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM

GL-AT

PyTorch implementation of the paper: A Global-local Attention Framework for Weakly Labelled Audio Tagging.

Aty-TTS

Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech

LibriLightMix-WHAM

Python scripts to create noisy mixture audio with Libri-Light and WHAM

FPNet

A signal segmentation method of CNN for audio event classification

CNN-model-and-visualization

A CNN model (ResNet) for image classification (CIFAR-10), including visualization of filters and layer outputs.

Speech-paper-crawl

My Python scripts for crawling papers related to speech processing.

Du-N2DVC-Demo

SCNN

SincConv layer used in AED and ASC

Speech-Captioning-Dataset

project2021

PKU team for 2021 project 'Guangchangwu detection'.

DCASE2020-Task6-PKU

A PyTorch implementation of the PKU team's DCASE2020 Task6 system: Automated Audio Captioning With Temporal Attention

Babycry-sound-detection

PyTorch implementations of neural network models for Babycry sound detection, including training process and test demo. Based on DCASE2017 Task2: Detection of rare sound events.

CommonVoice

Aty-TTS-Demo

helinwang

Pytorch-audio_feature

Audio feature extraction as a PyTorch module.

dcase2019_1D

DCASE2019 Task1A using the audio feature module.

SSR-Speech-Demo

https://wanghelin1997.github.io/SSR-Speech-Demo/