There are no reviews yet. Be the first to send feedback to the community and the maintainers!
SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.MaskSpec
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-TrainingDuTa-VC
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic ModelSpecAugment-plus
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene ClassificationAutomatic_Speech_Annotator
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognitionnnAudio2
Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available on GPU.DCASE-2020-Task1A-Code
A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.Fast-GeCo
Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative CorrectionAT-GCN
Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional NetworkLibriLightMix-WHAMR
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAMGL-AT
Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.Aty-TTS
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-SpeechDCASE2021_Task6_PKU
This is the code of PKU team for DCASE 2021 Task 6.FPNet
A signal segmentation method of CNN for audio event classificationCNN-model-and-visualization
A CNN model (RseNet) for image classification( CIFAR-10), including filter and output of layers visualization.Speech-paper-crawl
My Python scripts for crawling paper related on speech processing.Du-N2DVC-Demo
SCNN
SincConv layer using in AED and ASCSpeech-Captioning-Dataset
project2021
PKU team for 2021 project 'Guangchangwu detection'.DCASE2020-Task6-PKU
A Pytorch implementation of the DCASE2020 Task6 by PKU team : Automated Audio Captioning With Temporal AttentionBabycry-sound-detection
PyTorch implementations of neural network models for Babycry sound detection, including training process and test demo. Based on DCASE2017 Task2: Detection of rare sound events.CommonVoice
Aty-TTS-Demo
helinwang
Pytorch-audio_feature
Audio feature extraction in Pytorch module.dcase2019_1D
Dcase2019 Task1a using audio feature module.SSR-Speech-Demo
https://wanghelin1997.github.io/SSR-Speech-Demo/Love Open Source and this site? Check out how you can help us