There are no reviews yet. Be the first to send feedback to the community and the maintainers!
FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberationMcNet
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023audiossl
A library built for easier audio self-supervised training, downstream tasks evaluationATST-SED
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]RealMAN
A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and LocalizationRVAE-EM
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]pytorch_lightning_template_for_beginners
A pytorch template for beginners based on pytorch_lightningNarrowband_DeepFiltering
UMA-ASR
This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".RCT
This repo gives the code for the official implementation of RCT.OnlineSSL_DPRTF_EG
LSTM-noisePSD
bss_ctf_lasso
Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement-
Audio-WestlakeU.github.io
Audio and Signal Information Processing Lab in Westlake University concentrates on speech processing algorithmDP_RTF_SSL
SMIF_online_dereverb
ATST-RCT
ATST-RCT model for DCASE 2022 task4.RTF_InterFrameSpecSub
RS_noisePSD
Love Open Source and this site? Check out how you can help us