  • Stars: 529
  • Rank 83,810 (Top 2%)
  • Language: Python
  • License: MIT License
  • Created almost 4 years ago
  • Updated over 1 year ago

Repository Details

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

FullSubNet

Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement

Guides

The documentation is hosted on Read the Docs. Check the documentation for how to train and test models.

Citation

If you use this code for your research, please consider citing:

@INPROCEEDINGS{hao2020fullsubnet,
    author={Hao, Xiang and Su, Xiangdong and Horaud, Radu and Li, Xiaofei},
    booktitle={ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
    title={Fullsubnet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement},
    year={2021},
    pages={6633-6637},
    doi={10.1109/ICASSP39728.2021.9414177}
}

License

This repository is released under the MIT license.

More Repositories

1. NBSS (Python, 194 stars)
   The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

2. McNet (Python, 96 stars)
   The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

3. audiossl (Python, 84 stars)
   A library built for easier audio self-supervised training, downstream tasks evaluation

4. FN-SSL (Python, 82 stars)
   The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization

5. ATST-SED (Jupyter Notebook, 76 stars)
   This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

6. FS-EEND (Python, 72 stars)
   The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

7. RealMAN (Python, 58 stars)
   A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

8. RVAE-EM (Python, 38 stars)
   Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

9. pytorch_lightning_template_for_beginners (Python, 33 stars)
   A pytorch template for beginners based on pytorch_lightning

10. Narrowband_DeepFiltering (Python, 19 stars)

11. UMA-ASR (Shell, 13 stars)
    This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".

12. RCT (Python, 12 stars)
    This repo gives the code for the official implementation of RCT.

13. OnlineSSL_DPRTF_EG (MATLAB, 8 stars)

14. LSTM-noisePSD (Python, 8 stars)

15. bss_ctf_lasso (MATLAB, 5 stars)

16. Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement- (Python, 3 stars)

17. Audio-WestlakeU.github.io (3 stars)
    Audio and Signal Information Processing Lab in Westlake University concentrates on speech processing algorithm

18. DP_RTF_SSL (MATLAB, 3 stars)

19. SMIF_online_dereverb (MATLAB, 3 stars)

20. ATST-RCT (Python, 2 stars)
    ATST-RCT model for DCASE 2022 task4.

21. RTF_InterFrameSpecSub (MATLAB, 1 star)

22. RS_noisePSD (MATLAB, 1 star)