• Stars
    76
  • Rank 420,374 (Top 9 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created about 1 year ago
  • Updated 3 months ago

Repository Details

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

More Repositories

1. FullSubNet
   PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
   Python, 529 stars

2. NBSS
   The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
   Python, 194 stars

3. McNet
   The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
   Python, 96 stars

4. audiossl
   A library built for easier audio self-supervised training and downstream task evaluation
   Python, 84 stars

5. FN-SSL
   The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization
   Python, 82 stars

6. FS-EEND
   The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
   Python, 72 stars

7. RealMAN
   A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
   Python, 58 stars

8. RVAE-EM
   Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP 2024]
   Python, 38 stars

9. pytorch_lightning_template_for_beginners
   A PyTorch template for beginners based on pytorch_lightning
   Python, 33 stars

10. Narrowband_DeepFiltering
    Python, 19 stars

11. UMA-ASR
    This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".
    Shell, 13 stars

12. RCT
    This repo contains the official implementation of RCT.
    Python, 12 stars

13. OnlineSSL_DPRTF_EG
    MATLAB, 8 stars

14. LSTM-noisePSD
    Python, 8 stars

15. bss_ctf_lasso
    MATLAB, 5 stars

16. Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement-
    Python, 3 stars

17. Audio-WestlakeU.github.io
    The Audio and Signal Information Processing Lab at Westlake University concentrates on speech processing algorithms.
    3 stars

18. DP_RTF_SSL
    MATLAB, 3 stars

19. SMIF_online_dereverb
    MATLAB, 3 stars

20. ATST-RCT
    ATST-RCT model for DCASE 2022 task4.
    Python, 2 stars

21. RTF_InterFrameSpecSub
    MATLAB, 1 star

22. RS_noisePSD
    MATLAB, 1 star