• Stars
    star
    13
  • Rank 1,512,713 (Top 30 %)
  • Language
    Shell
  • Created about 1 year ago
  • Updated 2 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".

More Repositories

1

FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Python
529
star
2

NBSS

The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
Python
194
star
3

McNet

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
Python
96
star
4

audiossl

A library built for easier audio self-supervised training, downstream tasks evaluation
Python
84
star
5

FN-SSL

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization
Python
82
star
6

ATST-SED

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
Jupyter Notebook
76
star
7

FS-EEND

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
Python
72
star
8

RealMAN

A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
Python
58
star
9

RVAE-EM

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]
Python
38
star
10

pytorch_lightning_template_for_beginners

A pytorch template for beginners based on pytorch_lightning
Python
33
star
11

Narrowband_DeepFiltering

Python
19
star
12

RCT

This repo gives the code for the official implementation of RCT.
Python
12
star
13

OnlineSSL_DPRTF_EG

MATLAB
8
star
14

LSTM-noisePSD

Python
8
star
15

bss_ctf_lasso

MATLAB
5
star
16

Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement-

Python
3
star
17

Audio-WestlakeU.github.io

Audio and Signal Information Processing Lab in Westlake University concentrates on speech processing algorithm
3
star
18

DP_RTF_SSL

MATLAB
3
star
19

SMIF_online_dereverb

MATLAB
3
star
20

ATST-RCT

ATST-RCT model for DCASE 2022 task4.
Python
2
star
21

RTF_InterFrameSpecSub

MATLAB
1
star
22

RS_noisePSD

MATLAB
1
star