  • Stars
    113
  • Rank 308,309 (Top 7%)
  • Language
    Python
  • License
    Apache License 2.0
  • Created 8 months ago
  • Updated 5 months ago

Repository Details

Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"

More Repositories

1. STAR-Adapt
   Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"
   Python, 241 stars

2. GenTranslate
   Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"
   Python, 189 stars

3. NASE
   Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"
   Python, 80 stars

4. DPSL-ASR
   Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
   Python, 35 stars

5. Unified-Enhance-Separation
   Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
   Python, 34 stars

6. GILA
   Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"
   Python, 17 stars

7. MIR-GAN
   Code for paper "MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition"
   Python, 14 stars

8. Gradient-Remedy
   Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"
   Python, 13 stars

9. UniVPM
   Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"
   Python, 12 stars

10. RATS-Channel-A-Speech-Data
    Public repository for the RATS Channel-A Speech Data, a paid noisy speech dataset available from LDC. It releases the dataset's Log-Mel Fbank features and several raw waveform listening samples.
    11 stars

11. UNA-GAN
    Code for paper "Unsupervised Noise adaptation using Data Simulation"
    Python, 6 stars

12. UNA-GAN-Demo
    HTML, 2 stars