• Stars
    star
    553
  • Rank 80,462 (Top 2 %)
  • Language
    Python
  • License
    MIT License
  • Created about 1 year ago
  • Updated 20 days ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Speech, Language, Audio, Music Processing with Large Language Model

More Repositories

1

AniTalker

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Jupyter Notebook
1,335
star
2

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
Python
302
star
3

text2sql-lgesql

[ACL 2021] This is the project containing source codes and pre-trained models about ACL2021 Long Paper ``LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations".
Python
147
star
4

UniCATS-CTX-vec2wav

[AAAI 2024] Code for CTX-vec2wav in UniCATS
Python
119
star
5

UniCATS-CTX-txt2vec

[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
Python
61
star
6

WebSRC-Baseline

The baseline code for WebSRC dataset.
HTML
37
star
7

Mobile-Env

A Universal Platform for Training and Evaluation of Mobile Interaction
Python
29
star
8

StoryTTS

https://goarsenal.github.io/StoryTTS/
HTML
20
star
9

BER

Balanced Error Rate for Speaker Diarization
Python
18
star
10

MSDWILD

This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
HTML
16
star
11

text2sql-GPT

[EMNLP 2023 Findings] ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought
Python
16
star
12

public_talks

Materials of public talks given By SJTU X-LANCE members
14
star
13

suzhou-tutorials

A brief tutorial and startup scripts about suzhou clusters for members of speechlab
Shell
13
star
14

PaperReading

整理各研究方向经典论文
10
star
15

META-GUI-baseline

[EMNLP 2022] The baseline code for META-GUI dataset
Python
10
star
16

TIE

Python
9
star
17

text2sql-multiturn-GPT

[NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
Python
6
star
18

medical-dataset

[ACL 2023 Findings] CSS: A Large-scale Cross-schema Chinese Text-to-SQL Medical Dataset
Python
5
star
19

AttrEnhZsAc

Python
4
star
20

WebSRC

WebSRC: A dataset for web based structural machine reading comprehension.
CSS
2
star
21

D4

[EMNLP 2022] D4: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat
CSS
1
star