Alibaba Damo Academy (@alibaba-damo-academy)
  • Stars
    star
    519
  • Global Org. Rank 23,333 (Top 8 %)
  • Followers 41
  • Registered about 2 years ago
  • Most used languages
    Python
    93.8 %

Top repositories

1

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Python
3,955
star
2

FunClip

Open-source, accurate and easy-to-use video clipping tool | εΌ€ζΊγ€η²Ύε‡†γ€ζ–ΉδΎΏηš„θ§†ι’‘εˆ‡η‰‡ε·₯ε…·
Python
956
star
3

3D-Speaker

A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
Python
630
star
4

KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
Python
415
star
5

FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Python
273
star
6

SpokenNLP

A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.
Python
100
star
7

former3d

Python
97
star
8

alice

34
star
9

kws-training-suite

Python
22
star
10

self-supervised-anatomical-embedding-v2

Python
22
star
11

same

Medical image registration, affine and deformable registration
Python
22
star
12

ct-sam3d

Jupyter Notebook
19
star
13

pixel-lesion-patient-network

Python
19
star
14

Med_Query

Python
19
star
15

samconvex

Fast Discrete Optimization for CT Registration
Python
13
star
16

universe

A lightweight driving simulator based on vectorized representation.
Python
5
star
17

ai-for-social-science

Python
4
star
18

Building-Energy-Management-Ensemble-Approach

2
star