  • Stars: 78
  • Rank: 412,246 (Top 9%)
  • Language: Python
  • License: MIT License
  • Created about 6 years ago
  • Updated 4 months ago

Repository Details

Dataset and baseline for the first Audiocaption task

More Repositories

1. GPV (Python, 140 stars): Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper
2. PLDA (Python, 99 stars): An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks
3. Datadriven-GPVAD (Python, 93 stars): The codebase for data-driven general-purpose voice activity detection
4. text_based_depression (Python, 44 stars): Source code for the paper "Text-based Depression Detection: What Triggers An Alert"
5. SAT (Python, 39 stars): Streaming audio transformers for online audio tagging
6. Dasheng (Python, 39 stars): Source for the Interspeech 2024 paper "Scaling up masked audio encoder learning for general audio classification"
7. PSL (Python, 30 stars): Source code for the ICASSP 2022 paper "Pseudo Strong labels for large scale weakly supervised audio tagging"
8. CDur (Python, 23 stars): Repository for the paper "Towards duration robust weakly supervised sound event detection"
9. UIT_Mobile (Python, 23 stars): Source code for the paper "Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers"
10. Speaker-Anti-Spoofing-Classifiers (Python, 18 stars): Baselines and classifiers for speaker anti-spoofing detection
11. Dcase2018_pooling (Python, 15 stars): Repo for our pooling approach on the DCASE2018 task4
12. HEAR2021_EfficientLatent (Python, 15 stars): Submission to the HEAR2021 Challenge
13. XiaomiVPN (Shell, 14 stars): A short introduction on how to successfully install a VPN client on a Xiaomi router
14. CED (Python, 10 stars): Source code for "Consistent ensemble distillation for audio tagging"
15. SpokenLanguageClassifiers (Python, 8 stars): Pretrained spoken language classifiers from audio
16. HEAR_CED (Python, 6 stars): HEAR evaluation for CED models
17. audiodataload (Lua, 2 stars): Audio data loaders for raw wave and HTK features in torch
18. NumericalAnalysis (TeX, 2 stars): Homework for Numerical Analysis
19. Sublime3-pydoc (Python, 2 stars): Sublime 3 Pydoc plugin
20. MatTheory (TeX, 1 star): Repo for the LaTeX files
21. Nanopi-R4S (Shell, 1 star): My NanoPi R4S builds
22. torchhtk (Lua, 1 star): A simple HTK (Hidden Markov Model Toolkit) dataloader for torch
23. Gentleplayer (Java, 1 star): Simple, easy, no-extras playlist generator for Android