There are no reviews yet. Be the first to send feedback to the community and the maintainers!
GPV
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paperPLDA
An LDA/PLDA estimator using KALDI in python for speaker verification tasksDatadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.AudioCaption
Dataset and baseline for the first Audiocaption tasktext_based_depression
Source code for the paper "Text-based Depression Detection: What Triggers An Alert"SAT
Streaming Audiotransformers for online Audio taggingDasheng
Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"PSL
Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"CDur
Repository for the paper "Towards duration robust weakly supervised sound event detection"UIT_Mobile
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"Speaker-Anti-Spoofing-Classifiers
Baselines and Classifiers for speaker anti-spoofing detectionDcase2018_pooling
Repo for our pooling approach on the DCASE2018 task4HEAR2021_EfficientLatent
Submission to the HEAR2021 ChallengeXiaomiVPN
A short introduction how to successfully install a VPN client on a Xiaomi router.CED
Source code for Consistent ensemble distillation for audio taggingSpokenLanguageClassifiers
Pretrained spoken language classifiers from audio.HEAR_CED
Hear evaluation for CED models.audiodataload
Audiodataloaders for raw wave and HTK features in torch.NumericalAnalysis
Homework for Numerical AnalysisSublime3-pydoc
Sublime 3 Pydoc pluginMatTheory
Repo for the Latex filesNanopi-R4S
My NanoPi R4S buildstorchhtk
A simple HTK (Hidden markov kit) dataloader for torchLove Open Source and this site? Check out how you can help us