fast-transformers
PyTorch library for fast transformer implementations
importance-sampling
Code for experiments regarding importance sampling for training neural networks
bob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland.
fullgrad-saliency
Full-gradient saliency maps
ESLAM
multicamera-calibration
Multi-Camera Calibration Suite
GeoNeRF
Generalizing NeRF with Geometry Priors
attention-sampling
This Python package enables the training and inference of deep learning models for very large data, such as megapixel images, using attention-sampling
acoustic-simulator
Implementation of audio degradation processes
mser
Linear-time Maximally Stable Extremal Regions implementation
kaldi-ivector
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure
mhan
Multilingual hierarchical attention networks toolkit
pkwrap
A PyTorch wrapper for LF-MMI training and parallel training in Kaldi
HAN_NMT
Document-Level Neural Machine Translation with Hierarchical Attention Networks
gafro
An efficient C++ library targeting robotics applications using geometric algebra
juicer
Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).
facereclib
Compare your face recognition algorithm to baseline algorithms
g2g-transformer
PyTorch implementation of “Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement”
model-uncertainty-for-adaptation
Code for the paper "Uncertainty Reduction for Model Adaptation in Semantic Segmentation" at CVPR 2021
eakmeans
Implementation of fast exact k-means algorithms
ssp
Speech Signal Processing - a small collection of routines in Python to do signal processing
atco2-corpus
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
residual_pose
Residual Pose: A Decoupled Approach for Depth-based 3D Human Pose Estimation
potr
CNN_QbE_STD
Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"
w2v2-air-traffic
nnsslm
Neural Network based Sound Source Localization Models
psfestimation
Code for the PyTorch implementation of "Spatially-Variant CNN-based Point Spread Function Estimation for Blind Deconvolution and Depth Estimation in Optical Microscopy", IEEE Transactions on Image Processing, 2020.
gile
A generalized input-label embedding for text classification
IBDiarization
C++ Implementation of the Information Bottleneck System
semiblindpsfdeconv
Code for "Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation By Use Of Convolutional Neural Networks" ICIP 2018
IdiapTTS
A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis
HMMGradients.jl
Enables computing the gradient of the parameters of Hidden Markov Models (HMMs)
inv-tn
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
deepfocus
PyTorch implementation of "DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function"
zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
contextual-biasing-on-gpus
Implementation of contextual biasing for ASR decoding on GPUs without lattice generation. The code supports a submission to Interspeech 2023.
icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
multimodal_gaze_target_prediction
This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings" published at the GAZE workshop at CVPR 2022
phonvoc
Phonetic and phonological vocoding platform
asrt
Various scripts that facilitate the preparation of Automatic Speech Recognition related resources
fast_pose_machines
Efficient Pose Machine for Multi-Person Pose Estimation
apam
The APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence-discriminative training criteria.
libssp
Speech Signal Processing - C++ port of a subset of the Python library SSP
cbrec
Content-based Recommendation Generator
wmil-sgd
Weighted multiple-instance learning algorithm based on stochastic gradient descent
torgo_asr
A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech
sparch
PyTorch-based toolkit for developing spiking neural networks (SNNs) by training and testing them on speech command recognition tasks
ttgo
A PyTorch implementation of the TTGO algorithm and the applications presented in the paper "Tensor Train for Global Optimization Problems in Robotics"
iss
Scripts for speech processing
hypermixing
PyTorch implementation of HyperMixing, a linear-time token-mixing technique used in the HyperMixer architecture
DepthInSpace
A PyTorch-based program that estimates 3D depth maps from multiple video frames of an active structured-light sensor
rgbd
tracter
Tracter is a data flow framework.
drill
Deep residual output layers for neural language generation
nvib_transformers
bert-text-diarization-atc
pddetection-reps-learning
Supervised Speech Representation Learning for Parkinson's Disease Classification
zentas
Partitional data clustering around centers
linear-transformer-experiments
Experiments using fast linear transformers
emorec
Emotion-based Recommendation Generator
DocRec
Keyword extraction and document recommendation in conversations
depth_human_synthesis
DepthHuman: A tool for depth image synthesis for human pose estimation
gafar
Geometry-aware Face Reconstruction
nvib
hallucination-detection
cnn-for-voice-antispoofing
CNNs for voice antispoofing detection
wav2vec-lfmmi
Recipes for fine-tuning a pre-trained wav2vec 2.0 model using the Espresso toolkit
ilqr_planner
A C++ iLQR library that allows you to solve an iLQR optimization problem on any robot, as long as you provide a URDF file describing the kinematic chain of the robot
APT
A reference-based metric to evaluate the accuracy of pronoun translation (APT)
sentence-planner
iss-dicts
ISS scripts for handling pronunciation dictionaries
cncsharedtask
slog
Similarity Learning on Graph (SLOG) MATLAB code
vfoa
Methods to estimate the visual focus of attention
buslr
BuSLR: Build System for Speech and Language Research
Node_weighted_GCN_for_depression_detection
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews
abroad-re
Towards an end-to-end Relation Extraction system for the natural product literature: datasets, strategies and models
ML3
ML3 classifier (Multiclass Latent Locally Linear Support Vector Machines)
ssl-caller-detection
Source code for the paper 'Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?' by E. Sarkar and M. Magimai Doss (2023).
sense_aware_NMT
Sense-aware Neural Machine Translation
ExVo-2022
Extracting pre-trained self-supervised embeddings for the ICML ExVO 2022 challenge
php-geremo
PHP Generic Registration Module [GPLv3]
idiap.github.com
Main page for idiap@github
TIDIGITSRecipe.jl
A Julia recipe for training an ASR system using the TIDIGITS database
hpca
bayesian-recurrence
A Bayesian Interpretation of Recurrence in Neural Networks
rethinking-saliency
Reference implementation of the ICLR 2021 paper "Rethinking the Role of Gradient-Based Attribution Methods for Model Interpretability".
DiscoConn-Classifier
Classifier models and feature extractors for discourse relations
pydhn
unsupervised_gaze_calibration
Calibrates a gaze estimator in an unsupervised fashion by automatically collecting calibration samples using task-related priors
Attentive_Residual_Connections_NMT
Implementation and output data of "Global-Context Neural Machine Translation through Target-Side Attentive Residual Connections"
FiniteStateTransducers.jl
Play with Weighted Finite State Transducers (WFST) in the Julia language.
iss-wsj
ISS scripts for the Wall Street Journal task
archs
PyTorch network architectures for audio perception
dhgen
A Python module for generating District Heating Networks layouts
tinyurdfparser
A lightweight URDF parser library, based on TinyXML2, that converts a URDF file into a KDL object
flowestimation
PyTorch implementation of "Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning", submitted to IEEE ISBI, 2021
apkit
Audio processing toolkit
trimed
The trimed algorithm for obtaining the medoid of a set