There are no reviews yet. Be the first to send feedback to the community and the maintainers!
fast-transformers
Pytorch library for fast transformer implementationsimportance-sampling
Code for experiments regarding importance sampling for training neural networksbob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland.fullgrad-saliency
Full-gradient saliency mapsESLAM
multicamera-calibration
Multi-Camera Calibration SuiteGeoNeRF
Generalizing NeRF with Geometry Priorsattention-sampling
This Python package enables the training and inference of deep learning models for very large data, such as megapixel images, using attention-samplingacoustic-simulator
Implementation of audio degradation processesmser
Linear time Maximally Stable Extremal Regions implementationkaldi-ivector
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction proceduremhan
Multilingual hierarchical attention networks toolkitpkwrap
A pytorch wrapper for LF-MMI training and parallel training in KaldiHAN_NMT
Document-Level Neural Machine Translation with Hierarchical Attention Networksgafro
An efficient c++ library targeting robotics applications using geometric algebrajuicer
Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).facereclib
Compare your face recognition algorithm to baseline algorithmsg2g-transformer
Pytorch implementation of “Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement”model-uncertainty-for-adaptation
Code paper Uncertainty Reduction for Uncertainty Reduction for Model Adaptation in Semantic Segmentation at CVPR 2021eakmeans
Implementation of fast exact k-means algorithmsssp
Speech Signal Processing - a small collection of routines in Python to do signal processingatco2-corpus
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communicationspotr
residual_pose
Residual Pose: A Decoupled Approach for Depth-based 3D Human Pose EstimationCNN_QbE_STD
Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"w2v2-air-traffic
nnsslm
Neural Network based Sound Source Localization Modelspsfestimation
Code for the PyTorch implementation of "Spatially-Variant CNN-based Point Spread Function Estimation for Blind Deconvolution and Depth Estimation in Optical Microscopy", IEEE Transactions on Image Processing, 2020.IBDiarization
C++ Implementation of the Information Bottleneck Systemgile
A generalized input-label embedding for text classificationsemiblindpsfdeconv
Code for "Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation By Use Of Convolutional Neural Networks" ICIP 2018IdiapTTS
A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesisHMMGradients.jl
Enables computing the gradient of the parameters of Hidden Markov Models (HMMs)inv-tn
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)deepfocus
Pytorch implementation of "DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function"zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filteringcontextual-biasing-on-gpus
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech 2023.icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"multimodal_gaze_target_prediction
This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings" published at the GAZE workshop at CVPR 2022phonvoc
Phonetic and phonological vocoding platformasrt
Various scripts that facilitate the preparation of Automatic Speech Recognition related resourcesfast_pose_machines
Efficient Pose Machine for Multi-Person Pose Estimationapam
APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative training criterions.libssp
Speech Signal Processing - C++ port of a subset of the Python library SSPcbrec
Content-based Recommendation Generatortorgo_asr
A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speechwmil-sgd
Weighted multiple-instance learning algorithm based on stochastic gradient descentttgo
A PyTorch implementation of TTGO algorithm and the applications presented in the paper "Tensor Train for Global Optimization Problems in Robotics"iss
Scripts for speech processinghypermixing
PyTorch implementation for HyperMixing, a linear-time token-mixing technique used in HyperMixer architectureDepthInSpace
A PyTorch-based program which estimates 3D depth maps from active structured-light sensor's multiple video framesrgbd
tracter
Tracter is a data flow framework.drill
Deep residual output layers for neural language generationnvib_transformers
bert-text-diarization-atc
pddetection-reps-learning
Supervised Speech Representation Learning for Parkinson's Disease Classificationzentas
Partitional data clustering around centerslinear-transformer-experiments
Experiments using fast linear transformeremorec
Emotion-based Recommendation GeneratorDocRec
Keyword extraction and document recommendation in conversationsdepth_human_synthesis
DepthHuman: A tool for depth image synthesis for human pose estimationgafar
Geometry-aware Face Reconstructionnvib
hallucination-detection
cnn-for-voice-antispoofing
CNNs for voice antispoofing detectionwav2vec-lfmmi
Recipes from fine-tuning a pre-trained wav2vec 2.0 model using the espresso tool kitilqr_planner
A C++ iLQR library that allows you to solve iLQR optimization problem on any robot as long as you provide an URDF file describing the kinematics chain of the robotAPT
A reference-based metric to evaluate the accuracy of pronoun translation (APT)sentence-planner
iss-dicts
ISS scripts for handling pronunciation dictionariescncsharedtask
slog
Similarity Learning on Graph (SLOG) matlab codesvfoa
Methods to estimate the visual focus of attentionbuslr
BuSLR: Build System for Speech and Language ResearchNode_weighted_GCN_for_depression_detection
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviewsabroad-re
Towards an end-to-end Relation Extraction system for the natural product literature: datasets, strategies and modelsML3
ML3 classifier (Multiclass Latent Locally Linear Support Vector Machines)sense_aware_NMT
Sense-aware Neural Machine Translationssl-caller-detection
Source code for the paper 'Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?' by E. Sarkar and M. Magimai Doss (2023).ExVo-2022
Extracting pre-trained self-supervised embeddings for ICML ExVO 2022 challengephp-geremo
PHP Generic Registration Module [GPLv3]idiap.github.com
Main page for idiap@githubTIDIGITSRecipe.jl
A Julia recipe for training an ASR system using the TIDIGITS databasehpca
bayesian-recurrence
A Bayesian Interpretation of Recurrence in Neural Networksrethinking-saliency
Reference implementation of the ICLR 2021 paper "Rethinking the Role of Gradient-Based Attribution Methods for Model Interpretability".DiscoConn-Classifier
Classifier models and feature extractors for discourse relationspydhn
unsupervised_gaze_calibration
Allows to calibrate a gaze estimator in an unsupervised fashion by automatically collecting calibration samples using task-related priorsAttentive_Residual_Connections_NMT
Implementation and output data of "Global-Context Neural Machine Translation through Target-Side Attentive Residual Connections"FiniteStateTransducers.jl
Play with Weighted Finite State Transducers (WFST) in the Julia language.iss-wsj
ISS scripts for the Wall Street Journal taskarchs
Pytorch network architectures for audio perceptiondhgen
A Python module for generating District Heating Networks layoutstinyurdfparser
A lightweight URDF parser library, based on TinyXML2, that converts an [URDF file] into a KDL objectflowestimation
PyTorch implementation of "Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning", submitted to IEEE ISBI, 2021apkit
Audio processing toolkittrimed
The trimed algorithm for obtaining the medoid of a setsimple-imager
Linux Imaging and Deployment Made EasyLove Open Source and this site? Check out how you can help us