There are no reviews yet. Be the first to send feedback to the community and the maintainers!
fast-transformers
Pytorch library for fast transformer implementationsimportance-sampling
Code for experiments regarding importance sampling for training neural networksbob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland.ESLAM
fullgrad-saliency
Full-gradient saliency mapsmulticamera-calibration
Multi-Camera Calibration SuiteGeoNeRF
Generalizing NeRF with Geometry Priorsattention-sampling
This Python package enables the training and inference of deep learning models for very large data, such as megapixel images, using attention-samplingacoustic-simulator
Implementation of audio degradation processesmser
Linear time Maximally Stable Extremal Regions implementationkaldi-ivector
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction proceduremhan
Multilingual hierarchical attention networks toolkitpkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldigafro
An efficient c++ library targeting robotics applications using geometric algebrag2g-transformer
Pytorch implementation of βRecursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinementβjuicer
Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).facereclib
Compare your face recognition algorithm to baseline algorithmssigma-gpt
Ο-GPT: A New Approach to Autoregressive Modelsmodel-uncertainty-for-adaptation
Code paper Uncertainty Reduction for Uncertainty Reduction for Model Adaptation in Semantic Segmentation at CVPR 2021eakmeans
Implementation of fast exact k-means algorithmsatco2-corpus
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communicationsssp
Speech Signal Processing - a small collection of routines in Python to do signal processingpsfestimation
Code for the PyTorch implementation of "Spatially-Variant CNN-based Point Spread Function Estimation for Blind Deconvolution and Depth Estimation in Optical Microscopy", IEEE Transactions on Image Processing, 2020.w2v2-air-traffic
potr
residual_pose
Residual Pose: A Decoupled Approach for Depth-based 3D Human Pose EstimationCNN_QbE_STD
Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"nnsslm
Neural Network based Sound Source Localization Modelssemiblindpsfdeconv
Code for "Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation By Use Of Convolutional Neural Networks" ICIP 2018IBDiarization
C++ Implementation of the Information Bottleneck Systemgile
A generalized input-label embedding for text classificationIdiapTTS
A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesisHMMGradients.jl
Enables computing the gradient of the parameters of Hidden Markov Models (HMMs)inv-tn
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)deepfocus
Pytorch implementation of "DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function"multimodal_gaze_target_prediction
This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings" published at the GAZE workshop at CVPR 2022hypermixing
PyTorch implementation for HyperMixing, a linear-time token-mixing technique used in HyperMixer architecturesparch
PyTorch based toolkit for developing spiking neural networks (SNNs) by training and testing them on speech command recognition taskszff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filteringcontextual-biasing-on-gpus
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech 2023.icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"asrt
Various scripts that facilitate the preparation of Automatic Speech Recognition related resourcesphonvoc
Phonetic and phonological vocoding platformfast_pose_machines
Efficient Pose Machine for Multi-Person Pose Estimationttgo
A PyTorch implementation of TTGO algorithm and the applications presented in the paper "Tensor Train for Global Optimization Problems in Robotics"apam
APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative training criterions.torgo_asr
A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speechlibssp
Speech Signal Processing - C++ port of a subset of the Python library SSPcbrec
Content-based Recommendation Generatorwmil-sgd
Weighted multiple-instance learning algorithm based on stochastic gradient descentbert-text-diarization-atc
DepthInSpace
A PyTorch-based program which estimates 3D depth maps from active structured-light sensor's multiple video framesiss
Scripts for speech processingrgbd
tracter
Tracter is a data flow framework.pddetection-reps-learning
Supervised Speech Representation Learning for Parkinson's Disease Classificationdrill
Deep residual output layers for neural language generationnvib_transformers
Node_weighted_GCN_for_depression_detection
Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviewsdepth_human_synthesis
DepthHuman: A tool for depth image synthesis for human pose estimationilqr_planner
A C++ iLQR library that allows you to solve iLQR optimization problem on any robot as long as you provide an URDF file describing the kinematics chain of the robotgafar
Geometry-aware Face Reconstructionzentas
Partitional data clustering around centerslinear-transformer-experiments
Experiments using fast linear transformeremorec
Emotion-based Recommendation Generatorhallucination-detection
DocRec
Keyword extraction and document recommendation in conversationsabroad-re
Towards an end-to-end Relation Extraction system for the natural product literature: datasets, strategies and modelsnvib
cnn-for-voice-antispoofing
CNNs for voice antispoofing detectionwav2vec-lfmmi
Recipes from fine-tuning a pre-trained wav2vec 2.0 model using the espresso tool kitpydhn
APT
A reference-based metric to evaluate the accuracy of pronoun translation (APT)iss-dicts
ISS scripts for handling pronunciation dictionariessentence-planner
slog
Similarity Learning on Graph (SLOG) matlab codescncsharedtask
inference-from-real-world-sparse-measurements
Implementation of the Multi-Layer Self-Attention, a state-of-the-art model designed for wind nowcasting tasksssl-caller-detection
Source code for the paper 'Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?' by E. Sarkar and M. Magimai Doss (2023).ExVo-2022
Extracting pre-trained self-supervised embeddings for ICML ExVO 2022 challengevfoa
Methods to estimate the visual focus of attentionbuslr
BuSLR: Build System for Speech and Language Researchdhgen
A Python module for generating District Heating Networks layoutsbayesian-recurrence
A Bayesian Interpretation of Recurrence in Neural NetworksTactileErgodicExploration
A Python package for ergodic control on point cloud using diffusion. It is supplementary material for the paper "Tactile Ergodic Control Using Diffusion and Geometric Algebra".ML3
ML3 classifier (Multiclass Latent Locally Linear Support Vector Machines)sense_aware_NMT
Sense-aware Neural Machine Translationphp-geremo
PHP Generic Registration Module [GPLv3]idiap.github.com
Main page for idiap@githubtinyurdfparser
A lightweight URDF parser library, based on TinyXML2, that converts an [URDF file] into a KDL objectTIDIGITSRecipe.jl
A Julia recipe for training an ASR system using the TIDIGITS databasehpca
rethinking-saliency
Reference implementation of the ICLR 2021 paper "Rethinking the Role of Gradient-Based Attribution Methods for Model Interpretability".DiscoConn-Classifier
Classifier models and feature extractors for discourse relationspygafro
A geometric algebra library targeted towards robotics applicationsanonymization
A Python library for anonymizing sensitive information in text data. Focused on Swiss French banking data.unsupervised_gaze_calibration
Allows to calibrate a gaze estimator in an unsupervised fashion by automatically collecting calibration samples using task-related priorsAttentive_Residual_Connections_NMT
Implementation and output data of "Global-Context Neural Machine Translation through Target-Side Attentive Residual Connections"FiniteStateTransducers.jl
Play with Weighted Finite State Transducers (WFST) in the Julia language.iss-wsj
ISS scripts for the Wall Street Journal taskLove Open Source and this site? Check out how you can help us