pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.anserini
Anserini is a Lucene toolkit for reproducible information retrieval researchdaam
Diffusion attentive attribution maps for interpreting Stable Diffusion.hedwig
PyTorch deep learning models for document classificationhonk
PyTorch implementations of neural network models for keyword spottingdocTTTTTquery
docTTTTTquery document expansion modelpygaggle
a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserinirank_llm
Repository for prompt-decoding using LLMs (GPT3.5, GPT4, Vicuna, and Zephyr)BuboQA
Simple question answering over knowledge graphs (Mohammed et al., NAACL 2018)howl
Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.castor
PyTorch deep learning models for text processingDeeBERT
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inferencebirch
Document ranking via sentence modeling using BERTcovidex
A multi-stage neural search engine for the COVID-19 Open Research Datasetduobert
Multi-stage passage ranking: monoBERT + duoBERTMP-CNN-Torch
Multi-Perspective Convolutional Neural Networks for modeling textual similarity (He et al., EMNLP 2015)mr.tydi
Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.anserini-notebooks
Anserini notebookshonkling
Web app for keyword spotting using TensorflowJSafriberta
AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languagesdhr
Dense hybrid representations for text retrievaldata
Castorini dataNCE-CNN-Torch
Noise-Contrastive Estimation for Question Answering with Convolutional Neural Networks (Rao et al. CIKM 2016)chatty-goose
A Python framework for conversational searchtransformers-arithmetic
d-bert
Distilling BERT using natural language generation.hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Faceragnarok
Retrieval-Augmented Generation battle!anserini-tools
Evaluation tools shared across anserini, pyserini, and pygagglebertserini
BERTseriniSimpleDBpediaQA
simple QA over knowledge graphs on DBpediaonboarding
Onboarding guide to Jimmy Lin's research group at the University of Waterlooberxit
umbrela
VDPWI-NN-Torch
Very Deep Pairwise Word Interaction Neural Networks for modeling textual similarity (He and Lin, NAACL/HLT 2016)perm-sc
Official codebase for permutation self-consistency.LiT5
TREC-COVID
TREC-COVID results - this is a mirror of data on the TREC website in a more convenient format.honk-models
Pre-trained models for Honkhowl-deploy
JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox VoiceTweets2013-IA
The Tweets2013 Internet Archive collectionAfriTeVa-keji
TrecQA-NegEx
Code and dataset for SIGIR 2017 short paper "Automatically Extracting High-Quality Negative Examples for Answer Selection in Question Answering"meanmax
MeanMax estimators.cqe
SM-CNN-Torch
Torch implementation of Severyn and Moschitti's SIGIR 2015 CNN model for question answeringONNX-demo
anserini-notebooks-afirm2020
Colab notebooks for AFIRM '20serverless-bert-reranking
parrot
Keyword spotting using audio from speech synthesis services and YouTubetouche-error-analysis
A reproduction study of the Touché 2020 dataset in the BEIR benchmarkearlyexiting-monobert
afriteva
Text - 2 - Text for African languagestct_colbert
transformers-selective
serverless-inference
Neural network inference on serverless architecturenorbert
NorBERT: Anserini + dl4marco-bertanserini-spark
Anserini-Spark integrationrank_llm_data
numbert
Passage Ranking Library using various pretrained LMskim-cnn-vis
An in-browser visualization of Kim CNNreplicate-lce
kws-gen-data
Data for KWS generator.pyserini-data
BuboQA-models
candle
PyTorch utilities for parameter pruning and multiplies reductiongooselight2
Search frontend for Anseriniafriclirmatrix
AfriCLIRMatrix is a test collection for cross-lingual information retrieval research in 15 diverse African languages.biasprobe
sigtestv
SIGnificance TESTing Violations: an end-to-end toolkit for evaluating neural networks.howl-models
SolrAnserini
Anserini integration with Solrgooselight
🦆 Anserini + Blacklight 🦆anlessini
honkling-models
BuboQA-data
Hosting dataset for BuboQAragnarok_data
Love Open Source and this site? Check out how you can help us