There are no reviews yet. Be the first to send feedback to the community and the maintainers!
llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'medal
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domainlength-generalization
Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023bias-bench
ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.weblinx
WebLINX is a benchmark for building web navigation agents with conversational capabilitiesinstruct-qa
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"polytropon
topiocqa
Code and data for reproducing baselines for TopiOCQA, an open-domain conversational question-answering datasetimagecode
Code and data for ImageCoDe, a contextual vison-and-language benchmarkretriever-lm-reasoning
Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 2023diffusion-itm
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"MLQuestions
latent-translation
Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"AdversarialTriggers
Code for "Universal Adversarial Triggers Are Not Universal."feedbackqa
FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedbackcontextual-nmn
VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"ud-to-meaning
A repository for code related to a project mapping Universal Dependencies syntactic forms to meaning representations.StarCoderSafetyEval
Code for safety evaluations of StarCoder.Love Open Source and this site? Check out how you can help us