SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4 or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.
tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
SWE-bench
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes https://arxiv.org/abs/2305.17333
PURE
[NAACL 2021] A Frustratingly Easy Approach for Entity and Relation Extraction https://arxiv.org/abs/2010.12812
LM-BFF
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; [EMNLP 2021] Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.org/abs/2012.12624
LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
LESS
Preprint: LESS: Selecting Influential Data for Targeted Instruction Tuning
WebShop
[NeurIPS 2022] WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
TRIME
[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674
CoFiPruning
[ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408
intercode
[NeurIPS 2023 D&B] Code repository for the InterCode benchmark https://arxiv.org/abs/2306.14898
OptiPrompt
[NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240
TransformerPrograms
[NeurIPS 2023] Learning Transformer Programs
EntityQuestions
[EMNLP 2021] Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535
DinkyTrain
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration
CEPE
Preprint: Long-Context Language Modeling with Parallel Encodings
QuRating
Selecting High-Quality Data for Training Language Models
NLProofS
[EMNLP 2022] Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443
LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
MADE
[EMNLP 2021] Single-dataset Experts for Multi-dataset Question Answering
LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
USACO
Can Language Models Solve Olympiad Programming?
calm-textgame
[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games
c-sts
[EMNLP 2023] C-STS: Conditional Semantic Textual Similarity
DataMUX
[NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks
ShortcutGrammar
[EMNLP 2022] Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560
EvalConvQA
[ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering
Collie
[ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks
MABEL
[EMNLP 2022] MABEL: Attenuating Gender Bias using Textual Entailment Data https://arxiv.org/abs/2210.14975
InstructEval
Evaluation suite for the systematic evaluation of instruction selection methods.
LM-Science-Tutor
WhatICLLearns
[ACL 2023 Findings] What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning
Cognac
Repo for the paper "Controllable Text Generation with Language Constraints"
PTP
Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073
semsup
Semantic Supervision: Enabling Generalization over Output Spaces
datamux-pretraining
MUX-PLMs: Pretraining LMs with Data Multiplexing
corpus-poisoning
[EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156
XTX
[ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games
SRL-NLC
Safe Reinforcement Learning with Natural Language Constraints
MultilingualAnalysis
Repository for the paper "When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer"
blindfold-textgame
[NAACL 2021] Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents
dyck-transformer
[ACL 2021] Self-Attention Networks Can Process Bounded Hierarchical Languages
metric-wsd
[NAACL 2021] Non-Parametric Few-Shot Learning for Word Sense Disambiguation
align-mlm
semsup-xc
SemSup-XC: Semantic Supervision for Extreme Classification
lwm
We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effectively control these agents through verbal communication.
CARETS
Heuristic-Core
The code accompanying the paper "The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models" https://arxiv.org/abs/2403.03942
SPARTAN
SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers
attribute-tagging
[LaReL 2022] Towards an Enhanced, Faithful, and Adaptable Web Interaction Environment
NegotiationToM
Code release for "Improving Dialog Systems for Negotiation with Personality Modeling"
MoQA
il-scaling-in-games
Official code repo of "Scaling Laws for Imitation Learning in NetHack"