nyu-mll/pretraining-learning-curves

Stars
20
Rank 1,121,974 (Top 23 %)
Language
License
MIT License
Created about 4 years ago
Updated about 4 years ago

nyu-mll/pretraining-learning-curves

nyu-mll

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"

jiant

jiant is an nlp toolkit

GLUE-baselines

[DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations

multiNLI

quality

crows-pairs

This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models" (EMNLP 2020).

BBQ

Repository for the Bias Benchmark for QA dataset.

DS-GA-1011-Fall2017

DS-GA-1011 Natural Language Processing with Representation Learning

Jupyter Notebook

ILF-for-code-generation

CoLA-baselines

Baselines and corpus accompanying paper Neural Network Acceptability Judgments

PRPN-Analysis

This repo contains the analysis results reported in the paper "Grammar Induction with Neural Language Models: An Unusual Replication"

SQuALITY

Query-focused summarization data

jiant-v1-legacy

The jiant toolkit for general-purpose text understanding models

Jupyter Notebook

msgs

This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.

nlu-test-sets

Analysis of NLU test sets with IRT

Jupyter Notebook

CoLA

Demo for Grammaticality Judgement (Acceptability) task

nope

Data and code for "NOPE: A Corpus of Naturally-Occurring Presuppositions in English."

semi-automatic-nli

This is a repository for data and code accompanying paper "Asking Crowdworkers to Write Entailment Examples: The Best of Bad Options" (AACL 2020)

online-code-for-edge-probing

Jupyter Notebook

wsc-formalizations

Jupyter Notebook

crowdsourcing-protocol-comparison

CNLI-generalization

GLUE-human-performance

nyu-ai-school-2023