• Stars
    star
    252
  • Rank 161,312 (Top 4 %)
  • Language
    HTML
  • Created over 7 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

Natural Language for Visual Reasoning

This repository contains data for NLVR (Suhr et al. 2017) and NLVR2 (Suhr and Zhou et al. 2018).

The Natural Language for Visual Reasoning corpora use the task of determining whether a sentence is true about a visual input, like an image. This task focuses on reasoning about sets of objects, comparisons, and spatial relations. This includes two datasets: NLVR, with synthetically generated images, and NLVR2, which includes natural photographs.

See the webpage for examples and the leaderboards here: http://lic.nlp.cornell.edu/nlvr/

If you have questions, please use the Issues page, or email us directly: [email protected]

Licensing

NLVR (original dataset with synthetically generated images; Suhr et al. 2017)

Following Microsoft COCO (http://cocodataset.org/#termsofuse), we have licensed the NLVR dataset (synthetically-generated images, structured representations, and annotations) under CC-BY-4.0 (https://creativecommons.org/licenses/by/4.0/).

NLVR2 (dataset with real images, Suhr and Zhou et al. 2018)

We have licensed the annotations of the NLVR2 images (sentences and binary labels) under CC-BY-4.0 (https://creativecommons.org/licenses/by/4.0/). We do not license the NLVR2 images as we do not hold the copyright to them.

More Repositories

1

newsroom

Tools for downloading and analyzing summaries and evaluating summarization systems. https://summari.es/
Perl
146
star
2

spf

Cornell Semantic Parsing Framework
Java
128
star
3

touchdown

Cornell Touchdown natural language navigation and spatial reasoning dataset.
Python
92
star
4

kilogram

The KiloGram Tangrams dataset
Jupyter Notebook
51
star
5

atis

Python
46
star
6

chalet

Cornell House Agent Learning Environment
HTML
46
star
7

blocks

Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)
Python
40
star
8

ciff

Cornell Instruction Following Framework
Python
33
star
9

drif

Dynamic Robot Instruction Following
Python
31
star
10

cerealbar

Cereal Bar is a two-player web game designed for studying language understanding agents in collaborative interactions. This repository contains code for the game, a webapp hosting the game, the agent implementation, and recorded interactions in the game. http://lil.nlp.cornell.edu/cerealbar/
Python
28
star
11

amr

Cornell AMR Semantic Parser (Artzi et al., EMNLP 2015)
Java
23
star
12

bandit-qa

Code for Simulating Bandit Learning from User Feedback for Extractive Question Answering.
Python
18
star
13

nccg

Neural Shift Reduce Parser for CCG Semantic Parsing (Misra and Artzi, EMNLP 2016)
Java
17
star
14

lm-class

Materials for a language modeling class, broadly construed
NewLisp
16
star
15

cb2

An NLP research and data collection platform.
Python
14
star
16

vgnsl_analysis

"What is Learned in Visually Grounded Neural Syntax Acquisition", Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi (ACL 2020)
Python
12
star
17

navi

Code for Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions (Artzi and Zettlemoyer, TACL 2013)
Java
11
star
18

navigation-corpus

Navigation data used for Chen and Mooney 2011 and Artzi and Zettlemoyer 2013 (including cleaned up oracle data)
Python
9
star
19

qa-from-hf

Python
9
star
20

dynet_tutorials

Contains various short notebooks showing how to use DyNet. Created for CS 5740 at Cornell University.
Jupyter Notebook
8
star
21

scone

Python
7
star
22

lilgym

lilGym RL benchmark
Python
7
star
23

recnet

A human-driven recommendation system for academic readings.
TypeScript
4
star
24

lilgym-baselines

Python
2
star
25

gsmn

Code for RSS2018 paper on the Grounded Semantic Mapping Network
2
star
26

cerealbar_generation

Python
1
star
27

geoquery-corpus

The GeoQuery corpus
1
star
28

phrase_grounding

Python
1
star
29

kilogram-annotation-task

Task website for collecting tangram annotations from MTurk.
JavaScript
1
star
30

remote-teaching-setup

Remote teaching and talk recording setup
1
star