JFLEG (JHU FLuency-Extended GUG) corpus
Last updated: December 7th, 2018
(Make sure to download and use the latest version.)
Data
.
├── EACL_exp # experiments in the EACL paper
│   ├── m2converter # script to create m2 format from plain texts
│   ├── mturk # mechanical turk experiments
│   │   ├── sample.csv
│   │   ├── pairwise.csv
│   │   └── template.html
│   └── manual_eval # manual analysis of 100 sentences
│       ├── README.md
│       └── coded_sentences.csv
├── README.md # This file
├── EACLshort037.pdf
├── dev # dev set (754 sentences originally from the GUG **test** set)
│   ├── dev.ref0
│   ├── dev.ref1
│   ├── dev.ref2
│   ├── dev.ref3
│   ├── dev.spellchecked.src # spellchecked by enchant
│   └── dev.src # source (This should be the input for your system.)
├── eval
│   └── gleu.py # evaluation script (sentence-level GLEU score)
└── test # test set (747 sentences originally from the GUG **dev** set)
    ├── test.ref0
    ├── test.ref1
    ├── test.ref2
    ├── test.ref3
    ├── test.spellchecked.src # spellchecked by enchant
    └── test.src # source (This should be the input for your system.)
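As a rough sketch of how this layout could be consumed, the Python below reads one split's source file and its four reference files into aligned records. The helpers `read_lines` and `load_split` are hypothetical names and are not part of this repository. The sketch assumes UTF-8 files with one sentence per line, where line i of every file in a split refers to the same sentence.

```python
from pathlib import Path


def read_lines(path):
    """Read a text file into a list of lines without trailing newlines."""
    # Assumption: files are UTF-8 encoded, one sentence per line.
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle]


def load_split(root, split, num_refs=4):
    """Return a list of (source, [ref0, ..., refN]) tuples for one split.

    Hypothetical helper: `root` is the corpus directory and `split` is
    "dev" or "test", following the file names in the tree above.
    """
    split_dir = Path(root) / split
    sources = read_lines(split_dir / f"{split}.src")
    references = [
        read_lines(split_dir / f"{split}.ref{i}") for i in range(num_refs)
    ]
    # Fail early if any reference file is not line-aligned with the source.
    for i, ref in enumerate(references):
        if len(ref) != len(sources):
            raise ValueError(
                f"{split}.ref{i} has {len(ref)} lines, expected {len(sources)}"
            )
    return [
        (src, [ref[j] for ref in references])
        for j, src in enumerate(sources)
    ]
```

Under these assumptions, `load_split(".", "dev")` run from the corpus root would yield one tuple per source sentence, each carrying its four references.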
Evaluation
For example, to score your system output on the dev set, run:
python ./eval/gleu.py -r ./dev/dev.ref[0-3] -s ./dev/dev.src --hyp YOUR_SYSTEM_OUTPUT
This returns the mean, standard deviation, and confidence interval of the GLEU score.
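When the evaluation is launched from a script rather than a shell, the `dev.ref[0-3]` pattern is not expanded automatically. The sketch below shows one way to build the same argument list in Python. `build_gleu_command` is a hypothetical helper, not part of this repository, and it uses only the `-r`, `-s`, and `--hyp` options shown in the command above.

```python
import glob
import os


def build_gleu_command(split, hyp_path, repo_root="."):
    """Build the argument list for eval/gleu.py (hypothetical helper).

    `split` is "dev" or "test"; `hyp_path` is the system output file.
    """
    split_dir = os.path.join(repo_root, split)
    # Equivalent of the shell glob ./dev/dev.ref[0-3]; sorted for a stable order.
    refs = sorted(glob.glob(os.path.join(split_dir, f"{split}.ref[0-3]")))
    if not refs:
        raise FileNotFoundError(f"no reference files found in {split_dir}")
    return [
        "python", os.path.join(repo_root, "eval", "gleu.py"),
        "-r", *refs,
        "-s", os.path.join(split_dir, f"{split}.src"),
        "--hyp", hyp_path,
    ]
```

The resulting list could then be passed to `subprocess.run(cmd, check=True)`, which avoids relying on shell globbing.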
Leader Board (published results)
N.B. Systems marked with an asterisk (*) are tuned on different data.
System | GLEU (dev) | GLEU (test) |
---|---|---|
Coyne et al. (2023) | 60.10* | 65.02 |
Ge et al. (2018) | N/A | 62.42 |
Liu et al. (2021) | N/A | 61.61 |
Grundkiewicz and Junczys-Dowmunt (2018) | N/A | 61.50 |
Junczys-Dowmunt et al. (2018) | N/A | 59.90 |
Chollampatt and Ng (2018) | 52.48 | 57.47 |
Chollampatt and Ng (2017) | 51.01 | 56.78 |
Xie et al. (2018)* | N/A | 56.20 |
Sakaguchi et al. (2017) | 49.82 | 53.98 |
Ji et al. (2017)* | 48.93 | 53.41 |
Yuan and Briscoe (2016)* | 47.20 | 52.05 |
Junczys-Dowmunt and Grundkiewicz (2016) | 49.74 | 51.46 |
Chollampatt et al. (2016)* | 46.27 | 50.13 |
Felice et al. (2014)* | 42.81 | 46.04 |
=================================== | ========== | ========== |
SOURCE | 38.21 | 40.54 |
REFERENCE | 55.26 | 62.37 |
- If you want to add your score, please e-mail a link to your paper and your system outputs to keisukes[at]allenai.org.
- The REFERENCE scores are computed by averaging the scores of the individual references.
Reference
The following papers should be cited in any publications that use this dataset:
Courtney Napoles, Keisuke Sakaguchi and Joel Tetreault. (EACL 2017): JFLEG: A Fluency Corpus and Benchmark for Grammatical Error Correction. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics. Valencia, Spain. April 03-07, 2017.
Michael Heilman, Aoife Cahill, Nitin Madnani, Melissa Lopez, Matthew Mulholland, and Joel Tetreault. (ACL 2014): Predicting Grammaticality on an Ordinal Scale. In Proceedings of the Association for Computational Linguistics. Baltimore, MD, USA. June 23-25, 2014.
BibTeX information:
@InProceedings{napoles-sakaguchi-tetreault:2017:EACLshort,
author = {Napoles, Courtney and Sakaguchi, Keisuke and Tetreault, Joel},
title = {JFLEG: A Fluency Corpus and Benchmark for Grammatical Error Correction},
booktitle = {Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers},
month = {April},
year = {2017},
address = {Valencia, Spain},
publisher = {Association for Computational Linguistics},
pages = {229--234},
url = {http://www.aclweb.org/anthology/E17-2037}
}
@InProceedings{heilman-EtAl:2014:P14-2,
author = {Heilman, Michael and Cahill, Aoife and Madnani, Nitin and Lopez, Melissa and Mulholland, Matthew and Tetreault, Joel},
title = {Predicting Grammaticality on an Ordinal Scale},
booktitle = {Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month = {June},
year = {2014},
address = {Baltimore, Maryland},
publisher = {Association for Computational Linguistics},
pages = {174--180},
url = {http://www.aclweb.org/anthology/P14-2029}
}
Questions
- Please e-mail Courtney Napoles (napoles[at]cs.jhu.edu) and Keisuke Sakaguchi (keisuke[at]cs.jhu.edu).
License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.