nlpcl-lab/bert-event-extraction

Stars: 333
Rank: 125,816 (Top 3 %)
Language: Python
License: MIT License
Created about 5 years ago
Updated over 4 years ago

bert-event-extraction

Pytorch Solution of Event Extraction Task using BERT on ACE 2005 corpus

Prerequisites

Prepare the ACE 2005 dataset.
Use nlpcl-lab/ace2005-preprocessing to preprocess the ACE 2005 dataset into the same format as data/sample.json. Then place the resulting files in the data directory as follows:
```
├── data
│     ├── test.json
│     ├── dev.json
│     └── train.json
│...
```
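
Before training, it can help to confirm that the three split files are where the layout above expects them. The following is a minimal sketch and not part of this repository: the helper name `check_data_dir` is hypothetical, and it assumes only that each split is a valid JSON file whose top-level value has a length (for example, a list of examples).

```python
import json
from pathlib import Path

# Split files named in the directory layout above.
EXPECTED_SPLITS = ("train.json", "dev.json", "test.json")


def check_data_dir(data_dir="data"):
    """Return {file name: number of top-level JSON items} for each split.

    Hypothetical helper: raises FileNotFoundError if a split file is missing.
    """
    counts = {}
    for name in EXPECTED_SPLITS:
        path = Path(data_dir) / name
        if not path.is_file():
            raise FileNotFoundError(f"missing split file: {path}")
        with path.open(encoding="utf-8") as f:
            counts[name] = len(json.load(f))
    return counts
```

Calling `check_data_dir("data")` from the repository root would then either report the size of each split or fail early with the name of the missing file.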

Install the packages.

pip install torch==1.0 pytorch_pretrained_bert==0.6.1 numpy

Usage

Train

python train.py

Evaluation

python eval.py --model_path=latest_model.pt

Result

Performance

| Method | Trigger Precision (%) | Trigger Recall (%) | Trigger F1 (%) | Argument Precision (%) | Argument Recall (%) | Argument F1 (%) |
|---|---|---|---|---|---|---|
| JRNN | 66.0 | 73.0 | 69.3 | 54.2 | 56.7 | 55.5 |
| JMEE | 76.3 | 71.3 | 73.7 | 66.8 | 54.9 | 60.3 |
| This model (BERT base) | 63.4 | 71.1 | 67.7 | 48.5 | 34.1 | 40.0 |

Argument classification performance is low even though a pretrained BERT model is used. The model is currently being updated to improve it.
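
For readers comparing rows, the sketch below shows how an F1 value relates to precision and recall, assuming the table uses the standard definition (the harmonic mean of the two). The function name `f1_score` is illustrative and is not taken from this repository's code.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (both given in percent)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# JRNN trigger classification row: P = 66.0, R = 73.0
print(round(f1_score(66.0, 73.0), 1))  # 69.3

# This model, argument classification row: P = 48.5, R = 34.1
print(round(f1_score(48.5, 34.1), 1))  # 40.0
```

Because the harmonic mean is pulled toward the smaller of its two inputs, the low argument recall (34.1) pulls the argument F1 well below the argument precision (48.5).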

Reference

Jointly Multiple Events Extraction via Attention-based Graph Information Aggregation (EMNLP 2018), Liu et al. [paper]
lx865712528's EMNLP2018-JMEE repository [github]
Kyubyong's bert_ner repository [github]
