• Stars
    star
    333
  • Rank 125,816 (Top 3 %)
  • Language
    Python
  • License
    MIT License
  • Created about 5 years ago
  • Updated over 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Pytorch Solution of Event Extraction Task using BERT on ACE 2005 corpus

bert-event-extraction

Pytorch Solution of Event Extraction Task using BERT on ACE 2005 corpus

Prerequisites

  1. Prepare ACE 2005 dataset.

  2. Use nlpcl-lab/ace2005-preprocessing to preprocess ACE 2005 dataset in the same format as the data/sample.json. Then place it in the data directory as follows:

    β”œβ”€β”€ data
    β”‚     └── test.json
    β”‚     └── dev.json
    β”‚     └── train.json
    β”‚...
    
  3. Install the packages.

    pip install pytorch==1.0 pytorch_pretrained_bert==0.6.1 numpy
    

Usage

Train

python train.py

Evaluation

python eval.py --model_path=latest_model.pt

Result

Performance

Method Trigger Classification (%) Argument Classification (%)
Precision Recall F1 Precision Recall F1
JRNN 66.0 73.0 69.3 54.2 56.7 55.5
JMEE 76.3 71.3 73.7 66.8 54.9 60.3
This model (BERT base) 63.4 71.1 67.7 48.5 34.1 40.0

The performance of this model is low in argument classification even though pretrained BERT model was used. The model is currently being updated to improve the performance.

Reference

  • Jointly Multiple Events Extraction via Attention-based Graph Information Aggregation (EMNLP 2018), Liu et al. [paper]
  • lx865712528's EMNLP2018-JMEE repository [github]
  • Kyubyong's bert_ner repository [github]

More Repositories

1

ace2005-preprocessing

ACE 2005 corpus preprocessing for Event Extraction task
Python
287
star
2

event-extraction

Tensorflow Implementation of Dynamic Multi-Pooling Convolutional Neural Networks for Event Extraction
Python
27
star
3

CADD_dataset

CADD: A Large-scale Comprehensive Abusiveness Detection Dataset with Multifaceted Labels from Reddit
10
star
4

Auto_Labeling

Python
7
star
5

M1_2022

Python
7
star
6

M3-credibility-predictor-ver5.1

Python
7
star
7

M2_2022

Python
7
star
8

dialog-eval-hard-negative

Code for "Generating Negative Samples by Manipulating Golden Responsesfor Unsupervised Learning of a Response Evaluation Model (NAACL-HLT 2021)"
Python
6
star
9

Zero_shot_Reader

Python
5
star
10

M3-credibility-predictor-ver6.1

Python
4
star
11

UDEG

Python
2
star
12

ultra-fast-writing

BONOBONO
Python
2
star
13

sentential_argument_generation

Code for the NLP4IF 2019 paper " ArgDiver: Generating Sentential Arguments from Diverse Perspectives on Controversial Topic"
Python
2
star
14

ted-talks-annotation

Annotation of Tension Development in TED talks
HTML
2
star
15

mpqa2.0-preprocessing

Python
2
star
16

NaverNewsCrawler

Python
1
star
17

Homeworks-CS372

Example codes for cs372 homeworks (contributed by students)
Python
1
star
18

starlab-website

JavaScript
1
star
19

unsupervised_aspect_extraction

A Tensorflow reimplementation of ACL 2017, "An unsupervised neural attention model for aspect extraction".
Python
1
star
20

two-step-reason

Annotation quality control via two-step reason selection (EMNLP-IJCNLP 2019)
JavaScript
1
star
21

MPQA-Stance-Extraction

Python
1
star
22

sentence-scoring

Simple code for 1) sentence fluency scoring and 2) measuring sentence-pair similarity
Python
1
star
23

starlab-api

Home of all CredOn APIs
JavaScript
1
star