Kyubyong/bert_ner

Stars
281
Rank 147,023 (Top 3 %)
Language
Python
Created over 5 years ago
Updated about 5 years ago

Kyubyong/bert_ner

Kyubyong

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Ner with Bert

PyTorch Implementation of NER with pretrained Bert

I know that you know BERT. In the great paper, the authors claim that the pretrained models do great in NER. It's even impressive, allowing for the fact that they don't use any prediction-conditioned algorithms like CRFs. We try to reproduce the result in a simple manner.

Requirements

python>=3.6 (Let's move on to python 3 if you still use python 2)
pytorch==1.0
pytorch_pretrained_bert==0.6.1
numpy>=1.15.4

Training & Evaluating

STEP 1. Run the command below to download conll 2003 NER dataset.

bash download.sh

It should be extracted to conll2003/ folder automatically.

STEP 2a. Run the command if you want to do the feature-based approach.

python train.py --logdir checkpoints/feature --batch_size 128 --top_rnns --lr 1e-4 --n_epochs 30

STEP 2b. Run the command if you want to do the fine-tuning approach.

python train.py --logdir checkpoints/finetuning --finetuning --batch_size 32 --lr 5e-5 --n_epochs 3

Results in the paper

Feature-based approach

Fine-tuning

Results

F1 scores on conll2003 valid dataset are reported.
You can check the classification outputs in checkpoints.

epoch	feature-based	fine-tuning
1	0.2	0.95
2	0.75	0.95
3	0.84	0.96
4	0.88
5	0.89
6	0.90
7	0.90
8	0.91
9	0.91
10	0.92
11	0.92
12	0.93
13	0.93
14	0.93
15	0.93
16	0.92
17	0.93
18	0.93
19	0.93
20	0.93
21	0.94
22	0.94
23	0.93
24	0.93
25	0.93
26	0.93
27	0.93
28	0.93
29	0.94
30	0.93

transformer

A TensorFlow Implementation of the Transformer: Attention Is All You Need

nlp_tasks

Natural Language Processing Tasks and References

wordvectors

Pre-trained word vectors of 30+ languages

tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

numpy_exercises

Numpy exercises.

dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

sudoku

Can Neural Networks Crack Sudoku?

g2p

g2p: English Grapheme To Phoneme Conversion

tensorflow-exercises

TensorFlow Exercises - focusing on the comparison with NumPy.

css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

deepvoice3

Tensorflow Implementation of Deep Voice 3

neural_chinese_transliterator

Can CNNs transliterate Pinyin into Chinese characters correctly?

pytorch_exercises

Jupyter Notebook

nlp_made_easy

Explains nlp building blocks in a simple manner.

Jupyter Notebook

word_prediction

Word Prediction using Convolutional Neural Networks

g2pC

g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

g2pK

g2pK: g2p module for Korean

expressive_tacotron

Tensorflow Implementation of Expressive Tacotron

speaker_adapted_tts

Making a TTS model with 1 minute of speech samples within 10 minutes

neural_japanese_transliterator

Can neural networks transliterate Romaji into Japanese correctly?

tacotron_asr

Speech Recognition Using Tacotron

quasi-rnn

Character-level Neural Translation using Quasi-RNNs

label_smoothing

Corrupted labels and label smoothing

Jupyter Notebook

name2nat

name2nat: a Python package for nationality prediction from a name

bert-token-embeddings

Jupyter Notebook

cross_vc

Cross-lingual Voice Conversion

mtp

Multi-lingual Text Processing

pron_dictionaries

pronunciation dictionaries for multiple languages

msg_reply

a simple message reply suggestion system

word_ordering

Can neural networks order a scramble of words correctly?

kss

neural_tokenizer

Tokenize English sentences using neural networks.

bytenet_translation

A TensorFlow Implementation of Machine Translation In Neural Machine Translation in Linear Time

KoParadigm

KoParadigm: Korean Inflectional Paradigm Generator

specAugment

Tensor2tensor experiment with SpecAugment

vq-vae

A Tensorflow Implementation of VQ-VAE Speaker Conversion

lm_finetuning

Language Model Fine-tuning for Moby Dick

texture_generation

An Implementation of 'Texture Synthesis Using Convolutional Neural Networks' with Kylberg Texture Dataset

cjk_trans

Pre-trained Machine Translation Models of Korean from/to ECJ

h2h_converter

Convert Sino-Korean words written in Hangul to Chinese characters, which is called hanja in Korean, using neural networks

integer_sequence_learning

RNN Approaches to Integer Sequence Learning--the famous Kaggle competition

up_and_running_with_Tensorflow

A simple tutorial of TensorFlow + TensorFlow / NumPy exercises

Jupyter Notebook

neurobind

Yet Another Model Using Neural Networks for Predicting Binding Preferences of for Test DNA Sequences

kollocate

Collocation Search of Korean

kyubyong

WhereAmI

Where Am I? - If you want to meet me.

spam_detection

Spam Dectection Under Semi-supervised settings

helo_word

A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning