BERT-NER with doc_stride

Application of doc_stride in BERT for NER

Use Google's BERT for named entity recognition （CoNLL-2003 as the dataset）.

Disclaimer

This repo is modified version of https://github.com/kyzhouhzau/BERT-NER with addition of doc_stride in order to process large texts (sequence length > 512). Since Google's pretrained models have can only support max_seq_length of 512 tokens, we apply doc_stride, a method described for SQuAD dataset.

Folder Description:

BERT-NER
|____ bert                          # need git from [here](https://github.com/google-research/bert)
|____ cased_L-12_H-768_A-12	    # need download from [here](https://storage.googleapis.com/bert_models/2018_10_18/cased_L-12_H-768_A-12.zip)
|____ data		            # train data
|____ middle_data	            # middle data (label id map)
|____ output			    # output (final model, predict results)
|____ BERT_NER_ORIG.py		    # original code without doc_stride
|____ BERT_NER_STRIDE.py		    # main code with doc_stride
|____ conlleval.pl		    # eval code
|____ run_ner.sh    		    # run model and eval result

Usage:

bash run_ner.sh

What's in run_ner.sh:

python BERT_NER_STRIDE.py\
    --task_name="NER"  \
    --do_lower_case=False \
    --crf=False \
    --do_train=True   \
    --do_eval=True   \
    --do_predict=True \
    --data_dir=data   \
    --vocab_file=cased_L-12_H-768_A-12/vocab.txt  \
    --bert_config_file=cased_L-12_H-768_A-12/bert_config.json \
    --init_checkpoint=cased_L-12_H-768_A-12/bert_model.ckpt   \
    --max_seq_length=128   \
    --train_batch_size=32   \
    --learning_rate=2e-5   \
    --num_train_epochs=3.0   \
    --output_dir=./output/result_dir  \
    --doc_stride=128

perl conlleval.pl -d '\t' < ./output/result_dir/label_test.txt

Notice: cased model was recommened, according to this paper. CoNLL-2003 dataset and perl Script comes from here

RESULTS:(On test set)

Parameter setting:

do_lower_case=False
num_train_epochs=4.0
crf=False

accuracy:  98.15%; precision:  90.61%; recall:  88.85%; FB1:  89.72
              LOC: precision:  91.93%; recall:  91.79%; FB1:  91.86  1387
             MISC: precision:  83.83%; recall:  78.43%; FB1:  81.04  668
              ORG: precision:  87.83%; recall:  85.18%; FB1:  86.48  1191
              PER: precision:  95.19%; recall:  94.83%; FB1:  95.01  1311

Result description:

Here i just use the default paramaters, but as Google's paper says a 0.2% error is reasonable(reported 92.4%). Maybe some tricks need to be added to the above model.

reference:

[1] https://arxiv.org/abs/1810.04805

[2] https://github.com/google-research/bert

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bert

bert

cased_L-12_H-768_A-12

cased_L-12_H-768_A-12

data

data

output/result_dir

output/result_dir

BERT_NER_ORIG.py

BERT_NER_ORIG.py

BERT_NER_STRIDE.py

BERT_NER_STRIDE.py

README.md

README.md

conlleval.pl

conlleval.pl

function_test.py

function_test.py

metrics.py

metrics.py

run_ner.sh

run_ner.sh

Repository files navigation

BERT-NER with doc_stride

Disclaimer

Folder Description:

Usage:

What's in run_ner.sh:

RESULTS:(On test set)

Parameter setting:

Result description:

reference:

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
bert		bert
cased_L-12_H-768_A-12		cased_L-12_H-768_A-12
data		data
output/result_dir		output/result_dir
BERT_NER_ORIG.py		BERT_NER_ORIG.py
BERT_NER_STRIDE.py		BERT_NER_STRIDE.py
README.md		README.md
conlleval.pl		conlleval.pl
function_test.py		function_test.py
metrics.py		metrics.py
run_ner.sh		run_ner.sh

anupamsingh610/bert_ner_stride

Folders and files

Latest commit

History

Repository files navigation

BERT-NER with doc_stride

Disclaimer

Folder Description:

Usage:

What's in run_ner.sh:

RESULTS:(On test set)

Parameter setting:

Result description:

reference:

About

Topics

Resources

Stars

Watchers

Forks

Languages