Knowing False Negatives: An Adversarial Training Method for Distantly Supervised Relation Extraction

Abstract

Distantly supervised relation extraction (RE) automatically aligns unstructured text with relation instances in a knowledge base (KB). Due to the incompleteness of current KBs, sentences implying certain relations may be annotated as N/A instances, which causes the so called false negative (FN) problem. Current RE methods usually overlook this problem, inducing improper biases in both training and testing procedures. To address this issue, we propose a two-stage approach. First, it finds out possible FN samples by heuristically leveraging the memory mechanism of deep neural networks. Then, it aligns those unlabeled data with the training data into a unified feature space by adversarial training to assign pseudo labels and further utilize the information contained in them. Experiments on two wildly used benchmark datasets demonstrate the effectiveness of our approach.

File Structure

FAN/
├─ bert/
├─ data/
    ├── gids/
    ├── nyt10/
├─ loader/
├─ mining/: code for Stage I: Mining
├─ model/: neural networks
├─ output/
├─ align.py
├─ align.sh
├─ utils.py

Running

Stage I: Mining

$ cd mining
$ python run.py
$ python infer.py

Stage II: Aligning

python align.py --output_dir ./output --device cuda:0 --pos_file ./output/mining/nyt10/pos.txt
--unlabeled_file ./output/mining/nyt10/unlabeled.txt --rel2id ./data/nyt10/rel2id.json 
--epoch 10 --batch_size 160 --lr 1e-5 --pretrain ./bert/bert-base-uncased --optimizer sgd 
--contrastive --alpha 0.01 --beta 0.01 --gamma 0.0001 --tau 1.0 --weighting --max_bag_size -1

Citation

@inproceedings{FAN,
 author = {Kailong Hao and Botao Yu and Wei Hu},
 title = {Knowing False Negatives: An Adversarial Training Method for Distantly Supervised Relation Extraction},
 booktitle = {EMNLP},
 year = {2021},
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bert

bert

data

data

loader

loader

mining

mining

model

model

output

output

LICENSE

LICENSE

README.md

README.md

align.py

align.py

align.sh

align.sh

Repository files navigation

Knowing False Negatives: An Adversarial Training Method for Distantly Supervised Relation Extraction

Abstract

File Structure

Running

Stage I: Mining

Stage II: Aligning

Citation

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
bert		bert
data		data
loader		loader
mining		mining
model		model
output		output
LICENSE		LICENSE
README.md		README.md
align.py		align.py
align.sh		align.sh

License

nju-websoft/FAN

Folders and files

Latest commit

History

Repository files navigation

Knowing False Negatives: An Adversarial Training Method for Distantly Supervised Relation Extraction

Abstract

File Structure

Running

Stage I: Mining

Stage II: Aligning

Citation

About

Resources

License

Stars

Watchers

Forks

Languages