Shift Reduce Dependency Parser for Japanese

Overview

A neural-network-based dependency parser (syntax tree parser) for Japanese.

TL;DR:

Implemented a shift-reduce parser using the arc-standard transition system.
Implemented a neural network in PyTorch that predicts the next parser action given the current parser state.
Trained the neural network on the UD Japanese GSD treebank, using pretrained word embedding weights from Wikipedia2vec.
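As a rough illustration of the arc-standard transition system mentioned above, here is a minimal sketch of the three actions (SHIFT, LEFT-ARC, RIGHT-ARC) operating on a stack and a buffer. This is not the code in parse.py: the names ParserState, initial_state, apply_action and run_actions are hypothetical, token index 0 is assumed to be an artificial ROOT, and the action sequence is supplied by hand where the real parser would ask the neural network.

```python
from dataclasses import dataclass, field

SHIFT, LEFT_ARC, RIGHT_ARC = "SHIFT", "LEFT-ARC", "RIGHT-ARC"


@dataclass
class ParserState:
    """Hypothetical parser state: token indices only, 0 is an artificial ROOT."""
    stack: list
    buffer: list
    arcs: list = field(default_factory=list)  # (head, dependent, label) triples

    def is_terminal(self):
        # Parsing is finished when only ROOT remains and the buffer is empty.
        return not self.buffer and len(self.stack) == 1


def initial_state(n_tokens):
    # Tokens are numbered 1..n_tokens; ROOT (0) starts on the stack.
    return ParserState(stack=[0], buffer=list(range(1, n_tokens + 1)))


def apply_action(state, action, label=None):
    """Apply one arc-standard action in place and return the state."""
    if action == SHIFT:
        if not state.buffer:
            raise ValueError("SHIFT needs a non-empty buffer")
        state.stack.append(state.buffer.pop(0))
    elif action == LEFT_ARC:
        # Top of stack becomes head of the item below it; ROOT cannot be a dependent.
        if len(state.stack) < 3:
            raise ValueError("LEFT-ARC needs two non-ROOT items on the stack")
        dependent = state.stack.pop(-2)
        state.arcs.append((state.stack[-1], dependent, label))
    elif action == RIGHT_ARC:
        # Item below the top becomes head of the top, which is then removed.
        if len(state.stack) < 2:
            raise ValueError("RIGHT-ARC needs two items on the stack")
        dependent = state.stack.pop()
        state.arcs.append((state.stack[-1], dependent, label))
    else:
        raise ValueError(f"unknown action: {action}")
    return state


def run_actions(n_tokens, actions):
    """Replay a list of (action, label) pairs from the initial state."""
    state = initial_state(n_tokens)
    for action, label in actions:
        apply_action(state, action, label)
    return state


# Illustrative sentence: 猫 (1) が (2) 走る (3); the labels are assumptions for the example.
example_actions = [
    (SHIFT, None),
    (SHIFT, None),
    (RIGHT_ARC, "case"),   # 猫 -> が
    (SHIFT, None),
    (LEFT_ARC, "nsubj"),   # 走る -> 猫
    (RIGHT_ARC, "root"),   # ROOT -> 走る
]
final_state = run_actions(3, example_actions)
print(final_state.arcs)
```

A sentence with n tokens always takes exactly 2n actions in this system (one SHIFT and one arc per token), so the network only has to choose among a small, fixed set of actions at each step.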

Example/Visualization

Example usage and dependency tree visualization are available in a Google Colab notebook.

Training

Run python3 model.py to execute the training loop.

Model weights are saved to model.pth in PyTorch model state format. Loading the model also requires model_lists.txt and embeddings/jawiki_gsd_word2vec.txt.

The model should generally converge at approximately 0.96 LAS (Labelled Attachment Score) and 0.97 UAS (Unlabelled Attachment Score). Increasing the hyperparameters (embed size, hidden size) beyond the specified defaults may marginally improve accuracy.
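For readers unfamiliar with the two metrics, the sketch below shows how UAS and LAS are commonly computed: UAS is the fraction of tokens whose predicted head is correct, and LAS additionally requires the dependency label to match. The function name attachment_scores and the (head, label) input format are assumptions made for this example, not the evaluation code in model.py, and conventions such as whether punctuation is counted vary between evaluations.

```python
def attachment_scores(gold, predicted):
    """Return (UAS, LAS) for two aligned lists of (head, label) pairs, one per token."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must cover the same tokens")
    if not gold:
        raise ValueError("cannot score an empty sentence")
    # UAS: head index matches, label ignored.
    head_hits = sum(g[0] == p[0] for g, p in zip(gold, predicted))
    # LAS: both head index and label match.
    labelled_hits = sum(g == p for g, p in zip(gold, predicted))
    n = len(gold)
    return head_hits / n, labelled_hits / n


# Illustrative three-token sentence: every head is right, one label is wrong.
gold = [(3, "nsubj"), (1, "case"), (0, "root")]
predicted = [(3, "obj"), (1, "case"), (0, "root")]
uas, las = attachment_scores(gold, predicted)
print(f"UAS={uas:.2f} LAS={las:.2f}")
```

Because a correct labelled arc is also a correct unlabelled arc, LAS can never exceed UAS, which is consistent with the figures quoted above.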

Repository: jonnyli1125/jp-srparser