VnCoreNLP đã có Python wrapper trên repo chính thức.

VnCoreNLP: https://github.com/vncorenlp/VnCoreNLP

Setup

$ pip install py4j

Copy VnCoreNLP.jar, vncorenlp.py and models to your project in the same directory

Example

from vncorenlp import VnCoreNLP

txt = 'học sinh học sinh học'

# Init & load model
vncore_nlp = VnCoreNLP(annotators="wseg pos ner parse")

# Use tokenize only
print(vncore_nlp.tokenize(txt, str=True))
print()
print(vncore_nlp.tokenize(txt, str=False))
print()
print(vncore_nlp.extract(txt))

Output:

học_sinh học_sinh học

['học_sinh', 'học_sinh', 'học']

[
    ['học_sinh', 'N', 'O', '3', 'sub'], 
    ['học_sinh', 'N', 'O', '1', 'nmod'], 
    ['học', 'V', 'O', '0', 'root']
]

Update new VnCoreNLP version

Clone or Download VnCoreNLP

$ git clone https://github.com/vncorenlp/VnCoreNLP

Build VnCoreNLP.jar from VnCoreNLP project

Copy Tokenizer.java to VnCoreNLP project

$ cp Tokenizer.java /path/VnCoreNLP/src/main/java/vn/

Build jar for Tokenizer.java main class

Copy ./models dir and new .jar file to this repository

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
models		models
README.md		README.md
Tokenizer.java		Tokenizer.java
VnCoreNLP.jar		VnCoreNLP.jar
example.py		example.py
vncorenlp.py		vncorenlp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

models

models

README.md

README.md

Tokenizer.java

Tokenizer.java

VnCoreNLP.jar

VnCoreNLP.jar

example.py

example.py

vncorenlp.py

vncorenlp.py

Repository files navigation

VnCoreNLP đã có Python wrapper trên repo chính thức.

Setup

Example

Update new VnCoreNLP version

About

Releases

Packages

Languages

behitek/vncorenlp-wrapper

Folders and files

Latest commit

History

Repository files navigation

VnCoreNLP đã có Python wrapper trên repo chính thức.

Setup

Example

Update new VnCoreNLP version

About

Topics

Resources

Stars

Watchers

Forks

Languages