ArbelTepper/NLP-IAHLT_project

NLP-IAHLT_project

Using the IAHLT corpus of Hebrew sentences tagged in CoNLL-U format, I wrote a Python program that reads the file with the pyconll library. Through a custom object I made, "token_dictionary", I build a dataframe that holds a frequency dictionary of all the words in the corpus that share the same "POS" and "relation" properties.
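
The counting idea described above can be sketched as follows. This is a minimal illustration, not the notebook's code: it parses a toy English CoNLL-U string using only the standard library instead of pyconll, and the names `SAMPLE` and `count_tokens` are hypothetical. It assumes that "POS" refers to the UPOS column and "relation" to the DEPREL column of the CoNLL-U format.

```python
from collections import Counter

# Toy CoNLL-U sample (hypothetical data, not taken from the IAHLT corpus).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
SAMPLE = (
    "# text = the dog saw the cat\n"
    "1\tthe\tthe\tDET\t_\t_\t2\tdet\t_\t_\n"
    "2\tdog\tdog\tNOUN\t_\t_\t3\tnsubj\t_\t_\n"
    "3\tsaw\tsee\tVERB\t_\t_\t0\troot\t_\t_\n"
    "4\tthe\tthe\tDET\t_\t_\t5\tdet\t_\t_\n"
    "5\tcat\tcat\tNOUN\t_\t_\t3\tobj\t_\t_\n"
    "\n"
)


def count_tokens(conllu_text):
    """Count how often each (form, POS, relation) triple occurs."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed token line
        token_id, form, upos, deprel = cols[0], cols[1], cols[3], cols[7]
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1"),
        # which carry no POS or relation of their own.
        if "-" in token_id or "." in token_id:
            continue
        counts[(form, upos, deprel)] += 1
    return counts


counts = count_tokens(SAMPLE)
for (form, upos, deprel), n in counts.most_common():
    print(form, upos, deprel, n)
```

Keying the counter on the full (form, POS, relation) triple keeps the same surface word separate when it appears in different syntactic roles. A counter like this could then be turned into dataframe rows, one per triple, for filtering and plotting.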

The token_dictionary has various methods that allow for interesting data extractions and visualizations.

Further explanation can be found in the notebook.

About

Exploring and visualizing CoNLL-U files in Python
