Skip to content

evaldask/lltk

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Build Status Coverage Status Maintainability

LLTK

Lithuanian Language ToolKit

This is Python3 library for Lithuanian language analysis. Currently these functions are supported:

  • word stemming,
  • paragraph splitting into sentences,
  • text summarization based on smmry.

For summarization example please checkout example jupyter notebook.

Stemming and splitting is based on TokenMill work. So big thumbs up for these guys for opensourcing their tools.

Installing for development

  • Fork the project
  • Clone your forked project:
$ git clone git@github.com:evalkaz/lltk.git
  • Install development dependencies:
$ pip install -r requirements-dev.txt

Running tests

After cloned and installed development dependencies run tests just typing the following command:

$ pytest

Any contribution is welcome!