Skip to content

fooger/elasticsearch-analysis-morphology

 
 

Repository files navigation

Morphological Analysis Plugin for ElasticSearch

The Morphological Analysis plugin integrates Russian and English morphology for java and lucene framework into elasticsearch. This plugin adds two new analyzers: “russian_morphology” and “english_morphology” and two token filters with the same names.

The demo.sh file shows a few examples of the analyzers behavior.

Switching to Hunspell

For Elasticsearch version 6.0 and above there is an officially supported hunspell token filter with russian dictionaries. But in my opinion it behaves much poorer than this plugin because of limited dictionary and no predefined behavior on unknown words.

Building
For building use latest gradle 6.2+ (https://gradle.org/install/#manually) and run

gradle build
.
Java SDK 13+ required.

Compatibility

Plugin is avaliable only for Elasticsearch 7.6.0.

Morphological Analysis Plugin Elasticsearch URL
7.6.0 7.6.0 build yourself: gradlew build

Installation

In order to install the plugin, simply run the following command in the elasticsearch home directory:


bin/elasticsearch-plugin install file:///C:\PROJECTS\elasticsearch-analysis-morphology\analysis-morphology-7.6.0.zip

where zip_file_path is full path to zip archive with builded plugin.
Also, if you not know how build the plugin, the compiled version is added (analysis-morphology-7.6.0.zip)

P.S. I am not java developer, and I do not know about any errors. I am not tested yet the plugin, only install.

About

Morphological Analysis Plugin for ElasticSearch

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Java 62.1%
  • Shell 37.9%