Releases: dbpedia-spotlight/dbpedia-spotlight
Scala Upgrade and TokenType Id fix
This release introduces the following:
- Upgrade Scala to version 2.10 and some deps clean up.
- Fixes a bug related to duplicated TokenType Ids thanks to @Lugrin
This release prepares the groundwork for the new Vector Models developed at GSOC 2015
0.7 Bug Fixes
This release mainly fixes some bugs in the current spotlight version 0.7. It also is the last release that targets Scala version 2.9.x.
DBpedia Spotlight 0.7
Main improvements of this version
-
smaller and much faster models through quantization of counts, optimization of search and some pruning (see memory usage here)
-
better handling of case
-
various fixes in Spotlight and PigNLProc
-
models can now be created without requiring a Hadoop and Pig installation:
git clone https://github.com/dbpedia-spotlight/model-quickstarter cd model-quickstarter ./index_db.sh -l wdir nl_NL nl/stopwords.list Dutch models/nl
-
UIMA support
-
support for confidence value
Models
This version breaks model compatibility with the previous version, so new models are available here.
Raw model data
In addition to those, we also re-ran the count collection for most languages with DBpedia 3.9 and are making those raw counts available here.
See also
- Spotlight Model Editor by Idio