Releases: wikipedia2vec/wikipedia2vec

v2.0.0

11 Jan 07:43
  • Enhanced the text extraction parser of Wikipedia pages
  • Enhanced the detection of category and disambiguation pages
  • Converted Cython’s *.pyx code to *.py by adopting pure Python mode
  • Added support for multi-step Wikipedia redirects
  • Fixed an issue related to mmap (#79)
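
As an illustration of what multi-step redirect support means, the sketch below follows a chain of redirects (A redirects to B, B redirects to C) until it reaches a title that is not itself a redirect. This is not wikipedia2vec's actual implementation: the function name `resolve_redirect`, the plain `dict` mapping, and the `max_hops` limit are all assumptions made for this example.

```python
def resolve_redirect(title, redirects, max_hops=10):
    """Follow a chain of redirects and return the final title.

    `redirects` is a hypothetical mapping from a redirect page title to
    its target title. `max_hops` is an assumed safety limit, not a
    wikipedia2vec setting.
    """
    seen = {title}
    for _ in range(max_hops):
        target = redirects.get(title)
        if target is None:
            # The current title is not a redirect, so the chain ends here.
            return title
        if target in seen:
            # Redirect cycle detected: stop instead of looping forever.
            return title
        seen.add(target)
        title = target
    # Hop limit reached: return the last title visited.
    return title


if __name__ == "__main__":
    redirects = {"NYC": "New York", "New York": "New York City"}
    print(resolve_redirect("NYC", redirects))
```

A single-step resolver would stop at "New York" in this example; following the chain returns "New York City". The cycle check and hop limit keep the loop bounded on malformed redirect data.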

v1.0.5

03 Apr 04:34
  • Fixed a bug in MentionDB that occurred when case_sensitive=True (#67)

v1.0.4

07 Sep 13:03

Changelog:

  • Pages that belong to the Module and TimedText namespaces are now ignored when creating DumpDB
  • Improved normalization rules for entity titles in Wikipedia links (#36, #38)
  • Fixed the Jieba tokenizer (#27)