Skip to content

Releases: neologd/mecab-ipadic-neologd

2020-08-20

25 Aug 11:28
Compare
Choose a tag to compare

CAUTION
If you are the beginner of NLP, I don't recommend that you use this tag. I recommend that you use the latest version of master branch. I don't accept your request or complaint for this tag. (^O^).

We created the seed file of a neologism dictionary of a POS tagger on 2020-08-20.

The seed file in this tag (v0.0.7) will not update forever.

Therefore, this tag is very useful for the following applications.

  • Experiments for evaluation of the research results
  • Reproducibility of the experimental results of others
  • Creation of the processing results of morphological analysis that doesn't update forever

We created the seed file using following resources.

  • Dump data of hatena keyword
  • Japanese postal code number data download (ken_all.lzh)
  • The name-of-the-station list of whole country of Japan
  • The entry data of person names (last name / first name)
  • The entry data of emojis from Unicode 10.0 and Emoji 5.0
  • The entry data of Kaomoji strings
  • The entry data of adverbs
  • The entry data of adjectives
  • The entry data of adjective verbs
  • The entry data of interjections
  • The entry data of orthographic variant of general nouns
  • A lot of documents, which crawled from Web

2018-08-28

28 Aug 09:08
Compare
Choose a tag to compare

CAUTION
If you are the beginner of NLP, I don't recommend that you use this tag. I recommend that you use the latest version of master branch. I don't accept your request or complaint for this tag. (^O^).

We created the seed file of a neologism dictionary of a POS tagger on 2018-08-28.

The seed file in this tag (v0.0.6) will not update forever.

Therefore, this tag is very useful for the following applications.

  • Experiments for evaluation of the research results
  • Reproducibility of the experimental results of others
  • Creation of the processing results of morphological analysis that doesn't update forever

We created the seed file using following resources.

  • Dump data of hatena keyword
  • Japanese postal code number data download (ken_all.lzh)
  • The name-of-the-station list of whole country of Japan
  • The entry data of person names (last name / first name)
  • The entry data of Unicode emojis (version 8.0)
  • The entry data of Kaomoji strings
  • The entry data of adverbs
  • The entry data of adjectives
  • The entry data of adjective verbs
  • The entry data of interjections
  • The entry data of orthographic variant of general nouns
  • A lot of documents, which crawled from Web

2016-05-02

03 May 06:45
Compare
Choose a tag to compare

CAUTION
If you are the beginner of NLP, I don't recommend that you use this tag. I recommend that you use the latest version of master branch. I don't accept your request or complaint for this tag. (^O^).

We created the seed file of a neologism dictionary of a POS tagger on 2016-05-02.

The seed file in this tag (v0.0.5) will not update forever.

Therefore, this tag is very useful for the following applications.

  • Experiments for evaluation of the research results
  • Reproducibility of the experimental results of others
  • Creation of the processing results of morphological analysis that doesn't update forever

We created the seed file using following resources.

  • Dump data of hatena keyword
  • Japanese postal code number data download (ken_all.lzh)
  • The name-of-the-station list of whole country of Japan
  • The entry data of person names (last name / first name)
  • The entry data of Unicode emojis (version 8.0)
  • The entry data of Kaomoji strings
  • The entry data of adverbs
  • The entry data of adjectives
  • The entry data of adjective verbs
  • The entry data of interjections
  • The entry data of orthographic variant of general nouns
  • A lot of documents, which crawled from Web

2015-12-10

10 Dec 10:43
Compare
Choose a tag to compare

CAUTION
If you are the beginner of NLP, I don't recommend that you use this tag. I recommend that you use the latest version of master branch. I don't accept your request or complaint for this tag. (^O^).

We created the seed file of a neologism dictionary of a POS tagger on 2015-12-10.

The seed file in this tag (v0.0.4) will not update forever.

Therefore, this tag is very useful for the following applications.

  • Experiments for evaluation of the research results
  • Reproducibility of the experimental results of others
  • Creation of the processing results of morphological analysis that doesn't update forever

We created the seed file using following resources.

  • Dump data of hatena keyword
  • Japanese postal code number data download (ken_all.lzh)
  • The name-of-the-station list of whole country of Japan
  • The entry data of person names (last name / first name)
  • The entry data of Unicode emojis (version 8.0)
  • The entry data of Kaomoji strings
  • The entry data of adverbs
  • The entry data of adjectives
  • The entry data of adjective verbs
  • The entry data of interjections
  • The entry data of orthographic variant of general nouns
  • A lot of documents, which crawled from Web

2015-11-26

27 Nov 02:54
Compare
Choose a tag to compare

CAUTION
If you are the beginner of NLP, I don't recommend that you use this tag. I recommend that you use the latest version of master branch. I don't accept your request or complaint for this tag. (^O^).

We created the seed file of a neologism dictionary of a POS tagger on 2015-11-26.

The seed file in this tag (v0.0.3) will not update forever.

Therefore, this tag is very useful for the following applications.

  • Experiments for evaluation of the research results
  • Reproducibility of the experimental results of others
  • Creation of the processing results of morphological analysis that doesn't update forever

We created the seed file using following resources.

  • Dump data of hatena keyword
  • Japanese postal code number data download (ken_all.lzh)
  • The name-of-the-station list of whole country of Japan
  • The entry data of person names (last name / first name)
  • The entry data of Unicode emojis (version 8.0)
  • The entry data of Kaomoji strings
  • The entry data of adverbs
  • The entry data of adjectives
  • The entry data of interjections
  • The entry data of orthographic variant of general nouns
  • A lot of documents, which crawled from Web

2015-06-23

22 Jun 19:36
Compare
Choose a tag to compare

CAUTION
If you are the beginner of NLP, I don't recommend that you use this tag. I recommend that you use the latest version of master branch. I don't accept your request or complaint for this tag. (^O^).

We created the seed file of a neologism dictionary of a POS tagger on 2015-06-23.

The seed file in this tag (v0.0.2) will not update forever.

Therefore, this tag is very useful for the following applications.

  • Experiments for evaluation of the research results
  • Reproducibility of the experimental results of others
  • Creation of the processing results of morphological analysis that doesn't update forever

We created the seed file using following resources.

  • Dump data of hatena keyword
  • Japanese postal code number data download (ken_all.lzh)
  • The name-of-the-station list of whole country of Japan
  • The entry data of the person name (last name / first name)
  • The entry data of Unicode emoji
  • The entry data of the adverbs
  • A lot of documents, which crawled from Web

2015-03-24

23 Mar 16:14
Compare
Choose a tag to compare

CAUTION
If you are the beginner of NLP, I don't recommend that you use this tag. I recommend that you use the latest version of master branch. I don't accept your request or complaint for this tag. (^O^).

We created the seed file of a neologism dictionary of a POS tagger on 2015-03-24.

The seed file in this tag (v0.0.1) will not update forever.

Therefore, this tag is very useful for the following applications.

  • Experiments for evaluation of the research results
  • Reproducibility of the experimental results of others
  • Creation of the processing results of morphological analysis that doesn't update forever

We created the seed file using following resources.

  • Dump data of hatena keyword
  • Japanese postal code number data download (ken_all.lzh)
  • The name-of-the-station list of whole country of Japan
  • The entry data of the person name (last name / first name)
  • A lot of documents, which crawled from Web