


Classification of Japanese news with BERT_multi

BERT, or Bidirectional Encoder Representations from Transformers by Google, is a new method of pre-training language representations which obtains state-of-the-art results on a wide array of Natural Language Processing (NLP) tasks.

The academic paper by Google which describes BERT in detail and provides full results on a number of tasks can be found here: https://arxiv.org/abs/1810.04805.

I use the “livedoor news corpus” (1) for this experiment. The details of the experiment are explained in this blog post: https://toshistats.wordpress.com/2019/04/30/bert-performs-very-well-in-japanese-in-our-experiment/
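The sketch below shows, in outline, how multilingual BERT can be fine-tuned for this kind of news-title classification. It is a minimal illustration, not the repository's actual training script: it assumes the Hugging Face transformers library (rather than Google's original BERT code), and the placeholder titles, labels, and hyperparameters are illustrative.

```python
# Minimal fine-tuning sketch for Japanese news-title classification with
# multilingual BERT ("BERT_multi"). Assumes Hugging Face transformers and
# TensorFlow; not the repository's exact pipeline.
import tensorflow as tf
from transformers import BertTokenizer, TFBertForSequenceClassification

MODEL = "bert-base-multilingual-cased"  # multilingual BERT checkpoint
NUM_LABELS = 9                          # livedoor news corpus has 9 categories

tokenizer = BertTokenizer.from_pretrained(MODEL)
model = TFBertForSequenceClassification.from_pretrained(MODEL, num_labels=NUM_LABELS)

def encode(titles, labels, max_len=64):
    """Tokenize news titles into fixed-length input IDs and attention masks."""
    enc = tokenizer(titles, padding="max_length", truncation=True,
                    max_length=max_len, return_tensors="tf")
    return dict(enc), tf.constant(labels)

# Placeholder data: in the experiment these would be the 3,153 training
# titles and their category labels prepared from the livedoor news corpus.
train_titles = ["サンプルのニュースタイトル"]
train_labels = [0]

train_x, train_y = encode(train_titles, train_labels)

model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=2e-5),
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    metrics=["accuracy"],
)
model.fit(train_x, train_y, batch_size=32, epochs=3)
```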

Evaluation results

test_accuracy = 0.8744,

fine-tuned on the livedoor news corpus (3,153 training samples, 826 test samples)
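Continuing the sketch above, the test accuracy would be obtained by evaluating the fine-tuned model on the held-out split (here `test_titles` and `test_labels` stand in for the 826 test samples):

```python
# Evaluate on the held-out test split (continuation of the sketch above).
test_titles = ["別のサンプルタイトル"]  # placeholder for the 826 test titles
test_labels = [0]

test_x, test_y = encode(test_titles, test_labels)
loss, acc = model.evaluate(test_x, test_y, batch_size=32)
print(f"test_accuracy = {acc:.4f}")  # the repository reports 0.8744
```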

(1) livedoor news corpus, CC BY-ND 2.1 JP: https://creativecommons.org/licenses/by-nd/2.1/jp/

Notice: ToshiStats Co., Ltd. and I do not accept any responsibility or liability for loss or damage occasioned to any person or property through using materials, instructions, methods, algorithms or ideas contained herein, or acting or refraining from acting as a result of such use. ToshiStats Co., Ltd. and I expressly disclaim all implied warranties, including merchantability or fitness for any particular purpose. There will be no duty on ToshiStats Co., Ltd. and me to correct any errors or defects in the code and the software.

About

Classification of Japanese news titles with BERT & TensorFlow
