Japan Manifesto Classification Using LDA Model

This repository contain the process that how I deal with Japanese manifestos and conduct a LDA model on the data.

I use Python as main language in processing the txt files and build the csv file that can be use for further data cleaning and preprocessing. And I use Qnanteda package in R to run NLP processing, including building corpus, defining and delimiting textual features, and conducting topic modeling and LDA.

For more information about Quanteda, users could directly read their Source code from: https://github.com/quanteda/quanteda/ As for the data of Japan House of Representative (HOR), users could access it through: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/QFEPXD

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
LICENSE		LICENSE
README.md		README.md
data_clean.ipynb		data_clean.ipynb
data_complie.ipynb		data_complie.ipynb
japan_analysis_bertopic.ipynb		japan_analysis_bertopic.ipynb
lda_modeling.Rmd		lda_modeling.Rmd
topic_modeling.ipynb		topic_modeling.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENSE

LICENSE

README.md

README.md

data_clean.ipynb

data_clean.ipynb

data_complie.ipynb

data_complie.ipynb

japan_analysis_bertopic.ipynb

japan_analysis_bertopic.ipynb

lda_modeling.Rmd

lda_modeling.Rmd

topic_modeling.ipynb

topic_modeling.ipynb

Repository files navigation

Japan Manifesto Classification Using LDA Model

About

Releases

Packages

Languages

License

deankuo/Japan-Manifesto-Classification

Folders and files

Latest commit

History

Repository files navigation

Japan Manifesto Classification Using LDA Model

About

Topics

Resources

License

Stars

Watchers

Forks

Languages