Skip to content

A course project investigating the public opinion on Chinese social media Weibo during the early onset of COVID-19.

Notifications You must be signed in to change notification settings

Yvonne27Jin/POLI3115

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 

Repository files navigation

POLI3115

This is a group project research proposal from POLI3115 Politics and public opinion, HKU. You can read the full version of the proposal here.

Data sources:

Original dataset:Zhai, Yujia, 2020, "Weibo COVID dataset", https://doi.org/10.7910/DVN/DULFFJ, Harvard Dataverse, V1

Chinese emotional dataset used for sentiment analysis, imported in code folder as 情感词汇本体.xlsx.

Code

1_datafile_stream_processing-jsonTocsv.ipynb: Convert .json datafile to pandas dataframe and store as .csv files.

2.1_data_stream_processing-sampling.ipynb: Randomly sample 1% of the original sample for stage 1 explorative pilot analysis.

2.2_sentiment_analysis_topic_modeling.Rmd: Text tokenisation, sentimentt analysis, topic modeling of the sampled data in R. Visualisation included.

3.1_data_stream_processing-sub topic.ipynb: Filter sub-datasets with key words from the original dataset.

3.2_subtopic_sentiment_analysis.Rmd: An automated function conducting text processing and sentiment analysis for the sub-topic datasets. Results exported for visualisation in Tableau.

Sample data:

word_freq.rds: Cleaned word frequency by post and by date, from sampled data.

About

A course project investigating the public opinion on Chinese social media Weibo during the early onset of COVID-19.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published