A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics

This work was accepted and presented at the Workshop on Cyber Social Threats (CySoc), held at the International AAAI Conference on Web and Social Media (ICWSM) 2022: https://cysoc2022.github.io/

Publication Link: https://arxiv.org/pdf/2205.09416.pdf

Slides: https://docs.google.com/presentation/d/1gQ-yghyB-2MbQfIXCQ3e_R8S8m7tstZg/edit?usp=sharing&ouid=114370483152309300785&rtpof=true&sd=true

Abstract

The COVID-19 pandemic has been accompanied by an ‘infodemic’ of accurate and inaccurate health information across social media. Detecting misinformation amid a dynamically changing information landscape is challenging; identifying relevant keywords and posts is arduous because of the large amount of human effort required to inspect the content and sources of posts. We aim to reduce the resource cost of this process by introducing a weakly-supervised iterative graph-based approach to detect keywords, topics, and themes related to misinformation, with a focus on COVID-19. Our approach can successfully detect specific topics from general misinformation-related seed words in a few seed texts. Our approach utilizes the BERT-based Word Graph Search (BWGS) algorithm, which builds on context-based neural network embeddings to retrieve misinformation-related posts. We utilize Latent Dirichlet Allocation (LDA) topic modeling to obtain misinformation-related themes from the texts returned by BWGS. Furthermore, we propose the BERT-based Multi-directional Word Graph Search (BMDWGS) algorithm, which utilizes greater starting context information for misinformation extraction. In addition to a qualitative analysis of our approach, our quantitative analyses show that BWGS and BMDWGS are effective in extracting misinformation-related content compared to common baselines in low data resource settings. Extracting such content is useful for uncovering prevalent misconceptions and concerns and for facilitating precision public health messaging campaigns to improve health behaviors.
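
The general idea of iteratively expanding a few seed words over a similarity graph, then retrieving posts that mention the expanded keywords, can be sketched as follows. This is only an illustrative toy and not the BWGS algorithm from the paper or the code in BWGS_Model.py: the hand-made vectors in `TOY_EMBEDDINGS` stand in for BERT embeddings, and the names `expand_keywords`, `retrieve_posts`, `threshold`, and `max_iters` are hypothetical. The LDA topic modeling step is omitted.

```python
import math

# Hypothetical toy vectors standing in for contextual BERT embeddings.
TOY_EMBEDDINGS = {
    "hoax": (1.0, 0.1, 0.0),
    "fake": (0.9, 0.2, 0.0),
    "plandemic": (0.8, 0.3, 0.1),
    "microchip": (0.7, 0.5, 0.1),
    "vaccine": (0.3, 0.9, 0.1),
    "weather": (0.0, 0.1, 1.0),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def expand_keywords(seeds, embeddings, threshold=0.9, max_iters=3):
    """Grow the seed set by repeatedly adding words similar to the frontier.

    Two words are treated as connected in the word graph when their cosine
    similarity is at least `threshold` (an assumed, illustrative cutoff).
    """
    found = set(seeds)
    frontier = set(seeds)
    for _ in range(max_iters):
        next_frontier = set()
        for word in frontier:
            for candidate, vector in embeddings.items():
                if candidate in found:
                    continue
                if cosine(embeddings[word], vector) >= threshold:
                    next_frontier.add(candidate)
        if not next_frontier:
            break  # no new neighbors: the search has converged
        found |= next_frontier
        frontier = next_frontier
    return found


def retrieve_posts(posts, keywords):
    """Return posts containing at least one keyword (whitespace tokens)."""
    return [p for p in posts if keywords & set(p.lower().split())]


keywords = expand_keywords({"hoax"}, TOY_EMBEDDINGS)
posts = ["the microchip story is spreading", "nice weather today"]
matches = retrieve_posts(posts, keywords)
```

In this toy data, "microchip" is not similar enough to the seed "hoax" to be added directly; it is reached in the second iteration through "fake" and "plandemic", which is the kind of multi-hop discovery an iterative graph search is meant to provide.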

harryw1248/COVID_19_Misinformation_Weakly_Supervised_BWGS