Skip to content

harryw1248/COVID_19_Misinformation_Weakly_Supervised_BWGS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics

A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics (BERT-Based)

A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics was accepted and presented at Cyber and Social Threats (CySoc) at the International Conference for Web and Social Media (ICWSM) 2022: https://cysoc2022.github.io/

Publication Link: https://arxiv.org/pdf/2205.09416.pdf

Slides: https://docs.google.com/presentation/d/1gQ-yghyB-2MbQfIXCQ3e_R8S8m7tstZg/edit?usp=sharing&ouid=114370483152309300785&rtpof=true&sd=true

Abstract

The COVID-19 pandemic has been accompanied by an ‘infodemic’ of accurate and inaccurate health information across social media. Detecting misinformation amidst dynamically changing information landscape is challenging; identifying relevant keywords and posts is arduous due to the large amount of human effort required to inspect the content and sources of posts. We aim to reduce the resource cost of this process by introducing a weakly-supervised iterative graph-based approach to detect keywords, topics, and themes related to misinformation, with a focus on COVID19. Our approach can successfully detect specific topics from general misinformation-related seed words in a few seed texts. Our approach utilizes the BERT-based Word Graph Search (BWGS) algorithm that builds on context-based neural network embeddings for retrieving misinformation-related posts. We utilize Latent Dirichlet Allocation (LDA) topic modeling for obtaining misinformation-related themes from the texts returned by BWGS. Furthermore, we propose the BERT-based Multi-directional Word Graph Search (BMDWGS) algorithm that utilizes greater starting context information for misinformation extraction. In addition to a qualitative analysis of our approach, our quantitative analyses show that BWGS and BMDWGS are effective in extracting misinformation-related content compared to common baselines in low data resource settings. Extracting such content is useful for uncovering prevalent misconceptions and concerns and for facilitating precision public health messaging campaigns to improve health behaviors.

About

A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics (BERT-Based)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published