This project analyzes the relationship between COVID-19 vaccination rates and Gross Domestic Product (GDP) from 2020 to 2023.
The project uses the following data files:
- Raw Data:
data/raw/vaccination.csv
: Raw vaccination data.data/raw/world_bank_economic_indicators_2020_2023.csv
: Raw economic indicators data from the World Bank.
- Cleaned Data:
data/cleaned/vaccination-cleaned.csv
: Cleaned vaccination data.data/cleaned/wbe_cleaned.csv
: Cleaned World Bank economic indicators data.
- Processed Data:
data/processed/merged_data.csv
: Merged and processed data used for analysis.
The analysis is performed using the following R scripts:
scripts/01_fetch_data.r
: Fetches the raw data.scripts/02_data_cleaning/vaccination_cleaning.r
: Cleans the raw vaccination data.scripts/02_data_cleaning/wbe_cleaning.r
: Cleans the raw World Bank economic indicators data.scripts/03_merged_data.r
: Merges the cleaned datasets.
- Ensure you have R installed on your system.
- Open an R environment.
- Set the working directory to the root of this project.
- Run the scripts in the following order:
source("scripts/01_fetch_data.r")
source("scripts/02_data_cleaning/vaccination_cleaning.r")
source("scripts/02_data_cleaning/wbe_cleaning.r")
source("scripts/03_merged_data.r")
- The final merged dataset will be available at
data/processed/merged_data.csv
.