This project showcases data cleaning techniques using R on a dataset of housing sales in Nashville from 2013–2016.
I used R and packages like tidyverse
, janitor
, and lubridate
to clean, transform, and prepare the dataset for analysis. The steps included:
- Cleaning and standardizing column names
- Handling missing values
- Formatting date fields
- Standardizing text fields
- Outlier detection and removal
Click here to view the full project
nashville_cleaning.Rmd
: Source code for the projectnashville_cleaning.html
: Rendered HTML fileNashville_housing_data_2013_2016.csv
: The dataset