Skip to content

This project demonstrates data cleaning on the Nashville Housing dataset (2013–2016) using R and packages like tidyverse, lubridate, and janitor. Key steps included standardizing column names, handling missing values, formatting dates, cleaning text fields, and identifying outliers. The cleaned dataset is now ready for analysis and modeling to unco

Notifications You must be signed in to change notification settings

nicolepcollins/nashville-housing-dataset-cleaning

Repository files navigation

Nashville Housing Data Cleaning Project

This project showcases data cleaning techniques using R on a dataset of housing sales in Nashville from 2013–2016.

📄 Project Summary

I used R and packages like tidyverse, janitor, and lubridate to clean, transform, and prepare the dataset for analysis. The steps included:

  • Cleaning and standardizing column names
  • Handling missing values
  • Formatting date fields
  • Standardizing text fields
  • Outlier detection and removal

🔗 View Project on RPubs

Click here to view the full project

📁 Files Included

  • nashville_cleaning.Rmd: Source code for the project
  • nashville_cleaning.html: Rendered HTML file
  • Nashville_housing_data_2013_2016.csv: The dataset

About

This project demonstrates data cleaning on the Nashville Housing dataset (2013–2016) using R and packages like tidyverse, lubridate, and janitor. Key steps included standardizing column names, handling missing values, formatting dates, cleaning text fields, and identifying outliers. The cleaned dataset is now ready for analysis and modeling to unco

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages