Skip to content

Repository to practice the process of data cleaning in python, using pandas. The dataset is raw data of laptops.

Notifications You must be signed in to change notification settings

juanmerino89/laptops-data-cleaning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Laptops Data Cleaning

The purpose of this project is to practice the process of Data Cleaning in Python. I used the libraries Pandas, NumPy, Os and Re.

The raw data, is in the folder data, and it was extracted from the spanish website "www.pccomponentes.com" using Power Automate. The data includes all the Laptops in sale on 25th June 2023. The data columns are as follows:

  • Laptop: Raw data, including all the carachteristics for each laptop.

  • New Price: If there is discounted price, it's included in this column.

  • Old Price: If there is discounted price, the old price it's in this column.

  • Price: The price is here when there isn't any discount.

  • Old Price 2: Sometimes, the price is included here.

  • Status: Here we see the status of the laptop. However, the only interesting value is reacondicionado. If not, it's new.

In LaptopsDataCleaning.ipynb it's all the process of data cleaning, and the final result is the file laptops.csv

Thank you!

About

Repository to practice the process of data cleaning in python, using pandas. The dataset is raw data of laptops.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published