data-cleaning-pipeline

The dataset I wrangled, analysed, and visualised is the tweet archive of Twitter user @dog_rates, also known as WeRateDogs. WeRateDogs is a Twitter account that rates people's dogs and adds a humorous comment about each dog.

python data-science twitter data-analytics data-analysis data-wrangling data-exploration data-analyst-nanodegree data-analysis-python weratedogs data-cleaning-pipeline data-analyst-with-python data-interpretation data-wrangling-twitter

Updated Nov 25, 2021
HTML

ved93 / ml-express

A Python library for day-to-day data analysis and machine learning. It aims to make data building, cleaning, and machine learning much faster, and provides extension and helper modules for Python's data analysis and machine learning libraries.

visualization data-science machine-learning eda data-preprocessing feature-engineering data-preparation pandas-profiling data-summarization data-cleaning-pipeline

Updated Jan 12, 2022
Python

CeliaMuriel / inconsistent-company-names-demo

Inconsistent company names demo

gcp google-cloud fuzzy-matching google-cloud-platform data-cleaning data-quality data-cleansing trifacta data-cleaning-pipeline cloud-dataprep data-cleanup data-cleaning-and-preprocessing

Updated Mar 5, 2022

Elysian01 / Data-Purifier

A Python library for automated exploratory data analysis, automated data cleaning, and automated data preprocessing for machine learning and natural language processing applications.

data-science jupyter exploratory-data-analysis python-library python-lib eda data-visualization python3 data-analysis data-preprocessing data-cleaning data-cleaning-pipeline datapurifier

Updated May 6, 2022
Jupyter Notebook

xyuebai / data-etl-for-ml

Dockerized data ETL for machine learning, including data crawling, data transforming/cleaning, and saving data to S3

docker etl aws-s3 boto3 data-cleaning-pipeline

Updated Oct 19, 2022
Python

AnalystHub-Hub / IBM-Data-Science-Professional-Certificate

I learnt data science through hands-on practice in the IBM Cloud using real data science tools and real-world data sets.

python data-science machine-learning ibm-watson-services machine-learning-algorithms data-visualization data-extraction data-scraping data-cleaning-pipeline ibm-cognos-analytics

Updated Oct 20, 2022
Jupyter Notebook

everks / dial-clean

Chinese dialogue data cleaning

dialog data-cleaning-pipeline

Updated Nov 8, 2022
Python

LaureBerti / Learn2Clean

Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning

reinforcement-learning data-preprocessing automated data-cleaning data-curation data-cleaning-pipeline

Updated Dec 26, 2022
Python

vdechen / DataAnalysis_NGO

This data analysis and visualization project aims to present the work of the OBA-Floripa NGO to authorities and the general population. The goal is to make the case for continued funding, given the positive impact of the organization's activities on public health issues.

python dashboard data-visualization data-analysis tableau-public data-cleaning-pipeline