EDA, Forecasting, Data Cleaning and quality
Updated Jun 19, 2023 - Jupyter Notebook
Data as a service, data concerns, and data contracts
Generates an Airflow DAG that runs Soda data quality tests.
A set of Metrics and Tools for Data Quality Assessment and Reporting on Rare Diseases Data
This repository provides our generic test protocol for the integration test of ASS.
Generates a similarity key for a street address, for matching inconsistent street addresses within one or more datasets
Generates a similarity key for an individual name, for matching inconsistent names within one or more datasets
Ramblings of a curious mind
This program runs daily to check the quality of sensor and probe data.
PySpark Acceleration, Capgemini 2022
Generates a similarity key for a company or organization name, for matching inconsistent names within one or more datasets
[R package] Tools for data quality testing
Explored transactional data and customer demographics to determine customer trends and behavior in order to highlight new potential high-value customers.
Udacity's Data Engineering Nanodegree project: Data Pipeline with Airflow.
NOW-QUAL: Vaccine coverage survey Near-time Data Monitoring and Cleaning standard development template
Simple Spark wrapper for validating data
OLAP in T-SQL and Python
ADVICE: Save yourself and your colleagues loads of time by taking a few suggestions into account before using Excel.