Scalable identity resolution, entity resolution, data mastering and deduplication using ML
-
Updated
May 11, 2024 - Java
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Advanced and Fast Data Transformation in R
Like awk but with SQL and table joins
Bruin is a data pipeline tool that is designed to be easy-to-use. It allows building data pipelines using SQL and Python, and has built-in data quality checks.
DeltaFi is a flexible, code-light data transformation and normalization platform.
This repository contains the tasks that I've completed during my Data Science Internhip.
This repository contains the Plant Ecosystem Analysis project, utilizing R to investigate the relationship between native plant species richness and ecological factors within diverse geographical gradients.
A visual data pipeline builder with various backends
Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)
Wrangler Transform: A DMD system for transforming Big Data
A collection of actions for working with ROS data
A collection of actions for working with PX4 data
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Welcome to the Simple Income Statement Dashboard project repository! This project features an income statement dashboard developed using Power BI. The dashboard offers visualizations and insights based on Microsoft's income statement data for FY-21 and FY-22, obtained from the official website's financial statements section.
object flow treatment, data transformation
Solutions for #8WeekSQLChallenge using MySQL
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
A website to help users view, verify and modify data for preprocessing and apply various classical ML algorrithms
SQL repository contains my answers to queries and challenges posed by numerous websites.
Add a description, image, and links to the data-transformation topic page so that developers can more easily learn about it.
To associate your repository with the data-transformation topic, visit your repo's landing page and select "manage topics."