Bruin is a data pipeline tool designed to be easy to use. It supports building data pipelines in SQL and Python and has built-in data quality checks.
Updated Jun 10, 2024 - Python
This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2017LT Database.
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
A visual data pipeline builder with various backends
Fictional match results of European leagues transformed and displayed appropriately (season 2023/2024)
SuccessSage is an end-to-end ML project that predicts student exam performance using demographic and academic data, offering educators actionable insights to enhance educational outcomes through a comprehensive web interface.
The goal is to eliminate manual work in identifying faulty wafers. Opening and handling suspected wafers disrupts the entire process. False negatives result in wasted time, manpower, and costs.
A collection of actions for working with PX4 data
Advanced and Fast Data Transformation in R
Data transformation framework for ETL processing with SQL-like syntax and GIS extensions, based on Apache Spark
Extensions to Kiba ETL
This project guides you through processing data from CSV to JSON format using Python. You'll learn to cleanse, validate, and transform data with pandas, numpy, csv, and json libraries, ensuring it's ready for POS system integration. This will help improve data integrity and streamline integration.
Raku package with data reshaping functions for different data structures (full arrays, Red tables, Text::CSV tables).
This project is a web application for predicting loan approval status based on various financial and personal attributes. It uses a machine learning model that I trained on historical loan data to make predictions. I built the web application using Flask for the web framework, SQLite for the database, and the pre-trained model saved with joblib.
Transforms raw Survey Monkey data from wide format to long format in Excel and Python. *Some data have been removed or altered for demo purposes.
Skills: SQL, Tableau.
Skills: Excel, SQL, Tableau
Skills: Python (Pandas, Numpy, Matplotlib, Seaborn, Sklearn, Statsmodels)
object flow treatment, data transformation
Scalable identity resolution, entity resolution, data mastering and deduplication using ML