Ayush @ Data Engineering Portfolio (Updated May 27, 2024)
One framework to develop, deploy and operate data workflows with Python and SQL.
💜🌈📊 A data engineering project implementing an ETL data pipeline with Dagster, Apache Spark, Streamlit, MinIO, Metabase, dbt, Polars, and Docker 🌺
Automated data pipeline for extracting and storing weather forecasts for the tourism sector.
Let your pipelines flow through Python code in xonsh.
The Security Reference Architecture (SRA) provides Terraform templates implementing the security features deployed by most high-security organizations, enforcing controls for the largest risks customers ask about most often.
In this project, I have created an end-to-end solution for analyzing the latest Bing news data, using Microsoft Fabric for all the tooling.
Pipeline to automate the collection of board game and expansion data from BoardGameGeek's XML API2. Data is stored in Google Cloud Storage and BigQuery and modelled with dbt in a star schema. (Terraform, GCP, Mage, Python, dbt)
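The star-schema pattern mentioned above can be sketched in a few lines of Python with the standard-library `sqlite3` module. Table and column names here are hypothetical illustrations; the actual project models BoardGameGeek data with dbt on BigQuery:

```python
import sqlite3

# Minimal star schema: one fact table surrounded by dimension tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_game (game_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, year INTEGER);
    CREATE TABLE fct_ratings (
        game_id INTEGER REFERENCES dim_game(game_id),
        date_id INTEGER REFERENCES dim_date(date_id),
        avg_rating REAL
    );
""")
conn.execute("INSERT INTO dim_game VALUES (1, 'Gloomhaven')")
conn.execute("INSERT INTO dim_date VALUES (20240101, 2024)")
conn.execute("INSERT INTO fct_ratings VALUES (1, 20240101, 8.6)")

# A typical star-schema query: the fact table joined to its dimensions.
row = conn.execute("""
    SELECT g.name, d.year, f.avg_rating
    FROM fct_ratings f
    JOIN dim_game g USING (game_id)
    JOIN dim_date d USING (date_id)
""").fetchone()
print(row)  # ('Gloomhaven', 2024, 8.6)
```

The payoff of this layout is that analytical queries always follow the same fact-to-dimension join shape, which is what makes dbt models over it easy to generate and test.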
Zillow Data Pipeline: extracts data from Zillow, moves it through AWS services, and performs analytics. Uses Python scripts, AWS Lambda, S3, Amazon Redshift, and QuickSight. Explore docs/images for architecture visuals.
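The extract-transform-load flow described above can be sketched in plain Python. The record fields and function names below are hypothetical stand-ins; the real project runs the load step through AWS Lambda, S3, and Redshift rather than local JSON:

```python
import json

# Hypothetical raw records as they might arrive from an extract step.
RAW_LISTINGS = [
    {"zpid": "101", "price": "350000", "city": "Austin"},
    {"zpid": "102", "price": "not_listed", "city": "Dallas"},
]

def transform(record):
    """Clean one raw listing: coerce price to int, drop unparseable rows."""
    try:
        price = int(record["price"])
    except ValueError:
        return None
    return {"zpid": record["zpid"], "price": price, "city": record["city"]}

def run_pipeline(raw):
    """Transform all records and serialize the clean ones (a stand-in load step)."""
    clean = [t for r in raw if (t := transform(r)) is not None]
    return json.dumps(clean)

print(run_pipeline(RAW_LISTINGS))
```

In a Lambda-based version, `run_pipeline` would be the handler body and the serialized output would be written to an S3 object for Redshift to `COPY` from.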
Data Engineering 🛠️ is the backbone of data processing 📊, managing pipelines 🚀, warehouses 🏢, and lakes 🌊. It's the bridge 🌉 between raw data and actionable insights, powering businesses with efficient data management and analytics 📈.
The NHANES Data 'API' is a Python tool that simplifies access to the National Health and Nutrition Examination Survey (NHANES) dataset. This project provides an easy-to-use API to retrieve NHANES data, helping researchers, data scientists, health professionals, and other stakeholders access these valuable datasets.
This project repo 📺 offers a robust pipeline for managing, processing, and analyzing YouTube video data with AWS services, covering both structured statistics and trending key metrics.
Building a fully automated data pipeline with Google Cloud services.
Docker powered starter for geospatial analysis of lightning atmospheric data.
Scooter-sharing system use case: this project demonstrates local and cloud execution of automated data collection and cleaning pipelines.