A data pipeline that provides with web-scraped information from Steam
-
Updated
Mar 15, 2024 - Python
A data pipeline that provides with web-scraped information from Steam
This repository contains an example project (Jaffle Shop) demonstrating integration between Superset and dbt, with BigQuery as the data warehouse.
DE Project to keep track of my personal health metrics in a Data Warehouse
Run an open-source data LakeHouse locally using Docker Compose
IGTI MBA Engenharia de dados - Projeto Aplicado - Repositório de logs
Minimalistic and free Modern Data Stack, hence for all. Visualisation layer.
Dockerised version of apache superset
airflow and data science playground
Created an analytical model for health of a cohort based on latest data using Cube.dev
POC. Using HE data, preparation, ingestion into Druid, vis in superset, orchestration in Airflow
A data pipeline to ingest, process, store storm events datasets so we can access them through different means.
South African Government's expenditure
A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance at this address (powered down):
The simple bot to share your favourite movies with friends
Quick demo of modern tech stack for streaming data pipelines
Tool for finding CVE-2023-27524 (Apache Superset - Authentication Bypass)
A realtime ingestion and profiling engine for fast data.
Add a description, image, and links to the apache-superset topic page so that developers can more easily learn about it.
To associate your repository with the apache-superset topic, visit your repo's landing page and select "manage topics."