aws-glue
Here are 194 public repositories matching this topic...
A toolkit that provides an object-oriented interface for working with Parquet datasets on AWS
Updated Jun 19, 2023 - Python
Implementation of an ETL data pipeline to load data from S3 to Snowflake and refresh a Tableau data source in AWS
Updated Sep 3, 2023 - Python
Data lake project for a US-based insurance company
Updated Jun 23, 2023 - HCL
Intro to streaming data with Kafka, Spark and AWS Glue
Updated Sep 12, 2023 - Python
The Practical Data Science Specialization brings together these disciplines using purpose-built ML tools in the AWS cloud. It helps you develop the practical skills to effectively deploy your data science projects and overcome challenges at each step of the ML workflow using Amazon SageMaker. This Specialization is designed for data-focused develop
Updated Oct 8, 2023 - Jupyter Notebook
Daily incremental-load ETL pipeline for an e-commerce company using AWS Lambda and an AWS EMR cluster, deployed using Apache Airflow in a Docker container.
Updated Mar 17, 2023 - Python
AWS-hosted enterprise data lake with both batch and real-time data pipelines.
Updated Jul 26, 2020
A simple shell script to delete multiple tables based on table name prefix.
Updated Mar 29, 2021 - Shell
Data Lakehouse solution for data produced by STEDI Step Trainer sensors and the mobile app so that it can train the machine learning module.
Updated Sep 4, 2023 - Python
This project is intended for legacy applications that process data using positional files. The objective is to read these positional files when they arrive in AWS S3, send the data to a data warehouse such as AWS Redshift, and finally read the results with a Business Intelligence tool such as AWS QuickSight.
Updated Feb 16, 2022
Updated Jul 11, 2022 - Python
Load the dataset into an S3 bucket, use AWS Glue to transform it, write a Lambda script to clean it, query it via AWS Athena, then build a dashboard using AWS QuickSight.
Updated Oct 9, 2022 - Python
Working with Glue Data Catalog and running the using S3 Event Notification and creating the entire stack using AWS CloudFormation
Updated May 8, 2023
A small walkthrough of how to create an AWS Glue job pipeline with AWS CDK
Updated Oct 1, 2023 - Python
This project was completed as part of our Database Management System coursework for an MS in Data Analytics, and received recognition as the best project in the class.
Updated Jan 11, 2024 - Jupyter Notebook