This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS CDK applications.
This repository contains source code for the AWS Database Blog Post Reduce data archiving costs for compliance by automating RDS snapshot exports to Amazon S3
Terraform configuration that creates several AWS services, uploads data to S3, and starts the Glue Crawler and Glue Job.
ETL data pipeline using AWS services.
AWS Athena, Glue Database, Glue Crawler, and S3 bucket deployment through the AWS Management Console.
This project uses the Trending YouTube Video Statistics dataset from Kaggle, analyzing and preparing it for downstream use.
Creating an audit table for a DynamoDB table using CloudTrail, Kinesis Data Streams, Lambda, S3, Glue, Athena, and CloudFormation.
Analyzed a multi-category e-commerce store using big data techniques on a Kaggle dataset, with the help of AWS EC2, AWS S3, PySpark, AWS Glue ETL, AWS Athena, AWS CloudFormation, AWS Lambda, and Power BI.
AWS Athena, Glue Database, Glue Crawler, and S3 bucket deployment through a CloudFormation stack on the AWS console.
Working with Glue Data Catalog and Running the Glue Crawler On Demand
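Running a Glue crawler on demand, as the repository above demonstrates, can be sketched with boto3. This is a minimal sketch: the crawler name passed in is a placeholder, and the state check reflects the fact that a crawler can only be started while it is idle (`READY`).

```python
def crawler_can_start(state: str) -> bool:
    """A Glue crawler can only be started from the READY (idle) state."""
    return state == "READY"


def run_crawler_once(crawler_name: str) -> str:
    """Start the named Glue crawler if it is idle; return its state."""
    import boto3  # imported lazily so the pure helper above needs no AWS SDK

    glue = boto3.client("glue")
    state = glue.get_crawler(Name=crawler_name)["Crawler"]["State"]
    if crawler_can_start(state):
        glue.start_crawler(Name=crawler_name)
        return "RUNNING"
    return state  # RUNNING or STOPPING: do not start it again
```

Calling `run_crawler_once("my-crawler")` with valid AWS credentials starts the crawler only when no run is already in progress, avoiding the `CrawlerRunningException` that `start_crawler` raises otherwise.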
Automation framework to catalog AWS data sources using Glue
A pipeline within AWS to capture schema changes in S3 files and to update them in a DB.
This project establishes a robust data pipeline for tracking and analyzing sales performance using various AWS services: creating a DynamoDB database, implementing Change Data Capture (CDC), streaming changes through Kinesis, and finally storing and querying the data in Amazon Athena.
An end-to-end data pipeline built with AWS S3, Glue, Glue Crawler, Athena, and Tableau visualization.
This project combines some of my favorite technologies - open data, cloud computing, and Jupyter notebooks.
This project aims to collect, catalog, govern, process, and visualize data.
Unveiling job market trends with Scrapy and AWS
Real-Time Data Analysis of the Stock Market Using Kafka
Implemented an ETL pipeline on AWS for Play Store data using Lambda, Glue Crawlers, and Glue ETL jobs. Orchestrated the workflow with Step Functions, achieving seamless integration, optimal data merging, and improved data quality and accessibility.
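A crawler-then-job workflow like the one described above can be expressed as an Amazon States Language definition built in Python. This is an illustrative sketch: the state names and the crawler/job names are placeholders; `glue:startJobRun.sync` is Step Functions' optimized integration that waits for a Glue job run to finish, while crawler starts are fire-and-forget.

```python
import json


def etl_state_machine(crawler_name: str, job_name: str) -> str:
    """Build an ASL definition: start a crawler, then run a Glue ETL job."""
    definition = {
        "Comment": "Crawl raw data, then run the ETL job (illustrative names)",
        "StartAt": "StartCrawler",
        "States": {
            "StartCrawler": {
                "Type": "Task",
                # SDK integration: starts the crawler, does not wait for it
                "Resource": "arn:aws:states:::aws-sdk:glue:startCrawler",
                "Parameters": {"Name": crawler_name},
                "Next": "RunEtlJob",
            },
            "RunEtlJob": {
                "Type": "Task",
                # .sync waits for the job run to complete before finishing
                "Resource": "arn:aws:states:::glue:startJobRun.sync",
                "Parameters": {"JobName": job_name},
                "End": True,
            },
        },
    }
    return json.dumps(definition, indent=2)
```

The returned JSON can be passed as the `definition` of a `AWS::StepFunctions::StateMachine` resource or to `create_state_machine` in boto3; a production version would also poll the crawler's state before launching the job.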
An end-to-end data engineering project on real-time stock market data using Kafka, built with Python, Amazon Web Services (AWS), Apache Kafka, Glue, Athena, and SQL.