POC in Apache Kafka and Spark Streaming using Avro serialization.
Updated Sep 6, 2018 - Scala
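The Kafka/Spark Streaming entry above centers on Avro serialization. As a hedged illustration of what Avro does on the wire (not code from that repo), the sketch below implements Avro's zigzag varint encoding, which is how Avro binary-encodes `int` and `long` values:

```python
def zigzag_encode(n: int) -> bytes:
    """Encode a signed long as Avro's zigzag varint."""
    z = (n << 1) ^ (n >> 63)  # zigzag maps small magnitudes to small codes
    out = bytearray()
    while True:
        byte = z & 0x7F
        z >>= 7
        if z:
            out.append(byte | 0x80)  # high bit set: more bytes follow
        else:
            out.append(byte)
            return bytes(out)

def zigzag_decode(data: bytes) -> int:
    """Decode an Avro zigzag varint back to a signed long."""
    z, shift = 0, 0
    for byte in data:
        z |= (byte & 0x7F) << shift
        shift += 7
        if not byte & 0x80:
            break
    return (z >> 1) ^ -(z & 1)
```

This covers only the integer primitive; a full Avro record is the concatenation of its fields' encodings, with strings written as a length varint followed by UTF-8 bytes.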
ETL pipeline with AWS Redshift orchestrated with Airflow
Code for data flow between models, data post-processing, and visualization
Udacity Data Engineering Nanodegree - Project #2
Short course: Introduction to Machine Learning
Transformation of an Airbnb data set using dbt and Snowflake, then visualizing the data using Preset
Data pipeline to gather data from chess website APIs using Airflow.
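In a pipeline like the chess-API one above, the transform step typically flattens a nested JSON response before loading. The function and sample payload below are assumptions for illustration, not taken from the repo; in the repo this logic would presumably run inside an Airflow task after the HTTP fetch:

```python
import json

# Illustrative payload shaped like a player-stats response; the real
# schema returned by the chess site's API may differ.
SAMPLE_RESPONSE = json.dumps({
    "chess_rapid": {"last": {"rating": 1512, "date": 1662422400}},
    "chess_blitz": {"last": {"rating": 1433, "date": 1662422400}},
})

def extract_ratings(raw: str) -> dict:
    """Flatten the nested stats payload into {game_mode: rating}."""
    payload = json.loads(raw)
    return {mode: stats["last"]["rating"]
            for mode, stats in payload.items()
            if "last" in stats}

ratings = extract_ratings(SAMPLE_RESPONSE)
```

Keeping the parse step as a pure function like this makes it unit-testable outside the scheduler.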
Analysis of computer game sales
An end-to-end data pipeline deployed on GCP that extracts cryptocurrency data for analytics.
Convolutional Neural Network capable of detecting brain tumors and their locations from 5712 MRI brain scans
A cutting-edge big data initiative aimed at creating a real-time data pipeline to analyze the popularity and sentiments of trending topics on Twitter.
The mini-project for the Database Technologies course. The task is to ingest data via a pipeline built with Spark Streaming and Kafka, and store the processed data in a SQLite database for further manipulation
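The sink side of a pipeline like the one above usually writes each processed micro-batch into SQLite in a single transaction. The sketch below stands in the stream with a plain list of batches (the table name and tuple shape are assumptions, not from the project):

```python
import sqlite3

def write_batch(conn: sqlite3.Connection, batch: list[tuple[str, int]]) -> None:
    """Persist one processed micro-batch of (word, count) rows into SQLite."""
    with conn:  # commits on success, rolls back on error
        conn.executemany(
            "INSERT INTO word_counts (word, count) VALUES (?, ?)", batch
        )

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE word_counts (word TEXT, count INTEGER)")

# Stand-in for micro-batches arriving from Spark Streaming + Kafka.
for batch in [[("kafka", 3), ("spark", 5)], [("sqlite", 2)]]:
    write_batch(conn, batch)

total = conn.execute("SELECT SUM(count) FROM word_counts").fetchone()[0]
```

Wrapping each batch in one transaction (`with conn:`) keeps the table consistent if a batch fails midway.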
A data pipeline project that leverages Docker and PostgreSQL for efficient data processing and analysis tasks. Uses containerization to ensure portability and reproducibility of the data pipeline.
Deployable AWS data platform to process powerlifting data extracted from openpowerlifting.org.
💸 A Python module for building a portfolio assessment pipeline
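A portfolio assessment pipeline like the module above typically starts from per-asset returns and a weighted aggregate. The two helpers below are a minimal sketch of that idea; the function names and inputs are illustrative assumptions, not the module's actual API:

```python
def simple_returns(prices: list[float]) -> list[float]:
    """Period-over-period simple returns from a price series."""
    return [(b - a) / a for a, b in zip(prices, prices[1:])]

def portfolio_return(weights: dict[str, float],
                     asset_returns: dict[str, float]) -> float:
    """Weighted sum of per-asset returns for one period."""
    return sum(w * asset_returns[name] for name, w in weights.items())

# Example: 60/40 split across two hypothetical tickers.
rets = simple_returns([100.0, 110.0, 99.0])
combined = portfolio_return({"AAA": 0.6, "BBB": 0.4},
                            {"AAA": 0.05, "BBB": 0.01})
```

A real assessment module would layer risk metrics (volatility, drawdown) on top of the same return series.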
This is a basic example of using a pipeline in data science.
This is the data pipeline for the url-shortner application. Deprecated in favor of https://github.com/Dukes-Wine-Co/request-parsing-api
ETL pipeline with PySpark on Dataproc for data lake on Google Cloud Storage
An easy-to-use, reliable, and well-designed Python module that domain experts and data scientists can use to fetch, visualise, and transform publicly available satellite and LIDAR data.