A movie recommender written in Go that suggests movies considering various factors within a particular dataset, encompassing users, movies, and movie ratings.
-
Updated
Apr 21, 2024 - Go
A movie recommender written in Go that suggests movies considering various factors within a particular dataset, encompassing users, movies, and movie ratings.
Study of French hospital production. (2021)
BigQuery data pipeline with dbt, Spark, Docker, Airflow, Terraform, GCP
Solved tasks of the master's degree courses of speciality "Algorithms and Systems for Big Data Processing".
"Provides tools for parallel pipeline processing of large data structures
Degree diploma project
Software basati su metodi di intelligenza artificiale per l'automazione dell'analisi di big data.
Collection of homework (mostly Spark-based) from the course "Big Data Computing" - University of Padua.
Big Data and AI Engineering bootcamp 2nd capstone project. Using Big Data Tools to predict the probability of university enrollment for Egypt's High School students. 🏫 📚 🔬
Welcome, feel free to navigate through my project. Detail information about each project can be found inside specified directory.
Experiment to record as much data as possible in a given amount of time using a distributed timeseries database.
A Docker Compose Template to deploy Airflow with sync from a remote repository
Tech blog / notes from my various endeavours and exploits
Building Data Lake and ETL pipelines using Amazon EMR, S3, and Apache Spark
Analyzing classified ads data from the used motorcycles market. Tasks involve utilizing Redis Bitmaps for analytics on seller actions and MongoDB for analyzing bike listings. Includes data installation, cleaning, and analysis.
datasets-toolbox are some scripts usefull to generate, transfom and valid large dataset files, not openable with editor because too large. datasets-toolbox provide also a ping script.
Standard Hadoop MapReduce Tasks using Java
Project using Python, Hive and MapReduce to compare various techniques to find the top K words in a very large file i.e. different techniques to process Big Data.
Add a description, image, and links to the big-data-processing topic page so that developers can more easily learn about it.
To associate your repository with the big-data-processing topic, visit your repo's landing page and select "manage topics."