Spark based applications to perform big data analytics
-
Updated
May 28, 2024 - Python
Spark based applications to perform big data analytics
This project was completed as part of the CIT 650 "Intro To Big Data" course at Nile University.
Analyzed Apple's dataset to check how many people bought Airpods after buying Mac or iPhone. Thereafter, using ML and predictive analytics to check future outcomes.
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Spark for Data Science and ETL process.
This is a bibliography survey upon Distributed Machine Learning. The survey contains algorithmic selections and architectures that can facilitate distributed learning on ML models. There is also a part that presents MLlib, a ML library from Apache Spark for distributed ML implementations.
Analysing the taxi trips in New York City and predicting total fare amount of taxi trips
Our own development branch of the well known WPF document docking library
大数据框架 Spark MLlib 机器学习库基础算法全面讲解,附带齐全的测试文件
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Projects of the subject Massive Data Processing Engineering at Universidad Internacional de La Rioja.
PySpark pipeline for median house value prediction
Repositório do curso "Spark: processamento de linguagem natural" da Alura.
Repositório do curso "Spark: criando modelos de classificação" da Alura.
Practicum Workshop
A bag of words analisys based on IMDB movie opinions with PySpark
trabalho de pbd
Add a description, image, and links to the mllib topic page so that developers can more easily learn about it.
To associate your repository with the mllib topic, visit your repo's landing page and select "manage topics."