#

pyspark-notebook

Here are 191 public repositories matching this topic...

Ragadeepthi / Loading-different-types-of-data-files-using-Flume-and-pyspark

Loading different types of dataset files using Flume and pyspark

python machine-learning pyspark machinelearning pyspark-notebook pyspark-python

Updated Jul 4, 2019
Python

mhaseebtariq / pyspark-helpers

Useful helper functions for PySpark dataframe operations

pyspark pyspark-notebook pyspark-dataframes pyspark-joins pyspark-helpers flexible-joins join-duplicate-columns

Updated May 25, 2022
Jupyter Notebook

rdeo265 / aws-pyspark

BDAS with PySpark on AWS

aws machine-learning data-mining jupyter-notebook pyspark pyspark-notebook

Updated Oct 24, 2020
Jupyter Notebook

samuelesimone / Pyspark-fundamentals

Pyspark fundamentals

pyspark pyspark-notebook pyspark-examples

Updated Jan 10, 2023
Jupyter Notebook

saraparveen26 / Home-Sales---BigData

This project creates and examines different metrics about Home Sales data.

bigdata pyspark pyspark-notebook googlecolab

Updated Jun 5, 2023
Jupyter Notebook

abhinit21 / data-analysis-pyspark

analyze the data set of world championship chess games using PySpark

pyspark data-analysis pyspark-notebook colab-notebook

Updated Nov 2, 2022
Jupyter Notebook

akanshu22 / Triangle-Counting-Problem-in-Apache-Spark

Implementation of Triangle Counting Problem in Apache Spark

apache-spark acm triangle-counting conference-paper paper-implementations pyspark-notebook

Updated May 15, 2017
Jupyter Notebook

rantoncuadrado / udacity_capstone_project

Udacity Data Engineering Nanodegree. Capstone Project.

spark pyspark pyspark-notebook

Updated Aug 19, 2021
Jupyter Notebook

caiocmb7 / python-rep

Studies about python, including basic stuffs and oop

python basic study oop projects pyspark-notebook

Updated Jan 12, 2023
Jupyter Notebook

prashantpal711 / Avg-Movie-Rating

Simple project to get average of available ratings of the movies from the dataset available using PySpark.

python3 rdd pyspark-notebook

Updated Jun 25, 2021
Jupyter Notebook

rezaneo7 / Persian-Wikipedia-Analysis

python spark pyspark pyspark-notebook

Updated Mar 15, 2022
Jupyter Notebook

dlleonardo / spark-de-ml-assignments

Spark DE&ML assignments from the "Data Engineering and Machine Learning with Spark" course (offered by IBM Skills Network)

spark pyspark pyspark-notebook sparkml-pipelines

Updated Jan 18, 2023
Jupyter Notebook

rsantos2032 / Cardiovascular-Disease-Detection

Cardiovascular Disease Detection using PySpark

spark hadoop python3 pyspark pyspark-notebook pyspark-machine-learning

Updated Apr 26, 2024
Jupyter Notebook

90Nitin / pyspark-jupyter-kernel

Installation instructions for pyspark and a kernel with jupyter

helper tutorial spark jupyter installer pyspark jupyter-notebooks installer-script pyspark-notebook

Updated Feb 5, 2019
Shell

gonzalf1 / pysparky

Customized PySpark Docker image with R support

dockerfile r spark conda pyspark-notebook

Updated Mar 8, 2019
Dockerfile

aashokvardhan / Analyzing-Neuroimaging-Data-with-PySpark-and-Thunder

spark pyspark thunder pyspark-notebook

Updated Dec 12, 2017
Jupyter Notebook

koirand / spark-notebook-on-k8s-example

Sample to run PySpark on Kubernetes cluster.

kubernetes pyspark pyspark-notebook

Updated Jan 11, 2021
Jupyter Notebook

dlleonardo / spark-assignments

Spark assignments from "Introduction to Big Data" course (offered by IBM Skills Network)

spark pyspark spark-sql pyspark-notebook

Updated Dec 4, 2022
Jupyter Notebook

zestyraiden / KMeans-Clustering-Segmentation-Analysis

Online Retail Cassification for Marketing Segmentation Project using KMeans Clustering, Elbow Method and Silhouette Method for Validation

data-analysis kmeans-clustering spark-sql pyspark-notebook

Updated Sep 8, 2023
Jupyter Notebook

Betico1928 / Talleres-ProcesamientoDeDatosAGranEscala

Exploración los principios del Procesamiento de Datos a Gran Escala con talleres de Databricks y Spark. Aprender herramientas como Pandas y PySpark para el análisis eficiente de grandes conjuntos de datos. Impartidos por John Corredor en la Pontificia Universidad Javeriana.

spark jupyter-notebook pyspark pyspark-notebook databricks-notebooks pandas-python dbfs

Updated Apr 29, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the pyspark-notebook topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pyspark-notebook topic, visit your repo's landing page and select "manage topics."