Skip to content

Popular repositories

  1. pyspark-benchmark pyspark-benchmark Public

    A lightweight benchmark utility for PySpark

    Python 14 4

  2. spark-data-analysis-projects spark-data-analysis-projects Public

    A collection of data analysis projects done using PySpark via Jupyter notebooks.

    Jupyter Notebook 8 7

  3. personal-compute-cluster personal-compute-cluster Public

    Software and tools for setting up and operating a personal compute cluster, with focus on big data.

    Jupyter Notebook 6 6

  4. odroid-xu4-cluster odroid-xu4-cluster Public

    Files, config, tools, and example code used for setting up an ODROID XU4 mini cluster

    Shell 5 2

  5. qfs qfs Public

    Forked from quantcast/qfs

    Quantcast File System

    C++

  6. spark-terasort spark-terasort Public

    Forked from ehiggs/spark-terasort

    Spark Terasort

    Java

Repositories

Showing 6 of 6 repositories

Top languages

Loading…

Most used topics

Loading…