adrianmarino/spark-examples
Install Spark & Jupyter on Arch Linux

Step 1: Install Spark.

yaourt -S apache-spark

Step 2: Install Anaconda/Jupyter.

yaourt -S anaconda

Step 3: Create ~/.jupyterrc and add:

# Tell Spark to run locally and use all CPU cores.
export MASTER=local[*]
export SPARK_HOME=/opt/apache-spark
export PYTHONPATH=$SPARK_HOME/python

export ANACONDA_ROOT=/opt/anaconda
export PYSPARK_DRIVER_PYTHON=$ANACONDA_ROOT/bin/ipython
export PYSPARK_PYTHON=$ANACONDA_ROOT/bin/python

export PATH=$ANACONDA_ROOT/bin:$PATH
export PATH=$SPARK_HOME/bin:$PATH

alias jupyter='PYSPARK_DRIVER_PYTHON_OPTS="notebook" pyspark'
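
Before sourcing this file, it can help to confirm that the directories it references actually exist, since the package install locations may differ between systems. The following is a minimal sketch, assuming the /opt/apache-spark and /opt/anaconda paths shown above; check_dir is a hypothetical helper written for this example, not something Spark or Anaconda provides.

```shell
# Hypothetical helper: report whether a directory referenced in ~/.jupyterrc exists.
check_dir() {
    if [ -d "$1" ]; then
        echo "ok: $1"
    else
        echo "missing: $1" >&2
        return 1
    fi
}

# Paths assumed from the configuration above; adjust them if your
# packages were installed somewhere else.
check_dir /opt/apache-spark || true
check_dir /opt/anaconda || true
```

If either path is reported as missing, update SPARK_HOME or ANACONDA_ROOT in ~/.jupyterrc to match the real install location.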

Step 4: Source .jupyterrc from the shell startup script.

echo "source ~/.jupyterrc" >> ~/.bashrc
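
Running the echo command more than once appends a duplicate line to ~/.bashrc each time. One way to avoid that is to append only when the line is not already present. This is a sketch under that assumption; append_once is a hypothetical helper name introduced for illustration.

```shell
# Hypothetical helper: append a line to a file only if it is not already there.
append_once() {
    line=$1
    file=$2
    # Create the file if it does not exist yet.
    touch "$file"
    # -q: quiet, -x: match the whole line, -F: treat the pattern as a fixed string.
    grep -qxF -- "$line" "$file" || printf '%s\n' "$line" >> "$file"
}
```

Usage: append_once 'source ~/.jupyterrc' ~/.bashrc. Calling it repeatedly leaves a single copy of the line in the file.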

Step 5: Load the variables into the current shell session.

source ~/.jupyterrc

Step 6: Start Jupyter.

jupyter

Step 7: Open http://localhost:8888 in a browser.

Notebooks
