
Zeppelin for Spark + Hadoop (optional Hive)

This Dockerfile only builds Zeppelin, which depends on the other containers in "docker-spark-bde2020", including Hadoop, Spark, Hive, etc.

Pull Image

You can pull a pre-built image from https://hub.docker.com/r/openkbs/docker-spark-bde2020-zeppelin/

docker pull openkbs/docker-spark-bde2020-zeppelin
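
Once pulled, you can confirm the image is available locally and, if you only want to peek at the Zeppelin UI, start it on its own. This is just a sketch: the 8080:8080 mapping assumes Zeppelin's default web port, and a standalone run will not have the Hadoop/Spark backends described below.

docker images openkbs/docker-spark-bde2020-zeppelin
docker run -d --name zeppelin-standalone -p 8080:8080 openkbs/docker-spark-bde2020-zeppelin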

Build (if you want to build your own)

To build, run:

./build.sh
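
If you prefer to build and tag the image by hand instead of using the script, a roughly equivalent command is sketched below; the tag mirrors the Docker Hub repository above, and the exact name used by build.sh is an assumption.

docker build -t openkbs/docker-spark-bde2020-zeppelin .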

Run - "Zeppelin" Only

docker-compose -f docker-compose-hive.yml up -d zeppelin
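
After the service starts, you can confirm the container is running and follow its logs; the "zeppelin" service name matches the docker-compose-hive.yml entry used above.

docker-compose -f docker-compose-hive.yml ps zeppelin
docker-compose -f docker-compose-hive.yml logs -f zeppelin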

Run - The entire suite - Hadoop + Spark + (Hive) + Zeppelin + SparkNotebook + Hue

There are two options for running the entire suite of "docker-spark-bde2020":

  • start-hadoop-spark-workbench.sh (no Hive support)
  • start-hadoop-spark-workbench-with-hive.sh (with Hive support)

For example, to start the entire "docker-spark-bde2020" suite and Zeppelin with Hive support:

./start-hadoop-spark-workbench-with-hive.sh

Or, to start the entire "docker-spark-bde2020" suite and Zeppelin without Hive support:

./start-hadoop-spark-workbench.sh
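
After either script finishes, you can check that the containers are up and tear the suite down when done. The sketch below assumes the Hive variant and its docker-compose-hive.yml; substitute docker-compose.yml for the non-Hive case, and note the start scripts may involve additional compose files.

docker-compose -f docker-compose-hive.yml ps
docker-compose -f docker-compose-hive.yml down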

Reference to BDE2020 projects

To see how this container works with the entire big-data-europe/docker-hadoop-spark-workbench, explore the "./example-docker-spark-bde2020" directory, which contains the full suite build.

Docs

For example usage, see docker-compose.yml and the SANSA-Notebooks repository.

See Also

See the big-data-europe/docker-spark README.