Skip to content

dylanmei/docker-zeppelin

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

80 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repo is DEPRECATED, Please refer to Apache Zeppelin Official Docker Image

zeppelin

A debian:jessie based Spark and Zeppelin Docker container.

This image is large and opinionated. It contains:

A prior build of dylanmei/zeppelin:latest contained Spark 1.6.0, Python 2.7, and all of the stock interpreters. That image is still available as dylanmei/zeppelin:0.6.0-stable.

simple usage

To start Zeppelin pull the latest image and run the container:

docker pull dylanmei/zeppelin
docker run --rm -p 8080:8080 dylanmei/zeppelin

Zeppelin will be running at http://${YOUR_DOCKER_HOST}:8080.

complex usage

You can use docker-compose to easily run Zeppelin in more complex configurations. See this project's ./examples directory for examples of using Zeppelin with docker-compose to :

  • read and write from local data files
  • read and write documents in ElasticSearch

onbuild

The Docker onbuild container is still a part of this project, but I have no plans to keep it updated. See the onbuild directory to view its Dockerfile.

To use it, create a new Dockerfile based on dylanmei/zeppelin:onbuild and supply a new, executable install.sh file in the same directory. It will override the base one via Docker's ONBUILD instruction.

The steps, expressed here as a script, can be as simple as:

#!/bin/bash
cat > ./Dockerfile <<DOCKERFILE
FROM dylanmei/zeppelin:onbuild

ENV ZEPPELIN_MEM="-Xmx1024m"
DOCKERFILE

cat > ./install.sh <<INSTALL
git pull
mvn clean package -DskipTests \
  -Pspark-1.5 \
  -Dspark.version=1.5.2 \
  -Phadoop-2.2 \
  -Dhadoop.version=2.0.0-cdh4.2.0 \
  -Pyarn
INSTALL

docker build -t my_zeppelin .

license

MIT

About

Docker build for Zeppelin, a web-based Spark notebook

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published