Skip to content

Latest commit

 

History

History
108 lines (85 loc) · 3.62 KB

README.md

File metadata and controls

108 lines (85 loc) · 3.62 KB

Elephant Scale Sandbox

What is it?

This is a virtual machine sandbox image to practice and learn Big Data and Data Science applications.

Why?

Running Big Data applications (Spark / Cassandra / Hadoop) can be a little convoluted because of all the dependencies. This can be even more of a hassle in Windows. We hope this VM Sandbox will make things easier.

Provided By Elephant Scale

Elephant Scale teaches Big Data & AI / Data Science classes. This sandbox is a replica of our virtualized environment.

Looking for training?

Checkout our training classes in Big Data and Data Science

Read more about the Sanxbox

Where to get it?

Currently OVA based virtual machine image is available. Docker images coming 'soon'.
Note : These are LARGE downloads (10G+ in size). Download when you have good bandwidth.

How to run it?

  • You need a virtual machine 'player'. Any of these would work:
  • Download the latest sandbox image
  • Double click on the 'OVA' file open it.

Access

Login : student
password : bigdata123

See intro lab for a screencast.

Connectivity:

  • Use VM GUI : when you open this OVA file in a VM environment you will be logged into the Ubuntu desktop
  • SSH via port 22
  • from host machine
    $   ssh -l student  -p 2222 localhost

What can I run?

This VM is tested with following Big Data stack.

  • Spark v2.x and v1.6
  • BigDL 0.3+
  • Cassandra v3.x
  • Kafka v0.10
  • Storm v1.x
  • Zookeeper v3.4.8

Labs

If you are enrolled in our classes, you will get a lab bundle. Also you can run any open source labs as well.

Tutorials

  1. Introduction
  2. Running Spark
  3. Running Cassandra
  4. Choosing a Python version
  5. Running BigDL

Checkout our Sandbox channel for more videos.

Software Installed

Changelog

See version history in changelog

Feedback

We welcome your feedback about the sandbox.