Skip to content

garystafford/emr-superset-demo

Repository files navigation

Installing Apache Superset on Amazon EMR

Overview

Add data exploration and visualization to your analytics cluster. Project files for the post, Installing Apache Superset on Amazon EMR: Add data exploration and visualization to your analytics cluster. Please see post for complete instructions on using the project's files.

Create CloudFormation Stack

python3 ./create_cfn_stack.py \
    --ec2-key-name <your_key_pair_name> \
    --ec2-subnet-id <your_subnet_id> \
    --environment dev

Run Superset Bootstrap Script

python3 ./install_superset.py \
    --ec2-key-path </path/to/my-key-pair.pem> \
    --superset-port 8280

SSH Tunnel

Open an SSH tunnel to master node using dynamic port forwarding

ssh -i </path/to/my-key-pair.pem> -ND 8157 hadoop@<public_master_dns>

Troubleshooting Superset

Troubleshoot Superset process running on EMR Master.

lsof -i :8280

References

About

Project files for the post: Installing Apache Superset on Amazon EMR: Add data exploration and visualization to your analytics cluster.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published