Skip to content

Deploying Hadoop HDFS & MR cluster on Multi Node across AWS, Azure & GCP using Terraform & Ansible Scripting Automation. Finally settling up Hadoop Client on local system and installing Pig & Hive to connect with the cluster.

License

Notifications You must be signed in to change notification settings

raktim00/Hadoop-Multi-Node-Cluster-on-Multi-Cloud-using-Terraform-Ansible

Repository files navigation

This project is an integration of different tools & technologies like AWS, Azure, GCP, Hadoop, Terraform, Ansible etc.

To see the practical demonstration follow the link : https://youtu.be/VB1jECOcJAk

Project Description :

Setting up Multi Node Hadoop HDFS & MR cluster on Multi Cloud. Here we are setting up total 6 instances across all three cloud. On AWS we are setting up NameNode & JobTracker. On GCP & Azure we are setting up DataNode & TaskTracker. On local node we are setting up Hadoop Client & then installing pig & hive software to work on our cluster. Refer to the below diagram to understand the infrastructure.

Hadoop_Multi_Node_Infrastructure

About

Deploying Hadoop HDFS & MR cluster on Multi Node across AWS, Azure & GCP using Terraform & Ansible Scripting Automation. Finally settling up Hadoop Client on local system and installing Pig & Hive to connect with the cluster.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published