sahild22/Introduction-to-Big-Data-Final-Project

Introduction-to-Big-Data-Final-Project

## Commands to run the Oozie workflow in fully distributed mode

  1. Copy the oozie folder and all flight data (the flightData folder) onto the AWS instance.

  2. Connect to the instance from a terminal:

         ssh -i "key1.pem" ec2-user@ec2-35-163-67-151.us-west-2.compute.amazonaws.com

  3. Format the NameNode (first run only, since formatting erases existing HDFS metadata):

         hadoop/bin/hadoop namenode -format

  4. Start the Hadoop services (HDFS first, then YARN):

         hadoop/sbin/start-dfs.sh
         hadoop/sbin/start-yarn.sh

  5. Put the folder containing all input files on HDFS:

         hadoop/bin/hadoop fs -put flightData input1

  6. Put the oozie folder on HDFS:

         hadoop/bin/hadoop fs -put oozie flight

  7. Check that the Oozie server is up:

         oozie admin -oozie http://172.31.16.104:11000/oozie -status

  8. Run the Oozie workflow:

         oozie job -oozie http://172.31.16.104:11000/oozie -config job.properties -run

  9. Check the status of the Oozie job, using the job ID printed by the previous command:

         oozie job -oozie http://172.31.16.104:11000/oozie -info <job_Id>
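
The last two steps above (submit the workflow, then check its status) can be chained so the job ID does not have to be copied by hand. The following is a minimal sketch, not part of the original project. The helper names (`extract_job_id`, `extract_status`, `is_terminal`, `run_and_wait`) are hypothetical. It assumes that `oozie job -run` prints a line of the form `job: <id>` and that `oozie job -info` prints a line of the form `Status : <STATE>`, which is the usual CLI output but may differ between Oozie versions.

```shell
#!/usr/bin/env bash
# Sketch: submit the workflow and poll until it reaches a final state.
# The default URL is the one used in the steps above; override OOZIE_URL for your cluster.
OOZIE_URL="${OOZIE_URL:-http://172.31.16.104:11000/oozie}"

# Read CLI output on stdin and print the ID from the first "job: <id>" line (assumed format).
extract_job_id() {
  sed -n 's/^job:[[:space:]]*//p' | head -n 1
}

# Read `oozie job -info` output on stdin and print the workflow state
# from the first "Status : <STATE>" line (assumed format).
extract_status() {
  sed -n 's/^Status[[:space:]]*:[[:space:]]*\([A-Z_]*\).*/\1/p' | head -n 1
}

# Succeed (exit 0) only for states in which a workflow job has stopped for good.
is_terminal() {
  case "$1" in
    SUCCEEDED|KILLED|FAILED) return 0 ;;
    *) return 1 ;;
  esac
}

# Submit the workflow described by a properties file, then poll its status.
# $1: properties file (default job.properties); $2: poll interval in seconds (default 30).
run_and_wait() {
  local props="${1:-job.properties}"
  local interval="${2:-30}"
  local job_id status

  job_id="$(oozie job -oozie "$OOZIE_URL" -config "$props" -run | extract_job_id)"
  if [ -z "$job_id" ]; then
    echo "could not read a job ID from the oozie output" >&2
    return 1
  fi
  echo "submitted $job_id"

  while true; do
    status="$(oozie job -oozie "$OOZIE_URL" -info "$job_id" | extract_status)"
    if [ -z "$status" ]; then
      echo "could not read a status for $job_id" >&2
      return 1
    fi
    echo "status: $status"
    if is_terminal "$status"; then
      break
    fi
    sleep "$interval"
  done

  # Report success only if the workflow itself succeeded.
  [ "$status" = "SUCCEEDED" ]
}
```

To use it, source the file on the instance from the directory that holds job.properties and call `run_and_wait job.properties 30`. The function returns a non-zero exit code if the workflow ends in KILLED or FAILED, or if the CLI output cannot be parsed.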

About

This is the final project of Introduction to Big Data class.
