Skip to content

YARN | MapReduce | Cascading | Hadoop Application for processing Student data file and Finding the various matrices like average marks, total marks and Sinking data to ElasticSearch | There are 5 branches each for Cascading-2.6,Cascading2.5,MapReduce-1,MapReduce2,and YARN respectively. Please checkout the wanted branch and let me know for any is…

License

Notifications You must be signed in to change notification settings

mehikmat/BigSchool

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

67 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BigSchool endorse Build Status

MapReduce-2 Application for processing raw data file and counting words

Technologies: Cloudera Hadoop CDH5,Maven, IntelliJ IDEA

###NOTE:MRv1 & MRv2 compatibility###

In general, the previous MapReduce runtime (aka MRv1) has been reused and no major surgery has been conducted on it.
Therefore, MRv2 is able to ensure satisfactory compatibility with MRv1 applications.
However, due to some improvements and code refactorings, a few APIs have been rendered backward-incompatible.

NEW API in package: org.apache.hadoop.mapreduce;
OLD API in package: org.apache.hadoop.mapred;

Prerequisites

  1. Java-1.7
  2. Maven-2/3
  3. git
  4. Hadoop-2.3.0

How to run

$ git clone https://github.com/mehikmat/BigSchool.git

$ cd BigSchool

$ git checkout mr2-app

$ mvn clean package

$ yarn jar target/BigSchoolMapReduce-1.0.jar input/input.txt output/output.txt

  OR

$ sh runner.sh

In case you get an error of type "Not valid JAR", check the jar path. It might be different from one plateform to another with mvn.

Browse http://localhost:8088 for job status

MapReduce App Architecture

MapReduce App Architecture

About

YARN | MapReduce | Cascading | Hadoop Application for processing Student data file and Finding the various matrices like average marks, total marks and Sinking data to ElasticSearch | There are 5 branches each for Cascading-2.6,Cascading2.5,MapReduce-1,MapReduce2,and YARN respectively. Please checkout the wanted branch and let me know for any is…

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published