Skip to content

Word co-occurrence and Matrix Multiplication using MapReduce

Notifications You must be signed in to change notification settings

SinghHarshita/MapReduce-Examples

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 

Repository files navigation

MapReduce Examples

There are two examples demonstrated in this repository :

  1. Word Co-occurrence
  2. Matrix Multiplication

The codes can be found in the respective directories along with the execution details.

These programs were written as a part of Big Data Analytics Lab.

What is MapReduce?

MapReduce is a programming paradigm that enables massive scalability across hundreds or thousands of servers in a Hadoop cluster. As the processing component, MapReduce is the heart of Apache Hadoop. The term "MapReduce" refers to two separate and distinct tasks that Hadoop programs perform. The first is the map job, which takes a set of data and converts it into another set of data, where individual elements are broken down into tuples (key/value pairs). The reduce job takes the output from a map as input and combines those data tuples into a smaller set of tuples. As the sequence of the name MapReduce implies, the reduce job is always performed after the map job.

More information can be found here.