Hadoop tasks repository for Parallel and Distributed Computing course at MIPT 2015

penguin138/hadoop-tasks

hadoop-tasks

Hadoop tasks repository for Parallel and Distributed Computing course at MIPT 2015

Contains code for the following tasks:

  • Word Count
  • Inverted Index
  • Matrix Multiplication
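For readers unfamiliar with the first two tasks, here is a minimal sketch of how Word Count and Inverted Index are usually expressed as map and reduce functions. It is not the code from this repository: `run_mapreduce` is a hypothetical in-memory stand-in for Hadoop's map, shuffle and reduce phases, and the mapper/reducer names and sample documents are assumptions made for illustration.

```python
from collections import defaultdict


def run_mapreduce(records, mapper, reducer):
    """Hypothetical in-memory stand-in for Hadoop: map, group by key, reduce."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)  # "shuffle": collect values per key
    return {key: reducer(key, values) for key, values in sorted(groups.items())}


# Word Count: emit (word, 1) for every word, then sum the ones per word.
def wc_map(record):
    _doc_id, text = record
    for word in text.lower().split():
        yield word, 1


def wc_reduce(_word, counts):
    return sum(counts)


# Inverted Index: emit (word, doc_id) once per document, then list the documents.
def ii_map(record):
    doc_id, text = record
    for word in set(text.lower().split()):
        yield word, doc_id


def ii_reduce(_word, doc_ids):
    return sorted(doc_ids)


# Made-up sample input: (document id, document text) pairs.
docs = [
    ("d1", "hadoop runs map and reduce"),
    ("d2", "map tasks feed reduce tasks"),
]

word_counts = run_mapreduce(docs, wc_map, wc_reduce)
inverted_index = run_mapreduce(docs, ii_map, ii_reduce)

print(word_counts)
print(inverted_index)
```

On a real cluster the grouping step is done by Hadoop's shuffle, and the mapper and reducer run as separate tasks on different nodes; only the two small functions per task carry the problem-specific logic.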

Speed-up achieved for Matrix Multiplication

On a 4-node Hadoop cluster, Matrix Multiplication runs in about 1.5 min on 500x1000 and 1000x2000 matrices, while a sequential version of the program, written in Python, takes about 5 min (a speed-up of roughly 3.3x).
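The sketch below shows one common way to phrase matrix multiplication as a single map/reduce pass, next to a naive sequential baseline. It is an assumption about the general technique, not the repository's implementation, which may use a different scheme (for example a two-stage job). The function names and the tiny 2x3 and 3x2 matrices are made up for illustration; the last line only reproduces the speed-up arithmetic from the timings quoted above.

```python
from collections import defaultdict


def multiply_sequential(a, b):
    """Naive triple-loop baseline: C[i][j] = sum over k of A[i][k] * B[k][j]."""
    n, m, p = len(a), len(b), len(b[0])
    return [
        [sum(a[i][k] * b[k][j] for k in range(m)) for j in range(p)]
        for i in range(n)
    ]


def multiply_mapreduce(a, b):
    """One-pass map/reduce formulation, simulated in memory (assumed scheme)."""
    n, m, p = len(a), len(b), len(b[0])
    groups = defaultdict(list)

    # Map: send every cell of A and B to each output cell (i, j) that needs it.
    for i in range(n):
        for k in range(m):
            for j in range(p):
                groups[(i, j)].append(("A", k, a[i][k]))
    for k in range(m):
        for j in range(p):
            for i in range(n):
                groups[(i, j)].append(("B", k, b[k][j]))

    # Reduce: for each output cell, pair values by k and sum the products.
    result = [[0] * p for _ in range(n)]
    for (i, j), values in groups.items():
        a_row = {k: v for tag, k, v in values if tag == "A"}
        b_col = {k: v for tag, k, v in values if tag == "B"}
        result[i][j] = sum(a_row[k] * b_col[k] for k in a_row)
    return result


# Made-up sample matrices (2x3 times 3x2).
a = [[1, 2, 3], [4, 5, 6]]
b = [[7, 8], [9, 10], [11, 12]]

product = multiply_mapreduce(a, b)
print(product)

# Speed-up from the timings quoted above: about 5 min sequential vs 1.5 min on Hadoop.
speedup = 5.0 / 1.5
print(round(speedup, 1))
```

This formulation replicates each input cell many times, so it trades extra shuffle traffic for a single job. Each reduce key is independent of the others, which is what lets the work spread across the nodes of the cluster.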
