Skip to content

Latest commit

 

History

History
25 lines (19 loc) · 662 Bytes

README.md

File metadata and controls

25 lines (19 loc) · 662 Bytes

MapReduce Examples

MapReduce is the key programming model for data processing in the Hadoop ecosystem. This repository is used to collect the problems applicable by MapReduce.

  • Summarization Patterns

    • Word Count
    • Inverted Index (demo Tool, ToolRunner)
    • Matrix-vector Multiplication (demo MultipleInputs)
    • Matrix-matrix Multiplication
  • Filtering Patterns

    • Anagram
    • Top K
    • Sentiment Analysis
  • Organization Patterns

    • Partial Sort
    • Secondary Sort
  • Join Patterns

  • Metapatterns

    • NGramAutocomplete
    • Page Rank
    • Recommender System