Massive Data Analysis with PySpark

  • Spark MapReduce for Matrix Multiplication
  • Google PageRank for Website Page Ranking
  • K-means Algorithm for Clustering
  • Locality Sensitive Hashing for Finding Text Similarity
  • Girvan-Newman Algorithm for Social Network Clustering
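
As an illustration of the first topic, here is a minimal sketch of MapReduce-style matrix multiplication. It is not taken from this repository: the function name `matmul_mapreduce` and the `(row, col, value)` triple format are assumptions made for the example, and plain Python dictionaries stand in for Spark RDDs so the sketch runs without a cluster. In PySpark the same steps would typically be expressed with `join`, `map`, and `reduceByKey`.

```python
from collections import defaultdict


def matmul_mapreduce(a_entries, b_entries):
    """Multiply two sparse matrices given as (row, col, value) triples.

    Hypothetical helper for illustration; returns {(row, col): value}.
    """
    # Map phase: key every entry by the shared inner index j,
    # i.e. the column of A and the row of B.
    by_j = defaultdict(lambda: ([], []))
    for i, j, value in a_entries:
        by_j[j][0].append((i, value))
    for j, k, value in b_entries:
        by_j[j][1].append((k, value))

    # Join + reduce phase: for each j, pair every A entry with every
    # B entry, emit ((i, k), a * b), and sum the products per key.
    result = defaultdict(float)
    for a_side, b_side in by_j.values():
        for i, a in a_side:
            for k, b in b_side:
                result[(i, k)] += a * b
    return dict(result)


# Example: two dense 2x2 matrices written as triples.
A = [(0, 0, 1.0), (0, 1, 2.0), (1, 0, 3.0), (1, 1, 4.0)]
B = [(0, 0, 5.0), (0, 1, 6.0), (1, 0, 7.0), (1, 1, 8.0)]
C = matmul_mapreduce(A, B)
```

Keying on the inner index keeps each partial product local to one group, which is the reason this layout maps naturally onto a shuffle followed by a per-key sum.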