Massive Data Analysis with PySpark

Spark MapReduce for Matrix Multiplication
Google PageRank for Website Page Ranking
K-means Algorithm for Clustering
Locality-Sensitive Hashing for Finding Text Similarity
Girvan-Newman Algorithm for Social Network Clustering