Skip to content

harrisonzhao/distributedKNN

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

88 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

distributedKNN

Distributed system for the K-nearest-neighbors algorithm. It supports backups for the master, accumulator, and consumer nodes.

Structure:

  • Master starts up and waits for a number of consumers to connect.
  • Client sends to master feature vectors
  • Master broadcasts to consumers
  • Consumers compute euclidean distance, send top k results to aggregator
  • Accumulator compiles the results and returns the top k
  • Accumulator finds most frequent and sends to master ID and category
  • Master sends results back to consumer
  • Consumer writes results to file

To compile: make

To run:

  • Master: make runMaster
  • Consumer: make runConsumer
  • Client: make runClient
  • Accumulator: make runAcc

To clean:

  • make clean

To edit the arguments

  • open pom.xml
  • Go down to the profiles section where you'll see profiles for the master, consumer, client, and accumulator. In each profile, there's a list of tags with the argument description next to it. Add your argument here.

About

distributed system for the K-nearest-neighbors algorithm

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages