simhash A thread-safe java implementation for generating SimHashes. Useful reading on the subject: https://web.archive.org/web/20120105154834/http://www.cs.princeton.edu/courses/archive/spring04/cos598B/bib/CharikarEstim.pdf https://web.archive.org/web/20130928112139/http://matpalm.com/resemblance/simhash/ https://web.archive.org/web/20140219034905/http://titouangalopin.com/blog/2013/11/simhash-or-the-way-to-compare-quickly-two-datasets