A small tool created to assist a peer with comparing a large list of strings stored in a csv. The goal was to make a comparison between each string in the list using a handful of different metrics. I created this repository in the interest of attempting to increase the efficiency of these comparisons.
No benchmarking was done as this was thrown together quickly to be used, however, it was a fun exercise in spawning threads and managing some shared state in Rust. There are certainly many ways this could be optimized far better which I did not explore.
The current state of the main()
function is set up to receive arguments on the commandline indicating
which version should be run. It should be noted, however, that the output directory, output
filenames, and input dataset filename are NOT configurable from the commandline. These would need to be
modified in the thread.rs and data.rs files.