Expanding the benchmark coverage of this repository for all the toolkits #143

braceletboy · 2019-12-19T16:37:06Z

@zoq @rcurtin I felt that this is a fantastic project where people can find which ml-toolkits are better for certain algorithms, and where the toolkits can improve themselves. So, I have been doing some work on my own that might be useful for this project. I have made a google sheet of the data I have been collecting in this regard. This google sheet contains:

the names of various machine learning and statistical analysis algorithms supported by the toolkits benchmarked in this repository
in which libraries they are found and in which libraries they aren't found
what are the API classes or functions that correspond to the algorithms
which algorithms have benchmarks and for what libraries are these benchmarks written.

I have till now covered all the algorithms provided by scikit-learn, mlpack and I am in the process of adding all the algorithms provided by Shogun into this list. This is a work in progress. I am going to add more algorithms to this list in the coming future and hopefully complete this. This is the google sheet that I am preparing:

I this regards I have some questions:
a) Is the aim of this project limited to benchmarking the algorithms supported mlpack? If no, I feel that having a sheet like this one, would help. (I got the idea of consolidating all this in a google sheet after I saw a google sheet on tensorflow's github when they were making tensorflow 2.0 and had to list all the API classes that needed some specific change).
b) Also, would it be possible for contributors from mlpack to also contribute to this sheet? I can give edit access. Currently, there are around 166 algorithms that are already listed with many more algorithms not covered and I haven't yet gone through all the library APIs. Would appreciate the help :)

rcurtin · 2019-12-24T02:08:38Z

a) Is the aim of this project limited to benchmarking the algorithms supported mlpack? If no, I feel that having a sheet like this one, would help. (I got the idea of consolidating all this in a google sheet after I saw a google sheet on tensorflow's github when they were making tensorflow 2.0 and had to list all the API classes that needed some specific change).

When we originally started on this project (it was @zoq's GSoC project many years ago :)) the idea was to use this benchmarking system to compare mlpack's implementations against other implementations. But it's grown somewhat since then, and honestly, it's a pretty general-purpose benchmarking system, so I don't see any need to limit only to algorithms that mlpack supports.

b) Also, would it be possible for contributors from mlpack to also contribute to this sheet? I can give edit access. Currently, there are around 166 algorithms that are already listed with many more algorithms not covered and I haven't yet gone through all the library APIs. Would appreciate the help :)

Sure, I would imagine that there would be some interest. You might try posting it on the mlpack chat channel (IRC/Matrix/gitter/etc.): https://www.mlpack.org/community.html#real-time-chat

zoq · 2019-12-28T20:11:32Z

a) Is the aim of this project limited to benchmarking the algorithms supported mlpack? If no, I feel that having a sheet like this one, would help. (I got the idea of consolidating all this in a google sheet after I saw a google sheet on tensorflow's github when they were making tensorflow 2.0 and had to list all the API classes that needed some specific change).

Awesome, thanks for putting everything together.

b) Also, would it be possible for contributors from mlpack to also contribute to this sheet? I can give edit access. Currently, there are around 166 algorithms that are already listed with many more algorithms not covered and I haven't yet gone through all the library APIs. Would appreciate the help :)

Happy to help, just send you a request.

braceletboy · 2020-01-13T09:05:51Z

@zoq and @rcurtin Sorry for the late response. I was on an extended vacation. I have approved your request @zoq. Please have a look at it and let me know what you think about it. Also let me know if you have any questions.

zoq added the enhancement label Dec 28, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expanding the benchmark coverage of this repository for all the toolkits #143

Expanding the benchmark coverage of this repository for all the toolkits #143

braceletboy commented Dec 19, 2019

rcurtin commented Dec 24, 2019

zoq commented Dec 28, 2019

braceletboy commented Jan 13, 2020 •

edited

Expanding the benchmark coverage of this repository for all the toolkits #143

Expanding the benchmark coverage of this repository for all the toolkits #143

Comments

braceletboy commented Dec 19, 2019

rcurtin commented Dec 24, 2019

zoq commented Dec 28, 2019

braceletboy commented Jan 13, 2020 • edited

braceletboy commented Jan 13, 2020 •

edited