Add documentation for `LogisticRegression` and `SoftmaxRegression` #3564

rcurtin · 2023-11-24T19:00:54Z

Similar to #3560, I added documentation for the logistic regression and softmax regression classifiers. Compiled documentation as HTML can be seen here:

A number of changes were necessary to make all of these things work (as usual...):

Added MatType template parameter to SoftmaxRegression and used throughout.
Modified LogisticRegression and SoftmaxRegression to hold weights that have the same element type as MatType.
Added consistent overloads of constructors and Train(), which all can optionally take instantiated ensmallen optimizers and callbacks.
Added tests for all variants of constructors and Train().
Added Reset() functionality, which is needed because training is by default incremental.
Added reverse-compatible serialization implementations.

All examples were compiled and tested by hand... I would like to build an automated system to do this at some point, but I'll save that for another day.

…ect ensmallen types.

…ads.

…ameters.

…d Train().

shrit · 2023-12-04T16:46:22Z

any reason why the tests are failing ? the docs looks good, I need to go also through the code modification since it seems that there has been some modification and some functions have templates now. However, the failing tests seems to be unrelated

shrit · 2023-12-04T16:49:20Z

some of the links are not working especially related to the examples but I would say this should be resolved before merging right ?

rcurtin · 2023-12-08T17:01:57Z

any reason why the tests are failing ? the docs looks good, I need to go also through the code modification since it seems that there has been some modification and some functions have templates now. However, the failing tests seems to be unrelated

Yeah, it took me a little while to figure it out but I think the failing tests were Armadillo versioning issues. It should pass now... let's see...

some of the links are not working especially related to the examples but I would say this should be resolved before merging right ?

Yeah, links to other Markdown pages aren't working yet. I'll fix that once I put together some framework that puts all the documentation pages together; for now, it's ok that those links don't work. However if you find a link that is inside the page (like to a section or something) and that doesn't work, I should definitely fix that before merge.

…ons.

rcurtin · 2023-12-14T15:19:29Z

This one is ready for review too---the builds are passing now (except for the random failure on the Windows build).

shrit · 2023-12-14T20:42:59Z

Okay will look into this one as well tomorrow

shrit

This looks really good, it is amazing, I love these two pages, they are straightforward to the point, have no repetition and are easy to read. I discovered a lot of stuff I did not know at all.

It took me some time to go over it mainly because I wanted to click on each link to be sure that I did not miss anything.

Here are a couple of things that I have noticed:

The following links are not working in the two pages (the link to one another), I think they should be working before merging as we discussed above unless if I am missing something
I think the types here mean default type because we can use float and other types e,.g., arma::fmat. I was confused when I saw it because it seems as if it refers to the only type which is the case.
Maybe change type to default type or something similar? I know this should be done on all the pages 😒 but it could be worth it.

I see we have added tests, and I am a bit surprised they are passing with the fact that we are declaring numbers without specifying the type in some cases. I know the compiler on my machine is translating this to double when I was testing dbscan and knn. I would say would be nice to cast it or define variables to be safe, but you know better than me.

This could be also the reason we are having a nan here in the failing tests, but it is hard to deduct,

D:\a\1\s\src\mlpack\tests\main_tests\radical_test.cpp(30): FAILED:
due to unexpected exception with message:
  sort(): detected NaN

I do not know if Radical test are using logistic regression somewhere

src/mlpack/methods/logistic_regression/logistic_regression.hpp

src/mlpack/methods/logistic_regression/logistic_regression_function.hpp

src/mlpack/methods/logistic_regression/logistic_regression_function_impl.hpp

src/mlpack/methods/softmax_regression/softmax_regression.hpp

src/mlpack/methods/softmax_regression/softmax_regression_function_impl.hpp

src/mlpack/methods/softmax_regression/softmax_regression_impl.hpp

src/mlpack/tests/logistic_regression_test.cpp

src/mlpack/tests/softmax_regression_test.cpp

rcurtin · 2023-12-18T01:42:30Z

The following links are not working in the two pages (the link to one another), I think they should be working before merging as we discussed above unless if I am missing something

I'd actually rather wait to fix these links; I don't know quite how things should be organized overall quite yet, so I was planning to figure that out (hopefully this week), then fix all the links in the individual Markdown files all at once. Do you think that sounds reasonable?

I think the types here mean default type because we can use float and other types e,.g., arma::fmat. I was confused when I saw it because it seems as if it refers to the only type which is the case. Maybe change type to default type or something similar? I know this should be done on all the pages 😒 but it could be worth it.

I spent a long time thinking about it... I wanted to try and keep the documentation simple and use default types where possible. I could do default type, but I would have to denote in the template parameter section what all the types change to. Another idea would be to note something like this:

 * When using a custom `MatType`, the training parameters will change type:
 
 <table goes here>
 
 * The classification parameters will change type too:
  
  <table goes here>
  
  * `lr.Parameters()` will now return a `MatType&`...

That's just a basic idea, but it keeps the different type information constrained to the section where we talk about different MatTypes. What do you think?

I also thought about a drop-down selector box that could change types, but that gets really ugly and would mean that the Markdown wasn't easily readable anymore.

I see we have added tests, and I am a bit surprised they are passing with the fact that we are declaring numbers without specifying the type in some cases. I know the compiler on my machine is translating this to double when I was testing dbscan and knn. I would say would be nice to cast it or define variables to be safe, but you know better than me.

Thanks for pointing this out... I saw the comments you left and I'll address them. As far as I know, the implicit casting of a double literal to a float isn't an issue, like if we did:

float x = 3.0;

and I think that's done several times in the tests. I won't worry about those instances. I read this on that point.

This could be also the reason we are having a nan here in the failing tests, but it is hard to deduct,
D:\a\1\s\src\mlpack\tests\main_tests\radical_test.cpp(30): FAILED:
due to unexpected exception with message:
 sort(): detected NaN
I do not know if Radical test are using logistic regression somewhere

RADICAL doesn't use logistic regression, so I think the failures are unrelated. I have actually been chasing that failure for a very long time and I've never been able to reproduce it. I wonder if there is a subtle memory error caused by another test, that then happens to result in the RADICAL test failing... but I am not sure.

rcurtin · 2023-12-18T02:32:01Z

@shrit thank you so much for the comprehensive review! It is really helpful. I think I have responded to everything; let me know if not, and let me know what you think of the comments. I am pretty tired right now so I may have overlooked something (I probably should have just gone to bed, but oh well, too late now 😄)

shrit

Looks good to me

shrit · 2023-12-18T10:06:59Z

Regarding the solution for default type, I do not have one, I want something really simple to understand without the need to add more tables or anything.
Maybe if we keep having an example for f32 in the end that would explain that these parameters are not hard coded and are templatized.
Let us keep them this way, maybe we will figure something out in the future for everything

Also I do not want us to mention MatType in the doc, it could scare people a bit, I would go for hiding it completely, if someone is looking for f32, they will find it in the doc, eventually they will understand it themselves at some point

mlpack-bot

Second approval provided automatically after 24 hours. 👍

rcurtin · 2023-12-19T19:27:33Z

Regarding the solution for default type, I do not have one, I want something really simple to understand without the need to add more tables or anything.

I opened #3584 so that the idea doesn't get lost. I have some ideas, but it may be a little while until I return to it.

rcurtin · 2023-12-21T14:00:18Z

The Python tests will be fixed in #3587, so I will go ahead and merge this despite the failing builds. (The Go failure appears to be random.)

rcurtin added 30 commits November 15, 2023 15:58

Add first attempt at logistic regression documentation.

3475819

Add first attempt at adding missing methods required by documentation.

56f485a

Add detectors for ensmallen optimizers and callbacks.

c407c53

Add additional Train() overloads and constructors, with safety to det…

b36213b

…ect ensmallen types.

Clean up signature of function.

7d164d2

Use rowvec for now to match other methods.

1746970

Use std::forward to correctly pass callback arguments to other overlo…

eaa0017

…ads.

Add tests for new versions of constructors, Train(), and Classify().

a051fd5

Allow storing different types for the internal LogisticRegression par…

6ea3300

…ameters.

Clean up test.

c5e3c73

Fix examples so that they compile and run.

dbdc338

Minor cleanups.

62726fd

Minor cleanups.

fd1f16c

First attempt at softmax regression documentation.

eafbf2e

Remove unnecessary line.

fe613c9

Include workaround for allowing templated versions for serialization.

a136a1c

Correct handling for older cereal versions.

e469594

Fix other use of cereal macro not available in older versions.

5fec290

Fix serialization for reverse compatibility.

bc0b2bb

First step at updating softmax regression implementation.

9556fb6

Fix matrix type.

0a2c370

Add a test for LogisticRegression::Reset().

1f8c347

Adapt return type to the matrix element type for Train().

aaf8afa

Add a utility to get a sparse matrix type for a given matrix type.

7431f63

Add MatType template parameter to softmax regression.

4e7d866

Adapt SoftmaxRegression type to new template parameter.

0dcd96d

Fix serialization bug for legacy versions.

aa37b48

Fix minor implementation bugs.

2652b98

Unify constructor and Train() with LogisticRegression constructors an…

55ebd72

…d Train().

Don't use deprecated Classify().

1c68158

rcurtin added 4 commits November 24, 2023 13:44

Use dense matrix type for weights for softmax regression.

3954a68

Use column vectors for single-point probabilities.

72b5433

Fix examples.

cd51ca3

Fix minor typos in documentation.

d614ae8

rcurtin added the c: documentation label Nov 24, 2023

Fix missing separator.

5f1a3a2

Adapt for older Armadillo versions.

b97df54

rcurtin added 3 commits December 8, 2023 17:18

Use placeholder callback that has been in ensmallen since older versi…

2954c54

…ons.

Take a guess that might fix the build issue.

59de5bb

Don't use deprecated functions.

665d345

shrit requested changes Dec 15, 2023

View reviewed changes

rcurtin added 2 commits December 17, 2023 20:54

Fix comment accuracy.

3100965

Use safer types in arithmetic expressions.

24eeca4

shrit approved these changes Dec 18, 2023

View reviewed changes

mlpack-bot bot approved these changes Dec 19, 2023

View reviewed changes

rcurtin mentioned this pull request Dec 19, 2023

Documentation should make types clearer when using non-default MatTypes #3584

Open

Fix other places where we might incorrectly cast to double.

3090cf1

shrit approved these changes Dec 20, 2023

View reviewed changes

rcurtin mentioned this pull request Dec 21, 2023

Fix python test duplicate filename error #3587

Merged

rcurtin merged commit d678052 into mlpack:master Dec 21, 2023
15 of 19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add documentation for `LogisticRegression` and `SoftmaxRegression` #3564

Add documentation for `LogisticRegression` and `SoftmaxRegression` #3564

rcurtin commented Nov 24, 2023

shrit commented Dec 4, 2023

shrit commented Dec 4, 2023

rcurtin commented Dec 8, 2023

rcurtin commented Dec 14, 2023

shrit commented Dec 14, 2023

shrit left a comment

rcurtin commented Dec 18, 2023

rcurtin commented Dec 18, 2023

shrit left a comment

shrit commented Dec 18, 2023 •

edited

mlpack-bot bot left a comment

rcurtin commented Dec 19, 2023

rcurtin commented Dec 21, 2023

Add documentation for LogisticRegression and SoftmaxRegression #3564

Add documentation for LogisticRegression and SoftmaxRegression #3564

Conversation

rcurtin commented Nov 24, 2023

shrit commented Dec 4, 2023

shrit commented Dec 4, 2023

rcurtin commented Dec 8, 2023

rcurtin commented Dec 14, 2023

shrit commented Dec 14, 2023

shrit left a comment

Choose a reason for hiding this comment

rcurtin commented Dec 18, 2023

rcurtin commented Dec 18, 2023

shrit left a comment

Choose a reason for hiding this comment

shrit commented Dec 18, 2023 • edited

mlpack-bot bot left a comment

Choose a reason for hiding this comment

rcurtin commented Dec 19, 2023

rcurtin commented Dec 21, 2023

Add documentation for `LogisticRegression` and `SoftmaxRegression` #3564

Add documentation for `LogisticRegression` and `SoftmaxRegression` #3564

shrit commented Dec 18, 2023 •

edited