[QUESTION] Chapter 4 Exercise Question 12 - cost function with l2 regularization seems incorrect #118

wowthecoder · 2023-12-26T19:46:19Z

When attempting the question, there is a bonus part to add l2 regularization to the softmax regression code (In [75]):

According to the book, in the section about Ridge Regression, we are supposed to add ($\dfrac{\alpha}{m}$ * sum of thetas) to the original cost function. However in line 2 in the picture above, l2_loss is somehow calculated with 1/2 multiplied at the front. Shouldn't it be 1/m instead?

According to the same section of the book, we should add $2\alpha w / m$ to the MSE gradient vector. So in line 3 of the picture above, shouldn't it be 2 * alpha * Theta[1:] / m instead?

Maybe this is why the validation loss suddenly increased a lot when the regularization is applied.

If this is indeed a typo, the bottom sections involving the hyperparameter C also has to be changed.

The text was updated successfully, but these errors were encountered:

wowthecoder · 2024-01-01T20:02:58Z

Can someone clarify on this?

tooniesnguyen · 2024-03-02T15:52:58Z

I'm also curious about the same. But I think the more correct sklearn formula in the book is wrong

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QUESTION] Chapter 4 Exercise Question 12 - cost function with l2 regularization seems incorrect #118

[QUESTION] Chapter 4 Exercise Question 12 - cost function with l2 regularization seems incorrect #118

wowthecoder commented Dec 26, 2023

wowthecoder commented Jan 1, 2024

tooniesnguyen commented Mar 2, 2024

[QUESTION] Chapter 4 Exercise Question 12 - cost function with l2 regularization seems incorrect #118

[QUESTION] Chapter 4 Exercise Question 12 - cost function with l2 regularization seems incorrect #118

Comments

wowthecoder commented Dec 26, 2023

wowthecoder commented Jan 1, 2024

tooniesnguyen commented Mar 2, 2024