
Add CrossEntropyLoss to classifier #1053

Open
wants to merge 2 commits into master
Conversation

amine759

I had to use CrossEntropyLoss for my use case, so I thought of creating this PR, since NLLLoss is the only loss function supported :).

@amine759 amine759 changed the title Add CrossEntropyLoss loss to classifier Add CrossEntropyLoss to classifier May 11, 2024
@BenjaminBossan
Collaborator

Thanks for proposing the PR. A few points:

  • We don't have type annotations in skorch. Even if we did, we would support more than those two criteria.
  • CrossEntropyLoss should already work with NeuralNetClassifier.
  • The actual addition of your PR seems to be converting one-hot encoded targets to label-encoded targets. I don't think that this is specifically related to CrossEntropyLoss, is it?
  • Also, AFAIK, sklearn does not generally accept one-hot encoded targets for classification, so I don't think skorch should either.
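
To illustrate the point about target encodings (a minimal sketch, not part of the PR): if targets happen to be one-hot encoded, they can be converted to label encoding on the user side before calling fit, which is what sklearn-style estimators expect anyway.

```python
import numpy as np

# Hypothetical one-hot encoded targets; sklearn-style estimators
# (and hence skorch classifiers) expect label-encoded targets instead.
y_onehot = np.array([
    [1, 0, 0],
    [0, 0, 1],
    [0, 1, 0],
])

# Convert to label encoding before passing to NeuralNetClassifier.fit.
y_labels = y_onehot.argmax(axis=1).astype(np.int64)
print(y_labels)  # [0 2 1]
```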

@amine759
Author

@BenjaminBossan Hi, thanks for your reply.
As far as I understood, it seemed we were restricted to using only the NLLLoss loss function with skorch.NeuralNetClassifier: when attempting to use any other loss function, like CrossEntropyLoss, we encounter an error indicating that the criterion parameter only accepts NLLLoss. Whereas for the optimizer, we can use anything other than the default SGD. I assumed this was because get_loss does not handle instances of torch.nn.CrossEntropyLoss, and since skorch doesn't use type annotations, I see now they weren't necessary in the first place. I could have just changed get_loss and that's it, right?
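
As background on why no special handling should be needed: torch.nn.CrossEntropyLoss is documented as LogSoftmax followed by NLLLoss, so a module returning raw logits with CrossEntropyLoss behaves like one returning log-softmax output with NLLLoss. A numpy-only sketch of that equivalence:

```python
import numpy as np

def log_softmax(logits):
    # numerically stable log-softmax along the class axis
    shifted = logits - logits.max(axis=1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))

def nll_loss(log_probs, targets):
    # mean negative log-likelihood of the true classes
    return -log_probs[np.arange(len(targets)), targets].mean()

rng = np.random.default_rng(0)
logits = rng.normal(size=(4, 3))   # hypothetical raw module outputs
targets = np.array([0, 2, 1, 1])   # label-encoded targets

# cross-entropy route: log-softmax then NLL, as CrossEntropyLoss does
ce = nll_loss(log_softmax(logits), targets)

# direct definition: -log of the softmax probability of the true class
probs = np.exp(log_softmax(logits))
ce_direct = -np.log(probs[np.arange(4), targets]).mean()

assert np.isclose(ce, ce_direct)
```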

@BenjaminBossan
Collaborator

When attempting to use any other loss function, like CrossEntropyLoss, we encounter an error indicating that the criterion parameter only accepts NLLLoss.

Could you please provide an example to reproduce this error? CE should absolutely work in skorch as is, e.g.:

import numpy as np
from sklearn.datasets import make_classification
from torch import nn

from skorch import NeuralNetClassifier


X, y = make_classification(1000, 20, n_informative=10, random_state=0)
X = X.astype(np.float32)
y = y.astype(np.int64)

class MyModule(nn.Module):
    def __init__(self, num_units=10, nonlin=nn.ReLU()):
        super().__init__()

        self.dense0 = nn.Linear(20, num_units)
        self.nonlin = nonlin
        self.dropout = nn.Dropout(0.5)
        self.dense1 = nn.Linear(num_units, num_units)
        self.output = nn.Linear(num_units, 2)

    def forward(self, X, **kwargs):
        X = self.nonlin(self.dense0(X))
        X = self.dropout(X)
        X = self.nonlin(self.dense1(X))
        X = self.output(X)
        return X


net = NeuralNetClassifier(
    MyModule,
    max_epochs=10,
    criterion=nn.CrossEntropyLoss(),
    lr=0.1,
    # Shuffle training data on each epoch
    iterator_train__shuffle=True,
)

net.fit(X, y)
y_proba = net.predict_proba(X)
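
Note that the module above returns raw logits; as far as I understand, predict_proba still yields probabilities here because skorch's predict_nonlinearity='auto' default applies a softmax when the criterion is CrossEntropyLoss. A numpy-only sketch of that post-processing:

```python
import numpy as np

def softmax(logits):
    # stable softmax over the class axis; this is the nonlinearity applied
    # to raw logits so that predict_proba returns valid probabilities
    e = np.exp(logits - logits.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

logits = np.array([[2.0, -1.0], [0.5, 0.5]])  # hypothetical module output
proba = softmax(logits)
assert np.allclose(proba.sum(axis=1), 1.0)  # each row sums to 1
```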
