[v2 BUG]: Cross entropy loss is too big #845

Open
KnathanM opened this issue Apr 29, 2024 · 1 comment

@KnathanM (Contributor)

chemprop.nn.MulticlassClassificationFFN sets n_tasks = n_tasks * n_classes. This is a problem because the default task weights then become task_weights = torch.ones(n_tasks), i.e. a tensor with n_tasks * n_classes entries. F.cross_entropy in CrossEntropyLoss reduces the n_tasks * n_classes predicted values down to n_tasks loss values, but L = L * self.task_weights.view(1, -1) then broadcasts those n_tasks loss values back out to n_tasks * n_classes, effectively multiplying the loss by n_classes when we take L.sum().

This doesn't break anything so I'll plan to fix it when we take another look at loss functions and metrics.
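
For reference, a minimal standalone sketch of the broadcasting (made-up shapes, plain PyTorch rather than chemprop code) showing how the summed loss gets inflated by n_classes in the default single-task case:

import torch

# assumed shapes for illustration: one task, three classes, batch of four
n_tasks, n_classes, batch_size = 1, 3, 4

# F.cross_entropy reduces the predictions to one loss value per task
L = torch.ones(batch_size, n_tasks)

# because n_tasks was overwritten as n_tasks * n_classes, the default
# task weights have n_tasks * n_classes entries
task_weights = torch.ones(n_tasks * n_classes)

# (batch_size, 1) * (1, n_classes) broadcasts to (batch_size, n_classes),
# so summing triples the loss here
weighted = L * task_weights.view(1, -1)
print(L.sum().item())         # 4.0
print(weighted.sum().item())  # 12.0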

@KnathanM KnathanM added the bug Something isn't working label Apr 29, 2024
@KnathanM KnathanM added this to the v2.0.1 milestone Apr 29, 2024
@KnathanM KnathanM self-assigned this Apr 29, 2024
@davidegraff (Contributor)

Specifically, the problem starts here, in MulticlassClassificationFFN.__init__:

def __init__(
    self,
    n_classes: int,
    n_tasks: int = 1,
    input_dim: int = DEFAULT_HIDDEN_DIM,
    hidden_dim: int = 300,
    n_layers: int = 1,
    dropout: float = 0.0,
    activation: str = "relu",
    criterion: LossFunction | None = None,
    task_weights: Tensor | None = None,
    threshold: float | None = None,
    output_transform: UnscaleTransform | None = None,
):
    super().__init__(
        n_tasks * n_classes,  # the base class receives n_tasks * n_classes as its "n_tasks"
        input_dim,
        hidden_dim,
        n_layers,
        dropout,
        activation,
        criterion,
        task_weights,
        threshold,
        output_transform,
    )
    self.n_classes = n_classes

and continues here, in _FFNPredictorBase.__init__:

def __init__(
    self,
    n_tasks: int = 1,
    input_dim: int = DEFAULT_HIDDEN_DIM,
    hidden_dim: int = 300,
    n_layers: int = 1,
    dropout: float = 0.0,
    activation: str = "relu",
    criterion: LossFunction | None = None,
    task_weights: Tensor | None = None,
    threshold: float | None = None,
    output_transform: UnscaleTransform | None = None,
):
    super().__init__()
    self.save_hyperparameters()
    self.hparams["cls"] = self.__class__
    self.ffn = MLP.build(
        input_dim, n_tasks * self.n_targets, hidden_dim, n_layers, dropout, activation
    )
    # here n_tasks has already been multiplied by n_classes, so the default
    # weight tensor gets n_tasks * n_classes entries instead of n_tasks
    task_weights = torch.ones(n_tasks) if task_weights is None else task_weights
    self.criterion = criterion or Factory.build(
        self._T_default_criterion, task_weights=task_weights, threshold=threshold
    )

IMO it will probably be easier to fix this by abstracting the multiclass FFN into its own Predictor subclass rather than abusing the _FFNPredictorBase class like we do currently.
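
For illustration only, a rough sketch of what such a dedicated subclass could look like; the class body and parameter list are assumptions pieced together from the snippets above (reusing the same chemprop names: Predictor, MLP, Factory, LossFunction), not the actual fix:

class MulticlassClassificationFFN(Predictor):
    """Sketch: owns its own MLP, so no base FFN ever sees n_tasks * n_classes as a task count."""

    def __init__(
        self,
        n_classes: int,
        n_tasks: int = 1,
        input_dim: int = DEFAULT_HIDDEN_DIM,
        hidden_dim: int = 300,
        n_layers: int = 1,
        dropout: float = 0.0,
        activation: str = "relu",
        criterion: LossFunction | None = None,
        task_weights: Tensor | None = None,
    ):
        super().__init__()
        self.n_classes = n_classes
        self.n_tasks = n_tasks
        # the output layer is still n_tasks * n_classes wide ...
        self.ffn = MLP.build(
            input_dim, n_tasks * n_classes, hidden_dim, n_layers, dropout, activation
        )
        # ... but the task weights keep length n_tasks, so the per-task losses
        # are no longer broadcast against n_tasks * n_classes weights
        task_weights = torch.ones(n_tasks) if task_weights is None else task_weights
        self.criterion = criterion or Factory.build(
            self._T_default_criterion, task_weights=task_weights
        )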
