
The loss computed in the last step is significantly smaller compared to the previous steps #709

Open
rayuron opened this issue Mar 5, 2024 · 0 comments


tfrs.tasks.Retrieval uses the tf.keras.losses.CategoricalCrossentropy loss function by default.
When num_hard_negatives is not set, the number of classes in tf.keras.losses.CategoricalCrossentropy equals the batch size, because every other example in the batch is used as a negative.

Usually, the last batch of an epoch is smaller than the configured batch size.
As a result, the loss computed in the last step of each epoch is smaller than the loss in the previous steps.

This confused me, because the per-epoch logs display only the loss from the last step, so the reported loss looks misleadingly small.
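The effect described above can be reproduced with a minimal NumPy sketch (this is not the tfrs implementation, just an illustration of in-batch softmax cross-entropy). For an untrained model with uninformative, all-equal scores, the softmax over a batch of size B is uniform, so the cross-entropy loss is log(B) — it shrinks automatically when the final batch is smaller:

```python
import numpy as np

def in_batch_softmax_loss(batch_size):
    """Categorical cross-entropy with in-batch negatives, assuming
    uninformative (all-zero) scores: each row's softmax is uniform
    over `batch_size` classes, so the expected loss is log(batch_size)."""
    scores = np.zeros((batch_size, batch_size))   # query-candidate score matrix
    labels = np.eye(batch_size)                   # diagonal entries are the positives
    # log-softmax over each row of candidates
    log_probs = scores - np.log(np.exp(scores).sum(axis=1, keepdims=True))
    return float(-(labels * log_probs).sum(axis=1).mean())

full_batch_loss = in_batch_softmax_loss(4096)  # a typical configured batch size
last_batch_loss = in_batch_softmax_loss(100)   # a smaller final batch
# log(4096) ~ 8.32 vs log(100) ~ 4.61: the last step reports a much
# smaller loss even though the model is equally (un)trained.
```

The batch sizes above are illustrative; the point is only that the loss scale depends on the number of in-batch negatives, so a smaller final batch yields a smaller loss regardless of model quality.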
