Question about the loss function of Tf-reg KD #24

HowieMa · 2021-03-07T23:46:30Z

Hi, thank you for sharing such an awesome project.
For the TF-reg KD, in line 47 of my_loss_function.py, should we also divide the temperature T on the output variable, like:
loss_soft_regu = nn.KLDivLoss()(F.log_softmax(outputs / T, dim=1), F.softmax(teacher_soft/T, dim=1))*params.multiplier

As in Eq (9) of your paper, the loss function is $$D_{KL}(p^d_\tau, p_\tau)$$.

I would really appreciate it if you could help me. Look forward to your reply, thanks!

The text was updated successfully, but these errors were encountered:

DLoveS1314 · 2022-04-07T07:04:42Z

I am also very confused about this issue, looking forward to the author's answer
#19 answer your question

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about the loss function of Tf-reg KD #24

Question about the loss function of Tf-reg KD #24

HowieMa commented Mar 7, 2021 •

edited

DLoveS1314 commented Apr 7, 2022 •

edited

Question about the loss function of Tf-reg KD #24

Question about the loss function of Tf-reg KD #24

Comments

HowieMa commented Mar 7, 2021 • edited

DLoveS1314 commented Apr 7, 2022 • edited

HowieMa commented Mar 7, 2021 •

edited

DLoveS1314 commented Apr 7, 2022 •

edited