Shouldn't Prototypical loss use torch.cdist instead of F.pairwise_distance? #169

Open
theolepage opened this issue Jan 26, 2023 · 2 comments

@theolepage

Hello,

I have a question about the following line used in the Prototypical loss computation:

```python
output = -1 * (F.pairwise_distance(out_positive.unsqueeze(-1), out_anchor.unsqueeze(-1).transpose(0, 2)) ** 2)
```

From my understanding, `output` should be analogous to the cosine similarity matrix used for the Angular Prototypical loss, but based on Euclidean distances instead.

Thus, the output tensor should have shape $(N, N)$ (with $N$ the number of samples in the mini-batch), and the value at $(i, j)$ should be the squared Euclidean distance between sample $i$ of `out_positive` and sample $j$ of `out_anchor`.

However, `F.pairwise_distance` computes the element-wise distance between corresponding vectors of `out_positive` and `out_anchor`, not the distance between every pair of row vectors from the two sets, as `torch.cdist` does.

[Image: visualization of the difference between F.pairwise_distance and F.cosine_similarity]

As a result, the output shape will be $(N, D)$ (with $D$ the output dimension of the model), and the subsequent loss computation is not coherent.
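A minimal sketch of the shape difference (not from the repository; the sizes are illustrative, and it assumes a recent PyTorch where `F.pairwise_distance` reduces along the last dimension):

```python
import torch
import torch.nn.functional as F

N, D = 4, 3  # hypothetical batch size and embedding dimension
out_positive = torch.randn(N, D)
out_anchor = torch.randn(N, D)

# Current code: (N, D, 1) broadcast against (1, D, N) gives (N, D, N),
# and pairwise_distance reduces the last dimension, leaving (N, D).
output = -1 * (F.pairwise_distance(
    out_positive.unsqueeze(-1),
    out_anchor.unsqueeze(-1).transpose(0, 2)) ** 2)
print(output.shape)  # torch.Size([4, 3])

# torch.cdist computes the distance between every pair of rows: shape (N, N).
dists = torch.cdist(out_positive, out_anchor, p=2)
print(dists.shape)  # torch.Size([4, 4])
```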

Thanks.

@lbjcom added the question label on Jan 31, 2023
@AlexGranger-scn

Same confusion here. Have you solved this problem? Thanks for your reply!

@deGennesMarc commented on Mar 15, 2024:

Same issue. In the spirit of @theolepage's suggestion, I replaced the line with:

```python
output = -torch.cdist(out_positive, out_anchor, p=2).pow(2)
```

but as of today it does not work for me.
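For context, a minimal self-contained sketch of how this cdist-based variant would feed the usual cross-entropy step; the `label` construction is an assumption about the surrounding training code, not taken from the repository:

```python
import torch
import torch.nn as nn

N, D = 4, 3  # hypothetical batch size and embedding dimension
out_anchor = torch.randn(N, D)
out_positive = torch.randn(N, D)

# Entry (i, j) scores positive i against anchor (prototype) j; higher means closer.
output = -torch.cdist(out_positive, out_anchor, p=2).pow(2)  # shape (N, N)

# Each positive should be closest to its own anchor, i.e. the diagonal entries.
label = torch.arange(N)
loss = nn.CrossEntropyLoss()(output, label)
print(loss.item())
```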

Also, it seems to me the definition of the prototypical loss in the "In defence of metric learning" paper is wrong, as there should be a minus sign in front of the distances $S_{j,k}$ in the softmax.
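Concretely, with $S_{j,k}$ the squared Euclidean distance between query $j$ and prototype $k$, the corrected softmax would read (my reconstruction of the point above, not a formula quoted from the paper):

$$L_p = -\frac{1}{N} \sum_{j=1}^{N} \log \frac{e^{-S_{j,j}}}{\sum_{k=1}^{N} e^{-S_{j,k}}}$$

which is consistent with the code negating the squared distances before the cross-entropy step.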
