
A question about calculating precision@k #2

Open
caolingyu opened this issue Apr 3, 2018 · 4 comments

@caolingyu

caolingyu commented Apr 3, 2018

Hello,

I have a question about the function precision_at_k in evaluation.py. I think the denominator should be the number of 1 predictions made among the top k predictions; however, in the code, the length of the top k is used. For example, if there is only 1 true prediction in the top 5, the denominator should be 1, but in this case it would still be 5.

Here is my modification:

import numpy as np

def precision_at_k(yhat, yhat_raw, y, k):
    #num true labels in top k predictions / num 1 predictions in top k 
    sortd = np.argsort(yhat_raw)[:,::-1]
    topk = sortd[:,:k]

    #get precision at k for each example
    vals = []
    for i, tk in enumerate(topk):
        if len(tk) > 0:
            num_true_in_top_k = y[i,tk].sum()
            denom = yhat[i,tk].sum()
            if denom == 0: # in case no 1 predictions were made in the top k
                vals.append(1)
            else:
                vals.append(num_true_in_top_k / float(denom))

    return np.mean(vals)
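
For reference, here is how I would call it on a small made-up batch (the scores and labels below are just toy values for illustration):

# toy batch: 2 examples, 6 labels (made-up values)
y = np.array([[1, 0, 1, 0, 1, 0],
              [0, 1, 0, 0, 1, 0]])
yhat_raw = np.array([[0.9, 0.1, 0.2, 0.8, 0.7, 0.3],
                     [0.6, 0.4, 0.1, 0.2, 0.9, 0.5]])
yhat = (yhat_raw >= 0.5).astype(int)  # binarize the scores with a 0.5 threshold

print(precision_at_k(yhat, yhat_raw, y, 5))  # per-example values 1.0 and 2/3, mean ~0.83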

Could you take a look at it? Correct me if I am wrong.

@jamesmullenbach
Owner

For example, if there is only 1 true prediction in the top 5, the denominator should be 1, but in this case it would still be 5.

Do you mean if there's only one true positive code? Precision @ k is generally defined as the fraction of the k highest-scored labels that are in the set of ground truth labels (it is defined this way in our paper, at least). So we always want the denominator to be k, even if there are fewer than k ground truth labels.
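
In code, that definition looks roughly like this (a simplified single-example sketch with an illustrative name, not the exact evaluation.py implementation):

import numpy as np

def precision_at_k_single(yhat_raw_row, y_row, k):
    # indices of the k highest-scored labels for one example
    topk = np.argsort(yhat_raw_row)[::-1][:k]
    # fraction of those k labels that are in the ground truth set
    return y_row[topk].sum() / float(k)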

@caolingyu
Author

Thanks for your reply.
I mean that, in general, the denominator should be the number of positive predictions the model makes among the top k predictions.
For example, suppose the binary predictions for the top 5 labels are [1,0,0,1,1] and the corresponding true labels are [1,0,1,0,1]. I think the precision @ 5 in this case should be 2/3, but in the code it will be 2/5.
I also notice that in the code comment you define precision @ k as 'num true labels in top k predictions / num 1 predictions in top k', where the denominator is not k.
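
To spell out the arithmetic on that toy example (reading the numerator as the correctly predicted positives in both cases):

import numpy as np

top5_preds = np.array([1, 0, 0, 1, 1])  # binary predictions for the 5 highest-scored labels
top5_true  = np.array([1, 0, 1, 0, 1])  # ground truth for those same 5 labels

correct = (top5_preds * top5_true).sum()  # 2 labels are predicted 1 and actually 1

print(correct / float(top5_preds.sum()))  # 2/3 -- denominator = num 1 predictions in top 5
print(correct / 5.0)                      # 2/5 -- denominator = k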

@jamesmullenbach
Owner

Ah, I see what you mean now. Yes, you can define it that way. I'm not actually sure what the standard practice is in information retrieval, which is where I think this metric is most prevalent. I suspect the difference will not be major for this ICD coding task, as a trained model will usually predict more than 8 codes. For internal consistency I think I will update that comment, but keep the implementation as it is for now.

@simon19891101

The original implementation doesn't seem to be wrong. This article may help: https://medium.com/@m_n_malaeb/recall-and-precision-at-k-for-recommender-systems-618483226c54
