Replies: 1 comment 3 replies
-
Hi @vaslyb , PyKEEN's default evaluation follows the standard ranking-based setting from the literature: For each evaluation triple The same setting is applied in the classification evaluation, i.e., in most cases, we have a rather imbalanced set with more negative examples than positive ones. See also: https://pykeen.readthedocs.io/en/stable/tutorial/understanding_evaluation.html |
Beta Was this translation helpful? Give feedback.
-
I have a query regarding the usage of the "pykeen.evaluation.ClassificationEvaluator()" evaluator within my pipeline. Specifically, I have been retrieving the ROC-AUC metric from the evaluation results.
I do not understand how the ROC-AUC metric is computed. I am particularly interested in understanding the process of generating negative samples for the testing set. Are similar negative sampling techniques utilized for testing as those employed during the training phase?
Beta Was this translation helpful? Give feedback.
All reactions