Add link to PharmKG8k about alternate version of data #1276
Labels
blocked:submitter
Waiting on clarification or input from the submitter
documentation
Improvements or additions to documentation
The PharmKG8k dataset that you use, and the one which the people behind the PharmKG8k paper point to have some significant issues.
We think that we have solved these significant issues. The issues consist of data leakage and the split being inductive.
More can be read about it on this GitHub:
https://github.com/skingi20/improvement_of_PharmKG8k_split
The new split that we propose, which has no data leakage and is transductive, can be found on there as well as proof of the original PharmKG8k split's problems.
The text was updated successfully, but these errors were encountered: