ASReviewData: deduplication method #1230
gimoAI started this conversation in Show and tell
We have added deduplication methods to the ASReviewData object. This feature will be available in the v1.1 release.
How to use
Extra Options
By default, deduplication is based on the `doi` column and on duplicate titles/abstracts. It is possible to use a custom persistent identifier (PID) other than DOI by passing it as a parameter. For example, in a dataset with PubMed identifiers (PMID), that identifier can be used for deduplication. For the `drop_duplicates` function, it is also possible to perform the action in place and to prevent resetting the existing index.
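To make the idea concrete, here is a minimal, library-independent sketch of PID-plus-text deduplication in plain Python. It is not the ASReview implementation: the helper names (`_text_key`, `duplicated`, `drop_duplicates` as free functions), the text normalization rule, and the sample records are all assumptions chosen for illustration. It only shows how a record can be flagged as a duplicate when either its identifier or its normalized title/abstract has been seen before, and how switching the PID from `doi` to `pmid` changes the result.

```python
import re


def _text_key(record):
    """Hypothetical normalization: lowercase title + abstract, keep only a-z and 0-9."""
    text = f"{record.get('title') or ''} {record.get('abstract') or ''}"
    return re.sub(r"[^a-z0-9]", "", text.lower())


def duplicated(records, pid="doi"):
    """Return one boolean per record; True marks a repeat of an earlier record.

    A record counts as a duplicate if its non-empty PID or its non-empty
    normalized title/abstract was already seen. Assumed logic, for illustration.
    """
    seen_pids, seen_texts = set(), set()
    flags = []
    for record in records:
        pid_value = str(record.get(pid) or "").strip().lower()
        text = _text_key(record)
        is_dup = (pid_value != "" and pid_value in seen_pids) or (
            text != "" and text in seen_texts
        )
        flags.append(is_dup)
        if pid_value:
            seen_pids.add(pid_value)
        if text:
            seen_texts.add(text)
    return flags


def drop_duplicates(records, pid="doi"):
    """Return a new list that keeps only the first occurrence of each record."""
    return [r for r, dup in zip(records, duplicated(records, pid)) if not dup]


# Made-up sample data: records 0 and 1 share a PMID, records 0 and 2 share text.
records = [
    {"pmid": "111", "doi": "10.1/a", "title": "Deep learning", "abstract": "A study."},
    {"pmid": "111", "doi": "", "title": "Deep learning (reprint)", "abstract": "Same study."},
    {"pmid": "222", "doi": "10.1/b", "title": "deep  learning", "abstract": "a study"},
    {"pmid": "333", "doi": "10.1/c", "title": "Active learning", "abstract": "Other."},
]

print(duplicated(records))              # DOI-based: only the text match is caught
print(duplicated(records, pid="pmid"))  # PMID-based: the shared PMID is caught too
```

In this sketch, empty identifiers are never treated as matching each other, so records without a DOI are only compared on their title and abstract.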