New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deduping a single df #55
Comments
I posted a potential solution in #56 I hacked together a solution for my dataset. My solution joins the dataframe to a copy of itself and exclude any links where your unique identifier matches. In my case, it was for football players. I have one column I need to de-duplicate on: 'Name' and an identifier: 'player_id'.
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I'm looking to do this on one df (and then on more than 2). Is there a way to do this that I've missed in the docs?
The text was updated successfully, but these errors were encountered: