Remove only exact duplicates (same ID, not same name, title and duration) #136

Open
JustMyGithub opened this issue Nov 17, 2021 · 2 comments

@JustMyGithub

Your fine tool matches not only songs with the same ID, but also songs with the same name, title and duration that have different IDs. While this is often helpful, it is also dangerous, and in some situations it is simply not what you want.
It is dangerous because the title does not always reflect different versions of a song. Sometimes the difference is only in the album name (like studio vs. acoustic vs. live, or explicit vs. clean).

I would love it if it were possible to remove only same-ID songs.
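
To make the request concrete, here is a minimal Python sketch of the two matching modes. It is not the tool's actual code: the `Track` class, the `find_duplicates` function and the `exact_only` flag are hypothetical names used only for illustration, and the metadata comparison (lower-cased name and title plus identical duration) is an assumption about how such matching could work.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Track:
    # Hypothetical minimal track record; field names are assumptions.
    id: str
    artist: str
    title: str
    duration_ms: int


def find_duplicates(tracks, exact_only=True):
    """Return the indices of tracks that duplicate an earlier track.

    exact_only=True  -> only tracks whose ID was already seen count.
    exact_only=False -> tracks with the same artist, title and duration
                        also count, even if their IDs differ.
    """
    seen_ids = set()
    seen_meta = set()
    duplicates = []
    for index, track in enumerate(tracks):
        meta = (track.artist.lower(), track.title.lower(), track.duration_ms)
        if track.id in seen_ids:
            duplicates.append(index)
        elif not exact_only and meta in seen_meta:
            duplicates.append(index)
        else:
            seen_ids.add(track.id)
            seen_meta.add(meta)
    return duplicates
```

Note that the album name is not part of the comparison, so with `exact_only=False` a studio and a live recording with identical metadata would be treated as duplicates, which is exactly the risk described above.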

In case anyone is interested: "some situations" in my case means my list of all the songs I have ever listened to on Spotify.

@reecestart

Can you give some examples of songs that you consider not duplicates that have the same name, title and duration?

@JustMyGithub
Author

Well, depending on the usage, it does not matter whether they are the same song or not. Nevertheless, I am pretty sure that there are false positives, for example live albums that do not have "live" in the song title of each track. Similarly, the following two songs have the same name and title and an almost identical duration, but one is an acoustic version and the other is not.

https://open.spotify.com/track/4h2wAqBUu9jaTAdrLttSRh
https://open.spotify.com/track/6ZOBP3NvffbU4SZcrnt1k6
(I only remember this because I wondered about that "weird" version, which is not labeled as being different from the common one.)

Interestingly, https://open.spotify.com/track/6ZOBP3NvffbU4SZcrnt1k6 has a different duration in different views on Spotify (some places show 3:53, others show 3:54).

My use case is the following: I explore a lot of music on Spotify, so I want to keep track of what I have already listened to. For that reason, I have a huge playlist with all these songs. When I find an interesting playlist, I add all of its songs to my huge playlist, relying on Spotify's feature of not adding same-ID songs twice. That way I can filter out of the new playlist the songs that I have listened to before. If I removed songs with the same name, title and duration (but a different ID) from that list, I would listen to those songs again during exploration (and waste time).
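
The filtering step in this workflow boils down to an ID-based set difference. A rough Python sketch, where `unheard_tracks` and its parameter names are made up for illustration and the inputs are assumed to be plain lists of track IDs:

```python
def unheard_tracks(new_playlist_ids, listened_ids):
    """Return the IDs from new_playlist_ids that are not in listened_ids.

    Order is preserved, and an ID repeated within the new playlist
    is returned only once.
    """
    known = set(listened_ids)
    result = []
    for track_id in new_playlist_ids:
        if track_id not in known:
            known.add(track_id)
            result.append(track_id)
    return result
```

This only works as intended if the "listened" list keeps every ID I have ever heard. Removing entries because another track shares their name, title and duration would make those IDs show up as unheard again.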

Why are there same-ID duplicates on that huge playlist in the first place? Because at some point, I think in December 2020, Spotify introduced a new desktop UI that silently removed the "want to add same-ID songs again?" question, and it took several months until they re-added the feature. I think that took about half a year, which even led me to try Deezer instead for a few months.

Btw: Obviously, there are songs of different lengths that are indeed the same song, because CD masterings often include silence at the end of a song, often 2 seconds. So if those CD masterings were provided to Spotify for some albums and not for all, it is likely that tracks differing by less than 2-3 seconds are actually the same song and version.
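
As a sketch of that idea, a duration comparison with a tolerance could look like the following. The function name and the 3000 ms default are assumptions taken from the "2-3 seconds" estimate above, not values from the tool.

```python
def durations_close(duration_a_ms, duration_b_ms, tolerance_ms=3000):
    """Return True if the two durations differ by less than tolerance_ms."""
    return abs(duration_a_ms - duration_b_ms) < tolerance_ms


# 3:53 vs. 3:54, the two durations Spotify shows for the track above
print(durations_close(233_000, 234_000))
```

This is only a heuristic: as the acoustic example shows, two tracks with nearly identical durations can still be different versions.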
