Feature Request: String to bitset #11989
david-cortes
started this conversation in
Ideas
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
There is a function which calculates the jaccard similarity of strings:
duckdb/src/core_functions/scalar/string/jaccard.cpp
Line 41 in d9efdd1
This function involves creating a bitset of the two elements being compared during the function call:
duckdb/src/core_functions/scalar/string/jaccard.cpp
Line 32 in d9efdd1
Would be helpful to have a dedicated function to instead get the bitsets of the strings without immediately calculating these similarities, so that a user could later on do those same set operations on them and get information like set intersection and union separately.
This would allow for example:
Among others.
Beta Was this translation helpful? Give feedback.
All reactions