Rewrite some rlike expression to StartsWith/Contains #10742

Closed
Tracked by #10741
thirtiseven opened this issue Apr 25, 2024 · 0 comments · Fixed by #10715
thirtiseven commented Apr 25, 2024

We can speed up rlike by rewriting some common patterns into faster operations. The first two steps can be to rewrite ^literal to StartsWith and a plain literal to Contains.

Currently we use a RegexParser to parse the input regex into a transpiledAST, and then generate a new regex string from it that cuDF supports. With this RegexParser, we can match on the input regex AST and replace the rlike with GpuStartsWith or GpuContains when overriding.
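
To make the proposed rewrite concrete, here is a minimal Python sketch of the decision logic. It is only an illustration: it inspects the raw pattern string instead of the AST produced by RegexParser, and the names `REGEX_META`, `is_literal`, `rewrite_rlike` and `evaluate` are hypothetical, not part of the plugin. It assumes rlike uses find semantics (a match anywhere in the string) and that `^` anchors only at the start of the input.

```python
import re

# Characters with special meaning in a regex. In this sketch, a pattern
# containing any of them is not treated as a plain literal.
REGEX_META = set(".^$*+?()[]{}|\\")


def is_literal(s: str) -> bool:
    """True if s is non-empty and contains no regex metacharacters."""
    return len(s) > 0 and not any(ch in REGEX_META for ch in s)


def rewrite_rlike(pattern: str):
    """Return (op, arg), where op is "startswith", "contains" or "regex"."""
    # ^literal -> StartsWith(literal)
    if pattern.startswith("^") and is_literal(pattern[1:]):
        return ("startswith", pattern[1:])
    # literal -> Contains(literal)
    if is_literal(pattern):
        return ("contains", pattern)
    # Anything else stays on the regular regex path.
    return ("regex", pattern)


def evaluate(pattern: str, value: str) -> bool:
    """Evaluate an rlike-style match using the cheaper operation when possible."""
    op, arg = rewrite_rlike(pattern)
    if op == "startswith":
        return value.startswith(arg)
    if op == "contains":
        return arg in value
    return re.search(arg, value) is not None
```

For example, `rewrite_rlike("^abc")` selects the starts-with path, `rewrite_rlike("abc")` selects the contains path, and a pattern such as `a.c` falls back to the regex engine. Patterns with escapes or other metacharacters are deliberately left untouched here to keep the sketch conservative.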

thirtiseven changed the title from "Rewrite some rlike expression to StartsWith/EndsWith/Contains/EqualTo" to "Rewrite some rlike expression to StartsWith/Contains" on May 21, 2024
thirtiseven self-assigned this on May 21, 2024