[VL] Enable local sort-based shuffle #5811

Merged 7 commits on May 23, 2024

@marin-ma (Contributor) commented May 20, 2024

This is an experimental feature. In the TPC-H benchmark, with the default number of shuffle partitions (2x to 4x the vcore count), we observed a significant performance drop and higher memory pressure compared with hash-based shuffle. Sort-based shuffle is enabled only when the number of shuffle partitions is greater than spark.gluten.sql.columnar.shuffle.sort.threshold (default value 100000). We recommend not enabling local sort-based shuffle until the performance issue is fixed.
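For readers who want the selection rule spelled out, here is a minimal sketch in plain Python. It only models the comparison described above; it is not the Gluten implementation. The config key and default come from this description, while `choose_shuffle_writer` and the `"sort"` / `"hash"` labels are hypothetical names introduced for illustration.

```python
from typing import Dict, Optional

# Config key and default value as stated in the PR description.
SORT_THRESHOLD_KEY = "spark.gluten.sql.columnar.shuffle.sort.threshold"
DEFAULT_SORT_THRESHOLD = 100000


def choose_shuffle_writer(num_partitions: int,
                          conf: Optional[Dict[str, str]] = None) -> str:
    """Hypothetical helper: return "sort" or "hash" for a partition count.

    Mirrors the rule in the description: sort-based shuffle is used only
    when the partition count is strictly greater than the threshold.
    """
    conf = conf or {}
    threshold = int(conf.get(SORT_THRESHOLD_KEY, DEFAULT_SORT_THRESHOLD))
    return "sort" if num_partitions > threshold else "hash"


if __name__ == "__main__":
    # With the default threshold, typical partition counts stay on hash-based shuffle.
    print(choose_shuffle_writer(200))
    # Lowering the threshold opts in to the experimental sort-based path.
    print(choose_shuffle_writer(200, {SORT_THRESHOLD_KEY: "100"}))
```

Under this reading, the high default keeps ordinary jobs on hash-based shuffle, and the sort-based path is reached only by lowering the threshold or using a very large partition count.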

Thanks for opening a pull request!

Could you open an issue for this pull request on GitHub Issues?

https://github.com/apache/incubator-gluten/issues

Then could you also rename the commit message and pull request title to the following format?

[GLUTEN-${ISSUES_ID}][COMPONENT]feat/fix: ${detailed message}

See also:

Run Gluten Clickhouse CI

@marin-ma (Contributor, Author) commented

The UT failure on commit 56580aa is caused by a metrics change that breaks the AQE suites: https://github.com/apache/incubator-gluten/actions/runs/9185004959

Contributor

@kerwin-zk kerwin-zk left a comment

LGTM. Thanks!

@marin-ma marin-ma merged commit fbb4ec5 into apache:main May 23, 2024
42 checks passed