Optimise Sorting in SegmentProcessorFramework [WIP] #13091

Draft: wants to merge 3 commits into master

aishikbh (Contributor) commented May 6, 2024

Load only the sort columns into memory and call compare on those loaded columns, instead of calling compare on the whole memory-mapped data file.

* Create the sort order using only the sort columns. Currently we mmap
  the whole data file, and the compare function causes thrashing if the
  dataset is too large.

* With this change we load just the sort columns and create the sort
  order using comparisons on those columns only.
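
To illustrate the idea, here is a minimal sketch, not the code in this PR. It assumes the sort columns have already been read into on-heap arrays (`sortColumns[c][r]` is the value of sort column `c` for row `r`). The class and method names are hypothetical and are not part of the Pinot API. The sort works only on row ids and the in-memory sort columns, so the comparator never touches the memory-mapped data file.

```java
import java.util.Arrays;

// Hypothetical illustration only; names are not taken from the Pinot codebase.
class SortColumnOrderSketch {

  /**
   * Returns the row ids in sorted order, comparing rows column by column.
   * sortColumns[c][r] is the in-memory value of sort column c for row r.
   */
  @SuppressWarnings({"rawtypes", "unchecked"})
  static int[] sortedRowIds(Comparable[][] sortColumns, int numRows) {
    // Sort boxed row ids so a custom comparator can be used.
    Integer[] rowIds = new Integer[numRows];
    for (int i = 0; i < numRows; i++) {
      rowIds[i] = i;
    }
    Arrays.sort(rowIds, (a, b) -> {
      // Compare the two rows on each sort column in turn; the first
      // non-equal column decides the order.
      for (Comparable[] column : sortColumns) {
        int cmp = column[a].compareTo(column[b]);
        if (cmp != 0) {
          return cmp;
        }
      }
      return 0;
    });
    int[] result = new int[numRows];
    for (int i = 0; i < numRows; i++) {
      result[i] = rowIds[i];
    }
    return result;
  }

  @SuppressWarnings("rawtypes")
  public static void main(String[] args) {
    // Two sort columns for four rows: (b,2), (a,9), (b,1), (a,3).
    Comparable[][] sortColumns = {
        {"b", "a", "b", "a"},
        {2, 9, 1, 3}
    };
    int[] order = sortedRowIds(sortColumns, 4);
    System.out.println(Arrays.toString(order)); // prints [3, 1, 2, 0]
  }
}
```

The resulting row-id order can then be used to read full rows from the data file in sorted order, so each row is fetched from the mapped file once rather than on every comparison.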

codecov-commenter commented May 6, 2024

Codecov Report

Attention: Patch coverage is 0%, with 81 lines in your changes missing coverage. Please review.

Project coverage is 27.75%. Comparing base (59551e4) to head (6936bac).
Report is 409 commits behind head on master.

| Files | Patch % | Lines |
|---|---|---|
| ...cessing/genericrow/GenericRowFileRecordReader.java | 0.00% | 41 Missing ⚠️ |
| .../processing/genericrow/GenericRowDeserializer.java | 0.00% | 38 Missing ⚠️ |
| ...nt/processing/genericrow/GenericRowFileReader.java | 0.00% | 2 Missing ⚠️ |

Additional details and impacted files
@@              Coverage Diff              @@
##             master   #13091       +/-   ##
=============================================
- Coverage     61.75%   27.75%   -34.00%     
+ Complexity      207      192       -15     
=============================================
  Files          2436     2507       +71     
  Lines        133233   137864     +4631     
  Branches      20636    21345      +709     
=============================================
- Hits          82274    38266    -44008     
- Misses        44911    96687    +51776     
+ Partials       6048     2911     -3137     

| Flag | Coverage Δ |
|---|---|
| custom-integration1 | ? |
| integration | 0.00% <0.00%> (-0.01%) ⬇️ |
| integration1 | ? |
| integration2 | 0.00% <0.00%> (ø) |
| java-11 | 27.72% <0.00%> (-33.99%) ⬇️ |
| java-21 | 27.75% <0.00%> (-33.87%) ⬇️ |
| skip-bytebuffers-false | 27.74% <0.00%> (-34.00%) ⬇️ |
| skip-bytebuffers-true | 27.73% <0.00%> (+<0.01%) ⬆️ |
| temurin | 27.75% <0.00%> (-34.00%) ⬇️ |
| unittests | 27.75% <0.00%> (-34.00%) ⬇️ |
| unittests1 | ? |
| unittests2 | 27.75% <0.00%> (+0.02%) ⬆️ |

Flags with carried forward coverage won't be shown.
