New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ghost entries skipped for ILU apply and SpMV operator in all levels of AMG/CPR hierarchy #5182
base: master
Are you sure you want to change the base?
Conversation
8a44018
to
f418bd5
Compare
jenkins build this please |
f418bd5
to
562c15a
Compare
jenkins build this please |
Fortunately, no time stepping changes and hence failed tests. Just new warnings that need to be removed. |
562c15a
to
9f95301
Compare
jenkins build this please |
9f95301
to
aa72833
Compare
jenkins build this please |
benchmark please |
@blattms: I believe the last comment "benchmark please" did not have any effect - can you show me where I can check that? |
aa72833
to
d443d52
Compare
jenkins build this please |
turns out you can't. I asked Michael to whitelist you and @aritorto. It is "benchmark please" and should be used scarcely. We cannot really see whether it worked. Normally the benchmarking report is added to the PR after a few hours, but currently that is broken, too. |
benchmark please |
Benchmark result overview:
View result details @ https://www.ytelses.com/opm/?page=result&id=2370 |
Benchmark result overview:
View result details @ https://www.ytelses.com/opm/?page=result&id=2371 |
Benchmark result overview:
View result details @ https://www.ytelses.com/opm/?page=result&id=2372 |
Benchmark result overview:
View result details @ https://www.ytelses.com/opm/?page=result&id=2373 |
Looks like no change in the benchmarks, which is as expected, since they all run using the default linear solver (i.e. ILU0 preconditioner, no CPR/AMG) I believe. |
Yes that makes sense. Curretly, I'm still waiting for Andreas to get back to me with the measurements he has done previously. |
dddb44e
to
3818c75
Compare
New results using the current master (N = number of processes, simulation time in seconds for normal and skipping ghost) below, for 32 and 64 processes the improvement is still around 5% and for 128 processes, the improvement is still around 10%!
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Review is not complete yet.
Looking at this PR I suddenly realize that we are already assuming that copy/overlap entries come after owner. The certainly holds when using CpGrid, but I kind of doubt that it holds for other parallel grids 😬 . It seems like there is no check for this in master anywhere, but I'll double check this.
Anyway, if this assumption is true then this restriction has been there for quite some time without us noticing. Hence it should not hold merging back.
for(int j=0; j < N; j++) | ||
for(int i=overlapStart; i < overlapEnd; i++, ++iter) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't we need to change this, to make the sure that the guys that are GridAttributes::copy
come after the rest?
I think we would need some kind of mapping of the indices to ensure that?
Not sure whether we want an expensive check in GhostLastMatrxAdapter that checks the GridAttributes::owner
comes first.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we discuss this tomorrow once more and then eventually merge?
3818c75
to
05b4af1
Compare
jenkins build this please |
05b4af1
to
3caa86e
Compare
jenkins build this please |
…chy. This works since the ghost entries are the last entries
3caa86e
to
aa9a848
Compare
Replacement for #4296