Fix performance issue in CoroutineDescendantAxis #609

JohannesLichtenberger · 2023-05-10T07:09:44Z

https://github.com/sirixdb/sirix/blob/master/bundles/sirix-kotlin-api/src/main/kotlin/org/sirix/axis/CoroutineDescendantAxis.kt

It should read mainly from the page cache (BufferManagerImpl => caches) / in the test, actually always, but I assume it'll make I/Os all the time. That's why it's more than 100x slower than the standard "serial" DescendantAxis.

The idea of the axis is to prefetch right sibling nodes in parallel in a second read-only trx.

So we'll have to debug and profile...

JohannesLichtenberger · 2023-05-22T21:37:10Z

It seems in the Producer there's a right sibling key stack, which is an issue with the proposed algorithm (I thought maybe we could process all right siblings in parallel (perhaps we can also take the descendantCount of each inner node into account and only fetch in parallel if the count is beyond a threshold.

Kiriakos1998 · 2023-05-31T19:36:56Z

Hello, @JohannesLichtenberger, just realized that perhaps it was not clear that I looked into this issue eventually. So I specifically added a timestamp in the testAxisConventions in AbsAxisTest.
iteration -> 1001400
iteration -> 4000400
iteration -> 4000400
iteration -> 87000500
iteration -> 87000500
iteration -> 97001000
iteration -> 97001000
iteration -> 97001000
iteration -> 98001800
iteration -> 98001800
Here is the result in Nanos per iteration. Perhaps the way to go is to see which methods are specifically involved in these iterations. Also, it will be interesting to see which line of code inside the while loop is creating these delays.

JohannesLichtenberger · 2023-05-31T21:17:17Z

I think the implementation itself is not good. First of all the producer seems to explicitly block until it's finished and only afterwards the next right sibling of a node during preorder is traversed to send the nodes to the axis. Second, AFAICS in the producer itself the right sibling key stack is used instead of a kind of fork join approach.

I'm also not sure if thread synchronization itself might be an issue as well as starting of new transactions...

Maybe there's also a better approach for parallelization of preorder traversal with the DOM like pointer based encoding (firstChild/lastChild/rightSibling/leftSibling/parent) of every node.

The other issue with the slow IO using based backend might also be interesting...

JohannesLichtenberger added good first issue help wanted labels May 17, 2023

JohannesLichtenberger mentioned this issue May 22, 2023

Investigate and fix bad I/O performance #558

Open

JohannesLichtenberger closed this as completed May 31, 2023

JohannesLichtenberger reopened this May 31, 2023

JohannesLichtenberger added the Hacktoberfest label Sep 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix performance issue in CoroutineDescendantAxis #609

Fix performance issue in CoroutineDescendantAxis #609

JohannesLichtenberger commented May 10, 2023

JohannesLichtenberger commented May 22, 2023

Kiriakos1998 commented May 31, 2023

JohannesLichtenberger commented May 31, 2023 •

edited

Fix performance issue in CoroutineDescendantAxis #609

Fix performance issue in CoroutineDescendantAxis #609

Comments

JohannesLichtenberger commented May 10, 2023

JohannesLichtenberger commented May 22, 2023

Kiriakos1998 commented May 31, 2023

JohannesLichtenberger commented May 31, 2023 • edited

JohannesLichtenberger commented May 31, 2023 •

edited