Large geometry intersection check perf speedup #701

urschrei · 2021-12-22T12:26:24Z

Above some threshold, it may be faster to load the geometry or geometries into an R*-tree and query for a subset of intersection candidates.

This is a draft PR because:

I haven't benchmarked the tree-query logic in order to figure out even an approximate value for MAX_NAIVE_SEGMENTS
I haven't figured out whether checking the number of segments in order to decide to switch to trees is an unacceptable perf hit
I agree to follow the project's code of conduct.
I added an entry to CHANGES.md if knowledge of this change could be valuable to users.
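To make the "subset of intersection candidates" idea concrete, here is a minimal, dependency-free sketch. It is not the code in this PR and it does not build a tree: it swaps in a plain envelope (bounding-box) rejection test ahead of the exact segment test, so it still visits every pair. A real R*-tree avoids visiting most pairs at all. All names here (`Segment`, `envelopes_overlap`, `any_intersection_prefiltered`) are made up for illustration.

```rust
/// A line segment between two (x, y) points.
type Segment = ((f64, f64), (f64, f64));

/// Cheap rejection test: do the axis-aligned envelopes overlap (touching counts)?
fn envelopes_overlap(s: &Segment, t: &Segment) -> bool {
    let ((ax, ay), (bx, by)) = *s;
    let ((cx, cy), (dx, dy)) = *t;
    ax.min(bx) <= cx.max(dx)
        && cx.min(dx) <= ax.max(bx)
        && ay.min(by) <= cy.max(dy)
        && cy.min(dy) <= ay.max(by)
}

/// Cross product of (q - p) and (r - p); the sign gives the turn direction.
fn orient(p: (f64, f64), q: (f64, f64), r: (f64, f64)) -> f64 {
    (q.0 - p.0) * (r.1 - p.1) - (q.1 - p.1) * (r.0 - p.0)
}

/// Is `r` within the bounding box of segment `pq`? Only meaningful when `r` is collinear with `pq`.
fn on_segment(p: (f64, f64), q: (f64, f64), r: (f64, f64)) -> bool {
    r.0 >= p.0.min(q.0) && r.0 <= p.0.max(q.0) && r.1 >= p.1.min(q.1) && r.1 <= p.1.max(q.1)
}

/// Exact segment-segment test (crossing, touching, or collinear overlap).
fn segments_intersect(s: &Segment, t: &Segment) -> bool {
    let (a, b) = *s;
    let (c, d) = *t;
    let (d1, d2) = (orient(a, b, c), orient(a, b, d));
    let (d3, d4) = (orient(c, d, a), orient(c, d, b));
    let straddles = |x: f64, y: f64| (x > 0.0 && y < 0.0) || (x < 0.0 && y > 0.0);
    if straddles(d1, d2) && straddles(d3, d4) {
        return true;
    }
    (d1 == 0.0 && on_segment(a, b, c))
        || (d2 == 0.0 && on_segment(a, b, d))
        || (d3 == 0.0 && on_segment(c, d, a))
        || (d4 == 0.0 && on_segment(c, d, b))
}

/// Returns whether any pair intersects, plus how many exact tests were run.
/// Only pairs whose envelopes overlap ("candidates") reach the exact test.
fn any_intersection_prefiltered(a: &[Segment], b: &[Segment]) -> (bool, usize) {
    let mut exact_tests = 0;
    for s in a {
        for t in b {
            if !envelopes_overlap(s, t) {
                continue;
            }
            exact_tests += 1;
            if segments_intersect(s, t) {
                return (true, exact_tests);
            }
        }
    }
    (false, exact_tests)
}
```

The second tuple element is only there to show how many pairs survive the cheap filter; with a spatial index the filter itself would also stop being quadratic.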

urschrei · 2021-12-22T13:39:57Z

This is with length checking turned on, RTree (old) vs naïve (new) polygon-line intersection check using the louisiana.rs test fixture (1350 vertices). We're going to need a bigger polygon.

urschrei · 2021-12-22T13:48:13Z

Same test setup (RTree is old, naïve is new), using norway_main.rs (8534 vertices)

urschrei · 2021-12-22T16:09:56Z

Same test setup (RTree is old, naïve is new), using a new big.rs (100610 vertices) valid polygon:

If I'm interpreting this correctly:

Using a tree for a single Polygon-line intersection check probably isn't worth it for many applications
The length checks aren't significant in wall-clock terms
The existing logic is very fast in wall-clock terms!

Now, on to Polygon-Polygon intersection checking…

urschrei · 2021-12-22T18:26:39Z

Now we're talking:

Old (red, naïve) vs new (blue, tree) (sorry for reversing the setup this time!)
Polygon-Polygon intersection check using big.rs and norway_main.rs (100610 and 8534 vertices, respectively)

The naïve version has to potentially do ~860 million line-line intersection checks so 165 ms isn't bad, but the R*-tree mean is only 35.4 ms…
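For anyone checking the numbers, a quick sketch of the arithmetic behind the figures above. It uses the vertex counts as a stand-in for segment counts (a closed ring has one fewer segment than coordinates, so this slightly overestimates). The function names are made up for illustration.

```rust
/// Upper bound on the pairwise checks the naive approach may perform.
fn naive_pair_count(a_vertices: u64, b_vertices: u64) -> u64 {
    a_vertices * b_vertices
}

/// Ratio of the old mean time to the new mean time.
fn speedup(old_ms: f64, new_ms: f64) -> f64 {
    old_ms / new_ms
}
```

With the values from this comment, `naive_pair_count(100_610, 8_534)` is 858,605,740 (the "~860 million") and `speedup(165.0, 35.4)` is roughly 4.7x.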

michaelkirk · 2021-12-22T18:34:51Z

Cool!

Just as a reminder, @rmanoka did a somewhat related investigation of intersection perf here:
#649

It's a slightly different use case, but the findings might be relevant. In particular, I sort of expect the performance of R-Tree to be better than the naive approach as a.segments() * b.segments() scales.

So if you did want to have a threshold to toggle between "naive" vs. "rtree", using a multiplicative threshold rather than an additive one might make more sense.

if poly_a.num_segments() * poly_b.num_segments() > x {
    use_rtree()
} else {
    use_naive()
}
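A compilable version of the pseudocode above, as a sketch only. The constant's name and value are placeholders (the thread has not settled on a threshold), and `Strategy` / `choose_strategy` are hypothetical names, not anything in geo.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Strategy {
    Naive,
    RTree,
}

/// Placeholder value only: the right cutoff still needs benchmarking.
const MAX_NAIVE_SEGMENT_PAIRS: usize = 1_000_000;

/// Pick a strategy from the product of the two segment counts.
/// `saturating_mul` keeps very large inputs from overflowing `usize`.
fn choose_strategy(a_segments: usize, b_segments: usize, max_naive_pairs: usize) -> Strategy {
    if a_segments.saturating_mul(b_segments) > max_naive_pairs {
        Strategy::RTree
    } else {
        Strategy::Naive
    }
}
```

The product tracks the number of pairwise checks the naive path would do, which is why it behaves better than a sum: one huge polygon against a single segment has a large sum but a small product, and stays on the naive path.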

urschrei · 2021-12-22T18:47:33Z

Yeah, the multiplicative threshold is definitely better. Next up, I need to figure out a way of getting the large polygon data into the benchmark; include!ing it is taking about 45 mins to compile, and it's only around 2.8 MB.

michaelkirk · 2021-12-22T18:53:54Z

Maybe #566?

(lol not to just dump my wishlist on you or anything...)

urschrei · 2021-12-23T15:49:37Z

Hmm, as constructed, intersection_candidates_with_other_tree isn't actually producing any candidates for our (known-to-intersect) test data. I wonder what's going on…

urschrei · 2021-12-23T16:08:57Z

Update: we're checking for line intersections in this case, since we're decomposing the polygons into lines. But that excludes the possibility of polygon a completely enclosing polygon b (e.g. large square with smaller square inside). 🤔

urschrei · 2021-12-23T17:35:59Z

If / when #351 lands, the easier case (A contains B) will be solved, but the more tricky case (A contains B inside one of its holes) remains.
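One possible shape for a fallback once the segment check finds no crossings, sketched without geo types. This is an assumption about how the gap could be closed, not what this PR or #351 does. It relies on the fact that if no boundary segments cross or touch, testing a single vertex of each polygon against the other is enough, and that a vertex inside a hole does not count. `Polygon`, `point_in_ring` and the other names are local to this sketch; rings are assumed closed (first coordinate repeated at the end).

```rust
type Point = (f64, f64);

/// Local stand-in for a polygon: closed exterior ring plus closed hole rings.
struct Polygon {
    exterior: Vec<Point>,
    holes: Vec<Vec<Point>>,
}

/// Even-odd ray casting; points exactly on the boundary are not handled specially.
fn point_in_ring(p: Point, ring: &[Point]) -> bool {
    let mut inside = false;
    for w in ring.windows(2) {
        let (a, b) = (w[0], w[1]);
        // Does this edge straddle the horizontal line through p?
        if (a.1 > p.1) != (b.1 > p.1) {
            let x_cross = a.0 + (p.1 - a.1) * (b.0 - a.0) / (b.1 - a.1);
            if p.0 < x_cross {
                inside = !inside;
            }
        }
    }
    inside
}

/// Inside the exterior ring and not inside any hole.
fn polygon_covers_point(poly: &Polygon, p: Point) -> bool {
    point_in_ring(p, &poly.exterior) && !poly.holes.iter().any(|h| point_in_ring(p, h))
}

/// Precondition: no boundary segment of `a` crosses or touches one of `b`.
/// Then the polygons intersect only if one lies in the other's filled area.
fn intersects_given_no_crossings(a: &Polygon, b: &Polygon) -> bool {
    match (a.exterior.first(), b.exterior.first()) {
        (Some(&pa), Some(&pb)) => polygon_covers_point(a, pb) || polygon_covers_point(b, pa),
        _ => false,
    }
}
```

The hole check is what separates "B inside A" (intersects) from "B inside one of A's holes" (does not).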

frewsxcv · 2021-12-30T17:54:58Z

geo/src/algorithm/intersects/polygon.rs

+    geo_types::Line<T>: RTreeObject,
+{
+    fn intersects(&self, linestring: &LineString<T>) -> bool {
+        if (self.exterior().0.len() + self.interiors().iter().map(|ls| ls.0.len()).sum::<usize>())



#707 😄

frewsxcv · 2021-12-30T18:00:36Z

geo/src/algorithm/intersects/polygon.rs

+            let lines_a: Vec<_> = self
+                .exterior()
+                .lines()
+                .chain(self.interiors().iter().flat_map(|ls| ls.lines()))


Should the last coordinate of the exterior ring connect to the first coordinate of the first interior ring?

Oh wait, that's not what's happening here, nevermind!
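A small stand-alone illustration of why no bridging segment appears. It mimics the chained iterator above with plain slices, on the assumption that `lines()` yields consecutive coordinate pairs within a single ring; `ring_lines` and `polygon_lines` are hypothetical helpers, not geo APIs.

```rust
type Coord = (f64, f64);
type Line = (Coord, Coord);

/// Consecutive coordinate pairs of one ring.
fn ring_lines(ring: &[Coord]) -> impl Iterator<Item = Line> + '_ {
    ring.windows(2).map(|w| (w[0], w[1]))
}

/// Exterior lines followed by each interior ring's lines.
/// Pairs are formed per ring before chaining, so nothing joins two rings.
fn polygon_lines<'a>(
    exterior: &'a [Coord],
    interiors: &'a [Vec<Coord>],
) -> impl Iterator<Item = Line> + 'a {
    ring_lines(exterior).chain(interiors.iter().flat_map(|r| ring_lines(r)))
}
```

For two closed rings of 5 coordinates each this yields 4 + 4 = 8 lines, none of which runs from the exterior's last coordinate to the interior's first.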

Meep meep #708

michaelkirk · 2022-06-16T02:43:17Z

Now that #829 is merged, we might get comparable perf via something like a.relate(b).is_intersects()

urschrei · 2022-06-25T10:32:24Z

a.relate(b).is_intersects()

relate requires a GeoFloat bound (currently GeoNum), and from a first cut, use of relate introduces extreme perf regressions (900%+ in some cases) in our nice new boolean ops intersection benchmark suite…

michaelkirk · 2022-07-15T10:56:29Z

from a first cut, use of relate introduces extreme perf regressions (900%+ in some cases) in our nice new boolean ops intersection benchmark suite…

Do you still have an example of the data you were using to see this kind of performance?

I'd expect the R-Tree approach to be a constant factor slower, but better asymptotically — much better for big inputs and a little slower for small inputs. I am curious though as to what range of geometry sizes we can expect to see the tradeoffs.

urschrei · 2022-07-15T11:08:00Z

If you check out this branch and run the boolean ops benchmarks you should see the perf regression in the criterion output. I should also note that this was a quick check that I didn't dig into, so I may have messed something up.


urschrei · 2022-07-15T11:09:35Z

(I rebased against main just in case)

urschrei added enhancement refactoring idea labels Dec 22, 2021

urschrei force-pushed the rtree_speedups branch from 853620f to 332e949 Compare December 27, 2021 12:13

frewsxcv reviewed Dec 30, 2021


This was referenced Dec 30, 2021

Add num_rings or rings_count to Polygon #706

Closed

Add num_coords or coords_count to Polygon #707

Closed

frewsxcv reviewed Dec 30, 2021


urschrei force-pushed the rtree_speedups branch from 332e949 to d0b4006 Compare June 25, 2022 11:04

urschrei added 4 commits July 15, 2022 12:08

Large geometry intersection check perf speedup

cdd6c10

Above some threshold, it may be faster to load the geometry or geometries into an R*-tree and query for a subset of intersection candidates.

Only use R*-tree for polygon-polygon intersection

e0de30c

Also use multiplicative threshold (TODO: what should it be?)

Add LineString-Polygon intersection

01475f0

Cleaner imports

827be0b

urschrei force-pushed the rtree_speedups branch from d0b4006 to 827be0b Compare July 15, 2022 11:09
