Explicit chunking on all interaction simulate models #870

dhensle · 2024-05-09T01:14:47Z

Adds the option of explicit chunking to all interaction simulate models that were not already hooked-up. These include destination choice, location choice, and scheduling.

Also implemented a feature where the explicit_chunk setting can be less than 1. If less than one, it specifies the fraction. So explicit_chunk: 0.1 would mean that there would be 10 chunks. If greater than 1, explicit_chunk remains the total number of rows in the chooser table.

dhensle · 2024-05-21T20:46:10Z

Sharing some testing results for posterity.

Used TransLink's model at a 10% sample size. No chunking looks like this:

I then set chunk_training to explicit with the following explicit_chunk settings for submodels:

workplace_location: 0.5
mandatory_tour_scheduling: 0.25
non_mandatory_tour_destination: 0.5
non_mandatory_tour_scheduling: 0.5
trip_destination: 0.5

(notice the y-axis scale difference from the above plot)

Run time for no chunking was 113.9 minutes and for explicit chunking was 115 minutes -- very minimal increase in runtime.

jpn--

This looks great. Couple minor changes to simplify.

jpn-- · 2024-05-21T23:13:12Z

activitysim/core/chunk.py

@@ -1232,12 +1232,19 @@ def adaptive_chunked_choosers(

    chunk_tag = chunk_tag or trace_label

+    num_choosers = len(choosers.index)
+
+    explicit_and_odd_num_choosers = False


The value of explicit_and_odd_num_choosers is not needed. It's perfectly fine for some multiple of rows_per_chunk to overrun the end of the choosers by a bit, slicing beyond the end of the range. If it were needed, checking for odd wouldn't be enough, we'd need to adjust based on the inverse of the number of chunks (e.g. 0.25 won't line up unless the total is divisible by 4 not 2)

Please take a look at the changes in this commit: dhensle@1ed94c5

I was hitting the assert statement for the alts, (not the choosers) which prompted me to try to adjust the rows_per_chunk. But good point about odd not being good enough. I removed this functionality and replace the assert statement with a simple check on overflow of the alt index. I think the solution in the above commit fixes the issue (it runs successfully), but suggest you take a look. Thanks!

jpn-- · 2024-05-21T23:14:27Z

activitysim/core/chunk.py

+            & (i == estimated_number_of_chunks)
+            & (rows_per_chunk > 1)
+        ):
+            # last chunk may be smaller than chunk_size due to rounding error


We don't need to update the rows_per_chunk here, as noted above we can overrun the end of the choosers and be fine.

see above response.

…tysim into explicit_chunking

dhensle and others added 2 commits May 8, 2024 14:37

explicit chunking on interaction simulate models

0293dd2

accounting for small and odd num_choosers

5c2c126

dhensle marked this pull request as ready for review May 21, 2024 17:43

Merge branch 'main' into explicit_chunking

f031e1e

jpn-- requested changes May 21, 2024

View reviewed changes

dhensle added 2 commits May 22, 2024 09:16

rethinking chunk overflow

1ed94c5

Merge branch 'explicit_chunking' of https://github.com/dhensle/activi…

e73d976

…tysim into explicit_chunking

jpn-- approved these changes May 22, 2024

View reviewed changes

jpn-- merged commit 29d12bc into ActivitySim:main May 22, 2024
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Explicit chunking on all interaction simulate models #870

Explicit chunking on all interaction simulate models #870

dhensle commented May 9, 2024

dhensle commented May 21, 2024

jpn-- left a comment

jpn-- May 21, 2024

dhensle May 22, 2024

jpn-- May 21, 2024

dhensle May 22, 2024

Explicit chunking on all interaction simulate models #870

Explicit chunking on all interaction simulate models #870

Conversation

dhensle commented May 9, 2024

dhensle commented May 21, 2024

jpn-- left a comment

Choose a reason for hiding this comment

jpn-- May 21, 2024

Choose a reason for hiding this comment

dhensle May 22, 2024

Choose a reason for hiding this comment

jpn-- May 21, 2024

Choose a reason for hiding this comment

dhensle May 22, 2024

Choose a reason for hiding this comment