We should shuffle around the realization order to minimize peak memory usage #8150

abadams · 2024-03-12T17:20:56Z

Consider a pipeline with three outputs f2, g2, h2. These call Funcs f1, g1, h1 respectively. Everything is compute_root. The realization order f1 f2 g1 g2 h1 h2 is going to use a lot less intermediate memory than the order f1 g1 h1 f2 g2 h2.

We should shuffle the realization order of realizations at each loop level in schedule_functions to minimize the number of overlapping lifetimes. This could be done by identifying each loop level used in a compute_at, and then for each, coming up with a new realization order for that loop level. This would have to be done at the level of fused groups, not Funcs.

rootjalex · 2024-03-19T17:47:15Z

This seems like something that should be scheduable, instead of automatic?

steven-johnson · 2024-03-19T17:56:18Z

This seems like something that should be scheduable, instead of automatic?

Is there ever a situation where we'd choose to use more than the minimum?

abadams · 2024-03-19T19:15:53Z

It also affects locality, so there might be a trade-off here. Also if the allocations are all dynamic-size, the peak usage and thus the order will depend on those sizes, so the compiler won't be able to infer it.

You can already sort of schedule it with compute_at(Var::outermost(), the_func_you_want_to_go_before)

abadams added the enhancement New user-visible features or improvements to existing features. label Mar 12, 2024

abadams changed the title ~~We should shuffle around the realization order to minimize peak memory usages~~ We should shuffle around the realization order to minimize peak memory usage Mar 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

We should shuffle around the realization order to minimize peak memory usage #8150

We should shuffle around the realization order to minimize peak memory usage #8150

abadams commented Mar 12, 2024

rootjalex commented Mar 19, 2024

steven-johnson commented Mar 19, 2024

abadams commented Mar 19, 2024

We should shuffle around the realization order to minimize peak memory usage #8150

We should shuffle around the realization order to minimize peak memory usage #8150

Comments

abadams commented Mar 12, 2024

rootjalex commented Mar 19, 2024

steven-johnson commented Mar 19, 2024

abadams commented Mar 19, 2024