
WIP: Added support for timm in unet #3717

Open · wants to merge 1 commit into base: master
Conversation

madhavajay (Contributor)

This PR attempts to add timm models to the unet_learner, as per conversations during Live Coding 17: https://forums.fast.ai/t/live-coding-17/97166
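Roughly, the usage I'm aiming for looks something like this (a hypothetical sketch on the standard CAMVID_TINY example; the exact signature in this WIP may still change):

```python
from fastai.vision.all import *

# Hypothetical target usage: pass a timm architecture name straight to
# unet_learner. 'resnet18' stands in for any timm backbone.
path = untar_data(URLs.CAMVID_TINY)
dls = SegmentationDataLoaders.from_label_func(
    path, bs=1,  # bs=1 mirrors the memory ceiling described below
    fnames=get_image_files(path/'images'),
    label_func=lambda o: path/'labels'/f'{o.stem}_P{o.suffix}',
    codes=np.loadtxt(path/'codes.txt', dtype=str))
learn = unet_learner(dls, 'resnet18')  # timm model name as a string (hypothetical)
```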

I have several issues so far:

  1. I don't know if there's a way to get the cut preferences from timm. I have added some code which tries to match them to fastai model types as a backup, but they don't seem to match up, so perhaps this is pointless and instead we either get them somewhere else or rely on manual user input (see the feature_info sketch after this list).

  2. I tried training with them, but it seems like they use up a tonne of memory, meaning my batch size can only be 1. I'm not sure what's going on, but something seems wrong, especially considering I tried a smaller timm model (resnet18) than my default fastai unet model (resnet34).

I am sure I have done something wrong, and would appreciate some direction on what to do next.
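For point 1, here's the per-stage metadata timm does expose via feature_info; whether it maps cleanly onto fastai's notion of a cut is exactly what I'm unsure about:

```python
import timm

# Inspect the per-stage feature metadata timm exposes for a backbone.
# These are real timm APIs; mapping them onto fastai cut points is the
# open question in point 1.
m = timm.create_model('resnet18', features_only=True, pretrained=False)
print(m.feature_info.module_name())  # ['act1', 'layer1', 'layer2', 'layer3', 'layer4']
print(m.feature_info.reduction())    # stride at each stage: [2, 4, 8, 16, 32]
print(m.feature_info.channels())     # channels per stage: [64, 64, 128, 256, 512]
```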

@madhavajay madhavajay requested a review from jph00 as a code owner June 30, 2022 04:43
@review-notebook-app
Check out this pull request on ReviewNB to see visual diffs and provide feedback on the Jupyter Notebooks.
@jph00 jph00 marked this pull request as draft June 30, 2022 04:45
madhavajay (Contributor, Author)

So, I tried training it just to make sure it's still working.

I got this far before my Paperspace machine shut down:
[Screenshot: partial training output, 2022-07-02 7:56 am]

So I guess it's definitely training, but extremely slowly. For comparison, on the same dataset with resnet34 and batch size 4, I get epochs of about 6 minutes on the free Paperspace GPU. So assuming the architectures were equally complex, a batch size 4x smaller should only take about 24 minutes per epoch.

It seems like when the model is allocated there's only 7% of GPU memory in use, and then once training with batch size 1 starts it goes to 87%+:
[Screenshot: GPU memory usage, 2022-07-01 6:06 pm]

I guess I don't understand the model-cutting code and how to use it with timm models. Any advice on how to debug this?
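One way to narrow it down might be to snapshot the CUDA allocator around model creation and the first batch (the training-step lines are placeholders, not code from this PR):

```python
import torch

# Print current and peak CUDA memory so we can see where the jump happens.
def report(tag):
    alloc = torch.cuda.memory_allocated() / 2**30
    peak = torch.cuda.max_memory_allocated() / 2**30
    print(f'{tag}: {alloc:.2f} GiB allocated, {peak:.2f} GiB peak')

report('after model init')            # ~7% of the card in this case
# xb, yb = dls.one_batch()            # placeholder: grab one batch
# learn.model(xb).mean().backward()   # placeholder: one forward/backward pass
report('after one fwd/bwd pass')      # where the 87%+ shows up
```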

madhavajay (Contributor, Author)

Okay, I have changed the code to use timm.create_model with features_only=True.
So far it seems to be training a convnext_tiny:
[Screenshot: convnext_tiny training progress, 2022-08-08 11:36 am]
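For reference, features_only=True gives back a backbone that returns a pyramid of feature maps, which is the shape a U-Net decoder needs to hook into:

```python
import timm, torch

# With features_only=True the model returns one feature map per stage
# instead of a classification head's logits.
m = timm.create_model('convnext_tiny', pretrained=False, features_only=True)
x = torch.randn(1, 3, 224, 224)
for f in m(x):
    print(f.shape)
# torch.Size([1, 96, 56, 56])
# torch.Size([1, 192, 28, 28])
# torch.Size([1, 384, 14, 14])
# torch.Size([1, 768, 7, 7])
```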

@madhavajay madhavajay marked this pull request as ready for review August 8, 2022 22:45