feat: torch 2.0 #3682
base: main
Conversation
Codecov Report
@@            Coverage Diff             @@
##             main    #3682      +/-   ##
==========================================
- Coverage   31.74%   31.71%   -0.04%
==========================================
  Files         149      149
  Lines       12149    12162      +13
  Branches     2001     2003       +2
==========================================
  Hits         3857     3857
- Misses       8008     8021      +13
  Partials      284      284
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
class ModelOptions(PartialKwargsModelOptions):
    fullgraph: bool = False
    dynamic: bool = False
    backend: t.Union[str, t.Callable[..., t.Any]] = "inductor"
    mode: t.Optional[str] = None
    options: t.Optional[t.Dict[str, t.Union[str, int, bool]]] = None
    disable: bool = False
First, let’s call this PytorchOptions, and only rename it to ModelOptions when it is imported into bentoml.pytorch.

Second, I think it may be better to have a single compile_kwargs dict in PytorchOptions instead of polluting the namespace of PytorchOptions. Maybe we will have only 2 entries:
enable_compile
compile_kwargs
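A minimal sketch of what that suggestion could look like (the attrs-style declaration and the import path for PartialKwargsModelOptions are assumptions based on the diff above, not the final API):

```python
import typing as t

import attr

# Hypothetical import path for the base class shown in the diff above; not verified.
from bentoml._internal.frameworks.common.pytorch import PartialKwargsModelOptions


@attr.define
class PytorchOptions(PartialKwargsModelOptions):
    # Whether the runner should wrap the loaded model with torch.compile.
    enable_compile: bool = False
    # Keyword arguments forwarded verbatim to torch.compile
    # (fullgraph, dynamic, backend, mode, options, disable, ...).
    compile_kwargs: t.Dict[str, t.Any] = attr.field(factory=dict)
```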
def load_model(
    bentoml_model: str | Tag | Model,
    device_id: t.Optional[str] = "cpu",
    **compile_kwargs: t.Any,
I don’t think we need to do the compile at the load_model level. Let’s just save the original model and do torch.compile when initializing the runner (if the user sets enable_compile=True).
opts = t.cast(ModelOptions, bento_model.info.options)
if get_pkg_version("torch") >= "2.0.0":
    _load_model = partial(
        load_model,
        fullgraph=opts.fullgraph,
        dynamic=opts.dynamic,
        backend=opts.backend,
        mode=opts.mode,
        options=opts.options,
        disable=opts.disable,
    )
else:
    _load_model = load_model
I think we can just do model = load_model(…), and then torch.compile(model, **compile_kwargs) if enable_compile is True.
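A rough sketch of that flow, assuming the two-field PytorchOptions shape suggested earlier in this thread (maybe_compile is a hypothetical helper, not BentoML code):

```python
import typing as t

import torch


def maybe_compile(
    model: "torch.nn.Module",
    enable_compile: bool = False,
    compile_kwargs: t.Optional[t.Dict[str, t.Any]] = None,
) -> "torch.nn.Module":
    # torch.compile only exists on torch >= 2.0, so guard against older versions.
    if enable_compile and hasattr(torch, "compile"):
        return torch.compile(model, **(compile_kwargs or {}))
    # Otherwise hand back the original, uncompiled model unchanged.
    return model
```

The runner init would then reduce to model = load_model(bento_model, device_id) followed by model = maybe_compile(model, opts.enable_compile, opts.compile_kwargs), keeping the saved artifact itself uncompiled.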
This PR aims to bring Torch 2.0 support to save_model via torch.compile.
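From the user’s side, the intent (exact option names are still being discussed above, so treat this as a hypothetical call shape, not the merged API) would be roughly:

```python
import bentoml
import torch.nn as nn

model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))

# Hypothetical: compile-related options would be stored with the model and
# applied via torch.compile when the runner starts, rather than at save time.
bento_model = bentoml.pytorch.save_model(
    "my_torch_model",
    model,
    # e.g. enable_compile=True, compile_kwargs={"backend": "inductor"},
)
```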