feat: data parallel inference examples #2805

bowang007 · 2024-05-02T00:29:08Z

Description

This PR adds a simple example that uses the accelerate library for data parallel inference.

Checklist:

My code follows the style guidelines of this project (You can use the linters)
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas and hacks
I have made corresponding changes to the documentation
I have added tests to verify my fix or my feature
New and existing unit tests pass locally with my changes
I have added the relevant labels to my PR so that relevant reviewers are notified

github-actions

There are some changes that do not conform to Python style guidelines:

--- /home/runner/work/TensorRT/TensorRT/examples/distributed_inference/data_parallel_gpt2.py	2024-05-02 00:29:27.054073+00:00
+++ /home/runner/work/TensorRT/TensorRT/examples/distributed_inference/data_parallel_gpt2.py	2024-05-02 00:31:18.785078+00:00
@@ -13,12 +13,26 @@

distributed_state = PartialState()

model = GPT2LMHeadModel.from_pretrained("gpt2").eval().to(distributed_state.device)

-model.forward = torch.compile(model.forward, backend="torch_tensorrt", options={"truncate_long_and_double": True, "enabled_precisions": {torch.float16}, "debug": True}, dynamic=False,)
+model.forward = torch.compile(
+    model.forward,
+    backend="torch_tensorrt",
+    options={
+        "truncate_long_and_double": True,
+        "enabled_precisions": {torch.float16},
+        "debug": True,
+    },
+    dynamic=False,
+)

with distributed_state.split_between_processes([input_id1, input_id2]) as prompt:
    cur_input = torch.clone(prompt[0]).to(distributed_state.device)

-    gen_tokens = model.generate(cur_input, do_sample=True, temperature=0.9, max_length=100,)
+    gen_tokens = model.generate(
+        cur_input,
+        do_sample=True,
+        temperature=0.9,
+        max_length=100,
+    )
    gen_text = tokenizer.batch_decode(gen_tokens)[0]
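
For readers following the diff above, here is a minimal sketch of the kind of even split that `split_between_processes` hands to each process. It is a pure-Python stand-in under stated assumptions, not accelerate's implementation: `shard_for_process` is a hypothetical helper, and the string prompts are placeholders for the tokenized inputs.

```python
# Illustrative stand-in, NOT accelerate's implementation: `shard_for_process`
# is a hypothetical helper that mimics an even split of a list across ranks.
def shard_for_process(items, num_processes, process_index):
    """Return the contiguous slice of `items` owned by `process_index`."""
    base, extra = divmod(len(items), num_processes)
    # The first `extra` ranks each take one additional item.
    start = process_index * base + min(process_index, extra)
    end = start + base + (1 if process_index < extra else 0)
    return items[start:end]


prompts = ["input_id1", "input_id2"]  # placeholders for the tokenized prompts
shards = [shard_for_process(prompts, 2, rank) for rank in range(2)]
```

Assuming the script is launched with two processes, each rank receives a one-element list, which would explain why the example indexes `prompt[0]` before calling `generate`.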

narendasan

This needs a requirements.txt.
Annotate the script with a description of what's happening; see https://github.com/pytorch/TensorRT/blob/main/examples/dynamo/torch_compile_advanced_usage.py
Add a reference to index.rst so that it gets rendered in the docs:

TensorRT/docsrc/index.rst

Line 113 in 12e885a

tutorials/_rendered_examples/dynamo/torch_compile_stable_diffusion

narendasan

LGTM

HolyWu · 2024-05-17T12:27:44Z

@bowang007 The merge conflicts weren't cleaned up properly, so db24b3b still has <<<<<<< HEAD, ======= and >>>>>>> dfbf6ea84 (feat: data parallel inference sample) in docsrc/index.rst.
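
A check along these lines could catch leftover markers before merging. This is only a sketch: `find_conflict_markers` is a hypothetical helper, not part of the repository's tooling, and the sample text is made up for illustration. A bare `=======` line is reported only inside an open conflict, since reStructuredText heading underlines can look the same.

```python
def find_conflict_markers(text):
    """Return 1-based line numbers of leftover merge conflict markers."""
    hits = []
    in_conflict = False
    for lineno, line in enumerate(text.splitlines(), start=1):
        if line.startswith("<<<<<<< "):
            in_conflict = True
            hits.append(lineno)
        elif line.startswith(">>>>>>> "):
            in_conflict = False
            hits.append(lineno)
        elif in_conflict and line.rstrip() == "=======":
            # Only a separator when it sits between the two markers above.
            hits.append(lineno)
    return hits


# Made-up sample: line 2 is an RST heading underline, lines 3-7 a conflict.
sample = (
    "Title\n"
    "=======\n"
    "<<<<<<< HEAD\n"
    "entry_a\n"
    "=======\n"
    "entry_b\n"
    ">>>>>>> dfbf6ea84 (feat: data parallel inference sample)\n"
)
marker_lines = find_conflict_markers(sample)
```

Running it over the contents of docsrc/index.rst and failing when the result is non-empty would be one way to wire it into a pre-merge check.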

facebook-github-bot added the cla signed label May 2, 2024

github-actions bot requested changes May 2, 2024

bowang007 changed the title ~~feat: data parallel inference sample~~ feat: data parallel inference examples May 2, 2024

bowang007 requested review from narendasan, peri044, gs-olive, zewenli98, apbose and chohk88 May 3, 2024 01:05

narendasan reviewed May 3, 2024

github-actions bot added the documentation Improvements or additions to documentation label May 7, 2024

narendasan approved these changes May 14, 2024

bowang007 force-pushed the multi_gpu_support branch from 4bc05b7 to dfbf6ea Compare May 16, 2024 23:07

feat: data parallel inference sample

7b4b504

bowang007 force-pushed the multi_gpu_support branch from dfbf6ea to 7b4b504 Compare May 16, 2024 23:08

bowang007 merged commit db24b3b into main May 17, 2024
35 of 36 checks passed

bowang007 added a commit that referenced this pull request May 17, 2024

chore: cherry pick of #2805

014ab40

peri044 pushed a commit that referenced this pull request May 21, 2024

chore: cherry pick of #2805 (#2851)

93e8f29
