
Made requested change by GKE to meet expectation #5535

Merged: 16 commits merged into triton-inference-server:main on Apr 6, 2023

Conversation

@mengdong (Contributor):

1. Remove the use of Istio and use a simple Ingress solution (see the sketch after this list).
2. Update the HPA API version.
3. Use a service account.
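
For context, here is a minimal sketch (not taken from this PR) of what a plain Kubernetes Ingress and an `autoscaling/v2` HPA might look like; the service, deployment, port, and metric names are illustrative assumptions, not the app's actual manifests.

```yaml
# Illustrative sketch only: a plain Ingress in place of an Istio gateway,
# plus an HPA on the autoscaling/v2 API. All names and numbers are hypothetical.
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: triton-ingress
spec:
  defaultBackend:
    service:
      name: triton-inference-server   # hypothetical Service name
      port:
        number: 8000                  # Triton's default HTTP port
---
apiVersion: autoscaling/v2            # updated from the older v2beta API versions
kind: HorizontalPodAutoscaler
metadata:
  name: triton-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: triton-inference-server     # hypothetical Deployment name
  minReplicas: 1
  maxReplicas: 3
  metrics:
    - type: Resource
      resource:
        name: cpu                     # placeholder metric; the real app may scale on other signals
        target:
          type: Utilization
          averageUtilization: 70
```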

```diff
@@ -89,7 +89,7 @@ properties:
   modelRepositoryPath:
     type: string
     title: Bucket where models are stored. Please make sure the user/service account to create the GKE app has permission to this GCS bucket. Read Triton documentation on configs and formatting details, supporting TensorRT, TensorFlow, Pytorch, Onnx ... etc.
-    default: gs://triton_sample_models/models
+    default: gs://triton_sample_models/23_02
```
Do the models change with every release? Also, is the GCS bucket going to be created for every release before we publish on NGC?

CC @mc-nv

@mc-nv (Collaborator) commented Mar 22, 2023:

1. It's a template; the value must be populated by the user (see the illustrative snippet below).
2. In a clean scenario, yes, we create a new storage bucket, but that depends on the user's needs and the particular cluster setup.
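
To illustrate item 1: a user deploying the app would supply their own GCS path rather than rely on the sample default. The `modelRepositoryPath` key below comes from the schema diff above; the bucket value is made up.

```yaml
# Hypothetical deploy-time parameter for the GKE marketplace app.
# Only the modelRepositoryPath key comes from the schema above; the bucket is illustrative.
modelRepositoryPath: gs://my-project-triton-models/model_repository
```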

@mengdong (Contributor, Author):

We should be able to push a new model for every release; I can talk to Christina to have it set up.

mengdong commented Mar 23, 2023 via email

@mengdong (Contributor, Author):

@nv-kmcgill53 @rmccorm4 any updates? Thanks!

@rmccorm4 (Collaborator) left a comment:

Nice changes, mostly just typos/formatting suggestions

Review suggestions (all resolved):
- deploy/gke-marketplace-app/README.md (6 suggestions)
- deploy/gke-marketplace-app/trt-engine/README.md (1 suggestion)
mengdong and others added 7 commits on March 30, 2023, 09:57; six of them co-authored by Ryan McCormick <mccormick.codes@gmail.com>.
@mengdong (Contributor, Author):

thanks for the changes @rmccorm4

@rmccorm4 (Collaborator):

@mengdong LGTM, just need to fix the merge conflicts

@mengdong (Contributor, Author):

@rmccorm4 just resolved the merge conflict, thanks!

@rmccorm4 (Collaborator):

Lastly @mengdong, is there any reason why we use a TensorRT model for this example? I feel like it adds unnecessary complexity for the user. That is, if we used an ONNX model or something more portable, the public demo bucket wouldn't have to be updated every release. This would also simplify our own upkeep and help make sure the example works more broadly. What do you think?

And if TRT is an important part of the demo, we could just keep the documentation on how users would create their own TRT model from the ONNX model for reference.

@mengdong (Contributor, Author):

Thanks @rmccorm4. I chose TRT mostly out of performance considerations; TRT provides the best performance with Triton.

We do provide a section (README) on how to convert the ONNX model to TRT.

For our upkeep, it should be a very easy process (which could be automated) to follow the README and update the TRT model in the bucket.

@rmccorm4 (Collaborator):

> For our upkeep, it should be a very easy process (which could be automated) to follow the README and update the TRT model in the bucket.

The extra restriction of only working on T4 machines can be cumbersome as well, but if no one has complained so far, then I guess there's no reason to change.

@rmccorm4 previously approved these changes on Mar 30, 2023.
@mengdong (Contributor, Author) commented Apr 4, 2023:

Any updates...?

@mengdong (Contributor, Author) commented Apr 6, 2023:

Any more updates...?

@nv-kmcgill53 merged commit fd8d3fa into triton-inference-server:main on Apr 6, 2023.