[feature] Provide unified offline batch inference interface #47

Open
gaocegege opened this issue Jun 14, 2020 · 6 comments
Assignees
Labels
kind/feature Categorizes issue or PR as related to a new feature. priority/P2 Must be staffed and worked on either currently, or very soon, ideally in time for the next release.

Comments

@gaocegege
Member

gaocegege commented Jun 14, 2020

Is this a BUG REPORT or FEATURE REQUEST?:

Uncomment only one, leave it on its own line:

/kind bug
/kind feature

What happened:

Investigate whether we can use https://github.com/uber/neuropod to provide a unified offline batch inference interface for users. They can use the ormb Python SDK to download the model first, then use Neuropod to run offline inference.

Thanks to @terrytangyuan for introducing the project.
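
A minimal sketch of the intended workflow, assuming the model is packaged as a Neuropod archive and the ormb CLI is available; the registry reference, local paths, and tensor names are placeholders, and the exact ormb commands/flags should be checked against the ormb docs:

```python
# Minimal sketch, not a settled design: pull the model with the ormb CLI and run
# it through Neuropod's framework-agnostic Python loader. The registry reference,
# local paths, and tensor names below are placeholders.
import subprocess

import numpy as np
from neuropod.loader import load_neuropod

MODEL_REF = "harbor.example.com/library/my-model:v1.0"  # hypothetical reference

# Download the model from the registry (the ormb Python SDK is still being
# discussed in #49, so this shells out to the CLI for now; flags may differ).
subprocess.run(["ormb", "pull", MODEL_REF], check=True)
subprocess.run(["ormb", "export", MODEL_REF], check=True)

# Load the exported Neuropod archive and run offline inference on one batch.
model = load_neuropod("./model")  # placeholder path to the exported neuropod
batch = {"x": np.random.rand(32, 3).astype(np.float32)}  # placeholder input
outputs = model.infer(batch)
print(outputs)
```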

What you expected to happen:

How to reproduce it (as minimally and precisely as possible):

Anything else we need to know?:

@gaocegege gaocegege added kind/feature Categorizes issue or PR as related to a new feature. priority/P2 Must be staffed and worked on either currently, or very soon, ideally in time for the next release. labels Jun 14, 2020
@gaocegege
Member Author

It is related to our Python SDK. Ref #49

@judgeeeeee
Contributor

judgeeeeee commented Jul 20, 2020

There are roughly two approaches to consider:

  • Transform all models into a unified format (e.g. ONNX, then use onnxruntime to provide inference). In this case the conversion task needs some resources and dependencies, so it is recommended to put it in the model registry. (A rough sketch follows this list.)
  • Unify the models behind a common interface (e.g. Neuropod). Consider implementing this in the SDK, using YAML to generate the config consumed by something like Neuropod.
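
A rough sketch of the first approach, assuming the source model is a PyTorch module; the model definition, shapes, and tensor names are placeholders:

```python
# Rough sketch of the conversion approach: export to ONNX once, then every
# downstream consumer only needs onnxruntime, regardless of the source framework.
import numpy as np
import onnxruntime as ort
import torch

model = torch.nn.Linear(3, 2)      # placeholder model
dummy_input = torch.randn(1, 3)

# Convert the framework-specific model into the unified ONNX format.
torch.onnx.export(
    model, dummy_input, "model.onnx",
    input_names=["input"], output_names=["output"],
    dynamic_axes={"input": {0: "batch"}},
)

# Offline inference with onnxruntime only.
session = ort.InferenceSession("model.onnx")
batch = np.random.rand(32, 3).astype(np.float32)
outputs = session.run(None, {"input": batch})
print(outputs[0].shape)  # (32, 2)
```

The second approach would instead keep each model in its native format and rely on something like Neuropod plus a YAML-driven config to expose a single inference API.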

@gaocegege
Member Author

Transform all models into a unified format (e.g. ONNX, then use onnxruntime to provide inference). In this case the conversion task needs some resources and dependencies, so it is recommended to put it in the model registry.

I think we are already using this approach in the model registry (Triton Inference Server), but here we want to support offline inference.

@judgeeeeee
Contributor

judgeeeeee commented Jul 20, 2020

Transform all models into a unified format (e.g. ONNX, then use onnxruntime to provide inference). In this case the conversion task needs some resources and dependencies, so it is recommended to put it in the model registry.

I think we are already using this approach in the model registry (Triton Inference Server), but here we want to support offline inference.

We would use the converted model for offline inference, but we need to convert the model first, maybe using the model registry.
If we use only one model type, we can provide only one library for #40
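
To make the offline part concrete, here is a sketch of what the batch job could look like once the model has been converted; the file layout and the input tensor name are made up for illustration:

```python
# Sketch of an offline batch job over the converted (ONNX) model.
import glob
import os

import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("model.onnx")
os.makedirs("outputs", exist_ok=True)

# Iterate over pre-serialized input batches and persist the predictions.
for path in sorted(glob.glob("inputs/batch_*.npy")):
    batch = np.load(path).astype(np.float32)
    (preds,) = session.run(None, {"input": batch})
    np.save(os.path.join("outputs", os.path.basename(path)), preds)
```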

@gaocegege
Member Author

Personally, I prefer the latter.

If we can unify the API on top of the models, we can support multiple framework formats. And if we want to support offline inference, I think we always need an SDK anyway.
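
Purely as a hypothetical illustration of such an SDK surface (the class name, method names, and format switch are invented here, not a settled design):

```python
# Hypothetical unified SDK wrapper: one loading/inference API across formats.
from typing import Dict

import numpy as np


class OfflineModel:
    """Framework-agnostic wrapper: load once, then infer on numpy batches."""

    def __init__(self, model_path: str, fmt: str):
        if fmt == "neuropod":
            from neuropod.loader import load_neuropod
            self._model = load_neuropod(model_path)
            self._run = lambda batch: self._model.infer(batch)
        elif fmt == "onnx":
            import onnxruntime as ort
            sess = ort.InferenceSession(model_path)
            # Map outputs back to their names so both backends return dicts.
            self._run = lambda batch: dict(
                zip([o.name for o in sess.get_outputs()],
                    sess.run(None, batch)))
        else:
            raise ValueError(f"unsupported format: {fmt}")

    def infer(self, batch: Dict[str, np.ndarray]) -> Dict[str, np.ndarray]:
        return self._run(batch)
```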

@judgeeeeee
Contributor

/assign
