
[feature] Support Edge Deployment #40

Open
gaocegege opened this issue Jul 16, 2020 · 6 comments
Assignees
Labels
priority/P2 Must be staffed and worked on either currently, or very soon, ideally in time for the next release.

Comments

@gaocegege
Member

Is this a BUG REPORT or FEATURE REQUEST?:

Uncomment only one, leave it on its own line:

/kind bug
/kind feature

What happened:

We would like to investigate whether we can support deploying models on edge servers/devices.

The workflow would be: users download the binary from the model registry and use it to serve the model at the edge.

Ref https://aws.amazon.com/cn/blogs/aws/amazon-sagemaker-neo-train-your-machine-learning-models-once-run-them-anywhere/

What you expected to happen:

How to reproduce it (as minimally and precisely as possible):

Anything else we need to know?:

@hicaistar

hicaistar commented Jul 16, 2020

A brief flow of Amazon SageMaker Neo:

  1. Compile a model to an executable binary that can be run by the Neo runtime (a Python package)
  2. The application running on the edge device loads the Neo runtime
  3. The Neo runtime loads the model and starts serving

The Neo runtime is the core piece, and it is very small. Would it be feasible to compile the serving program together with a model into a single binary that can run on different platforms?
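The three-step flow above can be sketched in plain Python. This is an illustrative toy only, not the real Neo/DLR API: `compile_model`, `CompiledModel`, and `EdgeRuntime` are hypothetical stand-ins for the compiler output and the small runtime that the edge application loads.

```python
# Illustrative sketch of the compile -> load -> serve flow described above.
# All names here are hypothetical; this is not the real SageMaker Neo API.
import pickle


class CompiledModel:
    """Stands in for the platform-specific artifact produced by the compiler."""

    def __init__(self, weights):
        self.weights = weights


def compile_model(weights, target_arch):
    # Step 1: "compile" the model into a serialized artifact for one arch.
    return pickle.dumps({"arch": target_arch, "model": CompiledModel(weights)})


class EdgeRuntime:
    """Stands in for the small runtime loaded by the edge application."""

    def load(self, artifact):
        # Steps 2-3: the runtime loads the artifact and exposes inference.
        payload = pickle.loads(artifact)
        self.model = payload["model"]
        self.arch = payload["arch"]

    def predict(self, x):
        # Toy linear "inference" so the sketch is runnable end to end.
        return sum(w * v for w, v in zip(self.model.weights, x))


artifact = compile_model([0.5, 1.5], target_arch="arm64")
runtime = EdgeRuntime()
runtime.load(artifact)
print(runtime.predict([2.0, 4.0]))  # 0.5*2.0 + 1.5*4.0 = 7.0
```

The key property the sketch mirrors is that the runtime stays generic and tiny, while everything model- and platform-specific lives in the compiled artifact.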

@gaocegege
Member Author

Would it be feasible to compile the serving program together with a model into a single binary that can run on different platforms?

I do not think we should provide one unified binary; I think we should provide one binary per architecture. WDYT?
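The "one binary per architecture" idea could look like the following sketch, where the registry keeps one artifact per (os, arch) pair and the edge client resolves the right one at download time. The index contents and paths here are made up for illustration; only `platform.system()`/`platform.machine()` are real stdlib calls.

```python
# Hypothetical sketch: a registry index with one prebuilt binary per platform.
import platform

# Hypothetical registry index; model name, tags, and paths are illustrative.
REGISTRY_INDEX = {
    ("linux", "x86_64"): "models/resnet50/linux-amd64.bin",
    ("linux", "aarch64"): "models/resnet50/linux-arm64.bin",
    ("darwin", "arm64"): "models/resnet50/darwin-arm64.bin",
}


def resolve_artifact(os_name=None, arch=None):
    """Return the registry path of the binary matching the caller's platform."""
    os_name = os_name or platform.system().lower()
    arch = arch or platform.machine().lower()
    try:
        return REGISTRY_INDEX[(os_name, arch)]
    except KeyError:
        raise RuntimeError(f"no prebuilt binary for {os_name}/{arch}")


print(resolve_artifact("linux", "aarch64"))  # models/resnet50/linux-arm64.bin
```

Defaulting to the caller's own platform keeps the edge-side download step a one-liner while the registry carries the per-arch complexity.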

@gaocegege gaocegege added the priority/P2 Must be staffed and worked on either currently, or very soon, ideally in time for the next release. label Jul 17, 2020
@hicaistar

Would it be feasible to compile the serving program together with a model into a single binary that can run on different platforms?

I do not think we should provide one unified binary; I think we should provide one binary per architecture. WDYT?

LGTM

@gaocegege
Member Author

gaocegege commented Jul 17, 2020

It may be related to kleveross/ormb#47

@gaocegege
Member Author

/assign @judgeeeeee

@xieydd

xieydd commented Aug 17, 2020

For edge devices on different platforms, there are two key points:

  • Speed is the key point.
  • Supported platforms, like iOS, Android, or NVIDIA edge devices.

There is no doubt that TVM would be the best choice, but TVM is too complicated for users and developers. We have deployed many applications on edge devices; the pipeline is:
pytorch/tensorflow -> onnx (or direct) -> mnn/ncnn/tnn (we compare the frameworks using internal tools, but an open-source benchmark such as https://github.com/AI-performance/embedded-ai.bench could also be used; the framework model is easy to convert to a binary).

Looking forward to the Klever edge deployment design.
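The pipeline above (framework model -> ONNX -> mnn/ncnn/tnn) can be expressed as two composed conversion steps. The sketch below uses pure-Python stand-ins; `export_to_onnx` and `convert_to_edge_format` are hypothetical names, not the real converter APIs (e.g. `torch.onnx.export` or the MNN converter CLI).

```python
# Pure-Python stand-ins for the conversion pipeline; names are illustrative.
def export_to_onnx(model):
    """Stand-in for a framework exporter: framework model -> ONNX graph."""
    return {"format": "onnx", "graph": model}


def convert_to_edge_format(onnx_model, backend):
    """Stand-in for an mnn/ncnn/tnn converter: ONNX -> edge runtime format."""
    assert onnx_model["format"] == "onnx", "converter expects an ONNX input"
    return {"format": backend, "graph": onnx_model["graph"]}


trained = {"layers": ["conv", "relu", "fc"]}  # pretend framework model
edge_model = convert_to_edge_format(export_to_onnx(trained), backend="mnn")
print(edge_model["format"])  # mnn
```

Keeping ONNX as the single interchange point is what lets the same trained model target any of the candidate edge backends.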
