Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Service interfaces for ML accelerators #1241

Open
AutonomicPerfectionist opened this issue Jan 8, 2023 · 1 comment
Open

Service interfaces for ML accelerators #1241

AutonomicPerfectionist opened this issue Jan 8, 2023 · 1 comment

Comments

@AutonomicPerfectionist
Copy link
Contributor

With the prevalence of hardware machine learning accelerators, we should have services that can take advantage of them. Currently, however, interfaces don't exist to support the generic implementation of such services.

We should discuss the overall requirements of such interfaces. The following are what I believe is the bare minimum:

  1. Model loading from either the filesystem or as a byte stream. Alternatively, have the service scan for models in a specified directory and list the available models via a method call
  2. List types of supported models, such as tensorflow or caffe.
  3. Raw inputs and outputs of the model execution. I believe a float list should suffice for this
  4. Inputs and outputs represented as common object types, such as images or strings. This could be selected based on the model set. For example: instead of passing a raw filename, we could pass a composite object that contains the filename as well as the model input type and output type. Input methods would just be overloaded based on input type but output/publishing methods would need a separate publishOutput{type}() method for each type. Only one input overload/publishing method pair would function at a time
@supertick
Copy link
Contributor

Sounds great 👍
breadth scan at a known relative location data/... sounds good
The last one sounds good to .. having the ability to route by type is something downstream apps could utilize.
The rest I don't know much, as I haven't worked much with dnns tensorflow or caffe models

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants