New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add mindir
format support.
#1227
base: main
Are you sure you want to change the base?
Conversation
@Ethan-Chen-plus can you provide some context:
|
9a4785e
to
1a817ad
Compare
1f388bc
to
0cfad16
Compare
994aca4
to
0de981a
Compare
PS I:\netron> npm run validate
eslint
pylint
test
gguf/phi-2.Q2_K.gguf
onnx/candy.onnx
keras/1151.4.keras
coreml/Exermote.mlmodel
pytorch/alexnet.ptl
pytorch/DCGAN2.pt
tf/conv-layers.pb.zip
tflite/mobilenet_v1_0.75_160_quantized.tflite
tflite/squeezenet.tflite I have updated my code, please review. |
@Ethan-Chen-plus can you provide some context:
|
66d40fe
to
19e4123
Compare
8abf19b
to
dc7a947
Compare
1e39f71
to
6c74b13
Compare
35e69f7
to
9787a2f
Compare
@lutzroeder Thank you again and we have updated our code. And let me introduce some background infomation:
The user base for the MindIR format primarily comprises researchers, developers, and enterprises needing to deploy AI models across multiple hardware platforms. Given its compatibility with diverse hardware including Ascend AI processors, GPUs, and CPUs, MindIR is particularly well-suited for applications that require deployment both in the cloud and on edge devices. For instance, MindIR can be utilized in Huawei's Atlas 200/300/500 inference products, which are specifically designed for edge AI processing scenarios. -Why is a new format needed given there are many similar formats? Compared to other similar IR formats such as ONNX and AIR, MindIR presents unique advantages and application scenarios. ONNX (Open Neural Network Exchange) is a universal format designed for the expression of machine learning models, primarily used for model transfer between different frameworks or on inference engines like TensorRT. AIR (Ascend Intermediate Representation), defined by Huawei, is an open file format tailored for machine learning to better accommodate Huawei AI processors, typically used for inference tasks on Ascend 310. MindIR, on the other hand, is a functional IR based on graph representation. It not only defines an extensible graph structure and operator IR representations but also eliminates model discrepancies across different backends. It is generally used for cross-platform inference tasks, such as executing inference on Ascend 310, GPUs, and MindSpore Lite for models trained on Ascend 910. |
514284f
to
0250d3e
Compare
@lutzroeder Hello.馃憢 I noticed that eslint failed. Could you please tell me which config file eslint is using? I'll make the necessary fixes based on that config file. Thank you! |
952b6ed
to
2781faa
Compare
Workflow requires approval, thank you馃 |
b448189
to
1d342d8
Compare
d9a23ec
to
5028ef6
Compare
馃馃憢Hi! We have added our
.mindir
format. Please review our commit.