
LCEInterpreter and converter design #713

Open
simonmaurer opened this issue Feb 17, 2022 · 1 comment

Comments

@simonmaurer
Contributor

The LCE interpreter from lce.testing.Interpreter is a standalone class and exposes different properties of the quantized model (for example, scale and zero-point). The converter, on the other hand, is built upon the two methods convert_keras_model/convert_saved_model.
Can you elaborate on your design decision?

This boils down to two follow-up questions:

  1. How could one use different delegates in the LCE interpreter (e.g. use_xnnpack=True, as in the command-line parameters of lce_benchmark_model)?
  2. How could support be added for the LCE converter to use options such as tf.lite.OpsSet.SELECT_TF_OPS (as in regular TFLite)?

Could you give me a hint on this?

@Tombana
Collaborator

Tombana commented Feb 17, 2022

  1. Coincidentally, I happened to be looking into this today, and it seems to work as follows:

The CLI flag use_xnnpack of lce_benchmark_model essentially decides which OpResolver to use: when use_xnnpack=false it uses BuiltinOpResolverWithoutDefaultDelegates; otherwise it uses BuiltinOpResolver. The op resolver has a GetDelegates() method, and BuiltinOpResolver will choose the XNNPACK delegate when it is enabled via Bazel build options, which it is by default.

So the simplest approach would be to add a use_xnnpack bool argument to the Python interpreter and then, on this line, choose BuiltinOpResolverWithoutDefaultDelegates whenever use_xnnpack == false:

resolver_ = std::make_unique<tflite::ops::builtin::BuiltinOpResolver>();
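A minimal sketch of that change, assuming a use_xnnpack flag has been threaded through to this point (the variable name is hypothetical). Since BuiltinOpResolverWithoutDefaultDelegates derives from BuiltinOpResolver, the same smart-pointer member can hold either:

```cpp
// Hypothetical sketch: pick the op resolver based on a new use_xnnpack flag.
// BuiltinOpResolverWithoutDefaultDelegates returns no default delegates from
// GetDelegates(), so the XNNPACK delegate is never applied in that case.
if (use_xnnpack) {
  resolver_ = std::make_unique<tflite::ops::builtin::BuiltinOpResolver>();
} else {
  resolver_ = std::make_unique<
      tflite::ops::builtin::BuiltinOpResolverWithoutDefaultDelegates>();
}
```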

To have more control over the options passed to XNNPACK (for example, in the TF master branch the int8 kernels can be enabled/disabled through these options), the easiest way seems to be to follow the instructions at "Enable XNNPACK via low level delegate API". I haven't tried it, but I would think the best approach is to always use BuiltinOpResolverWithoutDefaultDelegates and then use that low-level delegate API to set the desired XNNPACK options whenever use_xnnpack == true.
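Following those instructions, a sketch of what the low-level delegate API usage might look like (based on the header in tensorflow/lite/delegates/xnnpack/; the available option fields vary between TF versions, and the interpreter_ member name is assumed from context):

```cpp
#include "tensorflow/lite/delegates/xnnpack/xnnpack_delegate.h"

// Create the XNNPACK delegate explicitly instead of relying on the op
// resolver's default delegates (so BuiltinOpResolverWithoutDefaultDelegates
// can be used unconditionally).
TfLiteXNNPackDelegateOptions xnnpack_options =
    TfLiteXNNPackDelegateOptionsDefault();
xnnpack_options.num_threads = 4;  // configure as desired

TfLiteDelegate* xnnpack_delegate =
    TfLiteXNNPackDelegateCreate(&xnnpack_options);
if (interpreter_->ModifyGraphWithDelegate(xnnpack_delegate) != kTfLiteOk) {
  // Handle the error; the delegate could not be applied to the graph.
}
// Once the interpreter has been destroyed, release the delegate:
// TfLiteXNNPackDelegateDelete(xnnpack_delegate);
```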

  2. I have never looked at the Select TF Ops mechanism of TFLite, so I'm afraid I can't give any hints for this.
