
Releases: Samsung/ONE

ONE Release 1.26.0

04 Jan 08:20
3f51fd8

Release Note 1.26.0

ONE Compiler

  • Support more Op(s): HardSwish, CumSum, BroadcastTo
  • Support more optimization option(s): decompose_softmax, decompose_hardswish, fuse_slice_with_tconv,
    fuse_mul_with_conv, remove_unnecessary_add, fuse_horizontal_fc_layers, common_subexpression_elimination,
    remove_unnecessary_transpose
  • one-quantize supports more options (illustrated in the sketch after this list)
    • Requantization option to convert a TF2-quantized int8 model to a uint8 model (--requantize)
    • A new option to automatically find a mixed-precision configuration (--ampq)
    • A new option to save calibrated min/max values (--save_min_max)
    • New parameters for moving-average calibration (--moving_avg_batch, --moving_avg_const)
  • Introduce q-implant that writes quantization parameters and weights into the circle model
  • Introduce minmax-embedder that embeds min/max values into the circle model
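
As a rough illustration of the new quantization options, the sketch below drives one-quantize from Python. The option names come from this release note; the --input_path, --output_path, and --input_data flags, file names, and parameter values are assumptions for illustration, so check one-quantize --help for the exact interface.

    import subprocess

    # Requantization: convert a TF2-quantized int8 model to a uint8 model.
    # --requantize is from the 1.26.0 notes; the path flags are assumed.
    subprocess.run(
        ["one-quantize", "--requantize",
         "--input_path", "model.q8.circle",
         "--output_path", "model.u8.circle"],
        check=True,
    )

    # Calibrate with the new moving-average parameters and save the
    # calibrated min/max values; batch size and constant are illustrative.
    subprocess.run(
        ["one-quantize",
         "--input_path", "model.circle",
         "--input_data", "calib.h5",
         "--moving_avg_batch", "16",
         "--moving_avg_const", "0.05",
         "--save_min_max",
         "--output_path", "model.q8.circle"],
        check=True,
    )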

ONE Release 1.24.1

12 Oct 10:31

Release Note 1.24.1

ONE Compiler

  • Update the error message of the rawdata2hdf5 test

ONERT-MICRO 1.0.0

27 Sep 00:05
4d5a78f

Asset: onert-micro-cortexm.tar.gz

Release Notes for onert-micro 1.0

Supported operations

More operations are supported as follows:

  • AveragePool2D, Elu, Exp, Abs, Neg, Div, AddN, Relu, Relu6, Leaky_Relu, Pad, PadV2, ArgMin, ArgMax, Resize_Bilinear, LogicalAnd, LogicalOr, Equal, NotEqual, Greater, GreaterEqual, LessEqual

Etc

  • Address sanitizer build option (ENABLE_SANITIZER) is added
  • Fix buffer overflows reported by the static analyzer

ONE Release 1.25.0

08 Sep 10:11
f0a308f

Release Note 1.25.0

ONE Runtime

  • Support Ubuntu 20.04

CPU Backend Operation

  • CPU backend supports per-channel hybrid quantization with int8 weights and float activations (TFLite's dynamic range quantization)

On-device Quantization

  • onert supports a new experimental API for on-device quantization.
  • As a first step, onert supports per-channel hybrid quantization with int8/int16 weights and float activations.
  • The API requires a file path to export the quantized model (see the sketch below).
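
The flow might look like the following from Python via ctypes. nnfw_create_session, nnfw_load_model_from_file, and nnfw_close_session are the stable NNFW C API; the library name and the quantization entry points shown here are assumptions, since this note describes the experimental API without naming it.

    import ctypes

    # Library name is an assumption; onert ships the NNFW C API as a
    # shared library, whose name depends on the build/packaging.
    lib = ctypes.CDLL("libnnfw-dev.so")

    session = ctypes.c_void_p()
    lib.nnfw_create_session(ctypes.byref(session))
    lib.nnfw_load_model_from_file(session, b"model.circle")

    # Hypothetical experimental calls: per this note, the API takes a file
    # path to export the quantized model; the exact names live in onert's
    # experimental header and may differ.
    lib.nnfw_set_quantized_model_path(session, b"model.q8.circle")
    lib.nnfw_quantize(session)

    lib.nnfw_close_session(session)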

Minmax Recorder

  • onert supports min/max recording of each layer as an experimental feature. It is not exposed through the API yet.
  • The output file format is HDF5. (The file format may change later.)

ONE Release 1.24.0

18 Jul 15:15
9abbed1

Release Note 1.24.0

ONE Compiler

  • Introduce one-import-onnx extension interface
  • onecc supports profiling of multiple backends with a single cfg file
  • Enable more Quantize operators: FloorMod, Squeeze
  • visq supports multi-out nodes
  • onecc introduces the dynamic_batch_to_single_batch option

ONERT-MICRO 0.1.0

09 Jun 13:55
72da3b8

Release Notes for onert-micro 0.1.0

onert-micro is a tiny runtime specialized for running NN models on MCU boards. Note that onert-micro is under active development and is subject to change.

Supported operations

For MCU boards, the following 22 operations are supported:

ADD, FULLY_CONNECTED, CONV_2D, LOGISTIC, GATHER, EXPAND_DIMS, PACK, RESHAPE, REDUCE_PROD, LESS, MUL, MAX_POOL_2D, CONCATENATION, SHAPE, SLICE, SUB, SPLIT, STRIDED_SLICE, TANH, SOFTMAX, WHILE, UNIDIRECTIONAL_SEQUENCE_LSTM

RNN Model

LSTM

onert-micro supports Keras models with LSTM operations, but they must be converted to the UNIDIRECTIONAL_SEQUENCE_LSTM operation in the circle format (see the sketch below).
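
For reference, a minimal sketch of that conversion path, assuming the standard TF2 tooling: the TFLite converter normally fuses a Keras LSTM layer into a single UNIDIRECTIONAL_SEQUENCE_LSTM op, and the resulting .tflite can then be imported into circle format with ONE's tools. Shapes and file names are illustrative.

    import tensorflow as tf

    # Toy Keras model with an LSTM layer; shapes are illustrative.
    model = tf.keras.Sequential([
        tf.keras.layers.LSTM(20, input_shape=(28, 28)),
        tf.keras.layers.Dense(10, activation="softmax"),
    ])

    # The TF2 converter fuses the Keras LSTM into a single
    # UNIDIRECTIONAL_SEQUENCE_LSTM op in the .tflite output.
    converter = tf.lite.TFLiteConverter.from_keras_model(model)
    with open("lstm.tflite", "wb") as f:
        f.write(converter.convert())

    # The .tflite can then be imported into circle format, e.g. with
    # ONE's one-import-tflite or tflite2circle (flags assumed):
    #   one-import-tflite --input_path lstm.tflite --output_path lstm.circle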

GRU

onert-micro supports models with GRU operations converted from Keras models. Please refer to #10465 for the GRU operation supported by onert-micro.

Benchmark

onert-micro shows better performance than tflite-micro, especially in memory consumption and binary size.

The measurements were taken on TizenRT, running reference models on a development board with the following specifications:

  • 32-bit Arm Cortex-M33 200MHz
  • 4MB RAM, 8MB Flash

Commit for measurement:

L model

  Params                               tflite-micro   onert-micro
  Execution time (us)*                    2,912,700     2,953,000
  RAM consumption (bytes)                   126,800        93,376
  Binary file size overhead (bytes)          57,676        32,248

T1 model

  Params                               tflite-micro   onert-micro
  Execution time (us)*                        1,340         1,510
  RAM consumption (bytes)                     1,640         1,152
  Binary file size overhead (bytes)          35,040        19,432

T2 model

  Params                             tflite-micro**   onert-micro
  Execution time (us)*                          N/A         5,090
  RAM consumption (bytes)                       N/A         3,360
  Binary file size overhead (bytes)             N/A        30,488

Model with GRU operations

  Params                             tflite-micro**   onert-micro
  Execution time (us)*                          N/A       335,000
  RAM consumption (bytes)                       N/A        14,816
  Binary file size overhead (bytes)             N/A        43,444

(*) Average over 100 inferences
(**) tflite-micro could not launch this model

ONE Release 1.23.0

19 May 07:09
3e7b9c6

Release Note 1.23.0

ONE Compiler

  • Support more Op(s): GeLU
  • Support more option(s): --fuse-gelu
  • Support compilation for multiple backends with a single configuration file
  • Upgrade Circle schema to 0.5

ONE Release 1.22.1

26 Apr 01:25
4187e74

Release Note 1.22.1

ONE Runtime

Multimodel nnpackage

  • Runtime supports running an nnpackage with three or more models
  • Runtime supports running a multimodel nnpackage with multiple subgraphs
  • Runtime supports type casting when the tensor data types on an edge differ between models
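
As a sketch of what a multimodel package might look like, the snippet below writes a MANIFEST for a three-model nnpackage. The key names and directory layout follow ONE's nnpackage spec as I understand it; treat them as assumptions and verify against the spec in the repository.

    import json
    import os

    # Hypothetical MANIFEST for a 3-model nnpackage; layout and version
    # numbers are assumptions, per the nnpackage packaging spec.
    manifest = {
        "major-version": "1",
        "minor-version": "2",
        "patch-version": "0",
        "models": ["a.circle", "b.circle", "c.circle"],
        "model-types": ["circle", "circle", "circle"],
    }
    os.makedirs("mypackage/metadata", exist_ok=True)
    with open("mypackage/metadata/MANIFEST", "w") as f:
        json.dump(manifest, f, indent=2)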

ONE Release 1.22.0

28 Mar 04:43
cb6d356

Release Note 1.22.0

ONE Compiler

  • Introduce new optimization options: unroll_unidirseqlstm, forward_transpose_op, fold_fully_connected, fuse_prelu
  • Support more Ops for fake quantization: Depth2Space, Space2Depth, Pack, Unpack, Abs
  • Support more Ops for quantization: Abs, ReduceProd
  • Introduce visq tool for quantization error visualization
  • Introduce Environment section into configuration file (see the sketch after this list)
  • Improve speed of convert_nchw_to_nhwc option
  • Support Add, Mul of index-type (int32, int64) tensors in one-quantize
  • Support Ubuntu 20.04
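
A sketch of what a configuration file with the new Environment section might look like, generated here with Python's configparser. Only the Environment section name comes from this note; the variable and the other sections are illustrative assumptions modeled on the usual onecc cfg layout.

    import configparser

    cfg = configparser.ConfigParser()
    cfg.optionxform = str  # preserve case of environment variable names

    # [Environment] holds variables to set for the compilation run
    # (the variable below is a made-up example).
    cfg["Environment"] = {"EXAMPLE_BACKEND_VAR": "1"}
    cfg["onecc"] = {"one-optimize": "True"}
    cfg["one-optimize"] = {
        "input_path": "model.circle",
        "output_path": "model.opt.circle",
    }

    with open("onecc.cfg", "w") as f:
        cfg.write(f)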

ONE Release 1.21.0

07 Sep 04:35
3d20228

Release Note 1.21.0

ONE Compiler

  • Support unrolling of LSTM and RNN Ops in one-import-onnx tool
  • Introduce new tools: one-infer, circle-operator, circle-interpreter
  • Introduce Workflow (WIP) in one-cmds
  • New option quant_config in one-quantize
  • New option fake_quantize in one-quantize
  • More Ops supported: Densify
  • More Ops for quantization: ReduceMax
  • More Ops for mixed-precision quantization (MPQ): LeakyRelu, Neg, Relu6, Squeeze
  • More Ops for convert_nchw_to_nhwc option: LogSoftmax, ReduceMax, SplitV, Softmax
  • New optimization options in one-optimize: replace_non_const_fc_with_bmm, resolve_customop_splitv, fold_densify
  • Improve reshape elimination in convert_nchw_to_nhwc option
  • Support fusion of Channel-wise Add + Relu with TConv
  • Support negative axis in ArgMin/Max
  • Show errors for unrecognized options in one-optimize
  • Fix shape inference for StridedSlice
  • Fix FuseBatchNormWithTConvPass to support TConv with bias
  • Deprecate --O1 option in circle2circle
  • Support gcc-11
  • Support limited Float16 for kernel constants with dequantization to Float32

ONE Runtime

Basic Multimodel nnpackage

  • Runtime supports running an nnpackage with two models

Channel Wise Quantization on Conv2D and Depthwise Conv2D

  • Conv2D and Depthwise Conv2D support per-channel quantization of uint8 type.

Batch Execution with TRIX backend

  • TRIX backend supports batch execution, which runs in parallel on multiple cores