
Releases: quic/aimet

version 1.31.0

25 Mar 17:57
Release of the AI Model Efficiency Toolkit (AIMET) package
User guide: https://quic.github.io/aimet-pages/releases/1.31.0/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.31.0/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html

version 1.30.0

17 Jan 10:39

What's New

ONNX

  • Upgraded AIMET to support ONNX version 1.14 and ONNX Runtime version 1.15.
  • Added support for AutoQuant.
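
As orientation for this feature, here is a minimal sketch of a typical AutoQuant run. It is modeled on the PyTorch AutoQuant interface; the aimet_onnx module path, constructor arguments, and return values are assumptions, so check the API documentation linked below for the real signatures.

```python
# Sketch only: the aimet_onnx module path and every signature below are
# assumptions modeled on the PyTorch AutoQuant interface; consult the API
# documentation for the real names.
from aimet_onnx.auto_quant_v2 import AutoQuant  # assumed module path

def eval_callback(session, _):
    """Return model accuracy measured through an ONNX Runtime session (user-defined)."""
    ...

# onnx_model: onnx.ModelProto, dummy_input: dict of input name -> np.ndarray,
# data_loader: iterable of unlabeled calibration batches (all user-supplied).
auto_quant = AutoQuant(onnx_model, dummy_input, data_loader, eval_callback)

# AutoQuant searches calibration schemes and applies PTQ steps (e.g. AdaRound)
# until the accuracy drop stays within the allowed budget.
model, accuracy, encoding_path = auto_quant.optimize(allowed_accuracy_drop=0.01)
```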

Documentation

User guide: https://quic.github.io/aimet-pages/releases/1.30.0/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.30.0/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html

version 1.29.0

29 Nov 22:00

What's New

Keras

  • Fixed issues with TF Op Lambda layers in the QcQuantizeWrapper call.

PyTorch

  • [experimental] Support for embedding AIMET encodings within the graph using ONNX quantize/dequantize operators. Currently, this option is supported only with 8-bit per-tensor quantization.
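
A minimal sketch of what using this option might look like; the `use_embedded_encodings` keyword shown here is an assumption, so confirm the exact flag name in the QuantizationSimModel.export API documentation.

```python
import torch
from aimet_torch.quantsim import QuantizationSimModel

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
dummy_input = torch.randn(1, 3, 32, 32)

def forward_pass(model, _):
    # Calibration pass over representative data.
    with torch.no_grad():
        model(dummy_input)

# 8-bit per-tensor quantization, the only mode this option currently supports.
sim = QuantizationSimModel(model, dummy_input,
                           default_param_bw=8, default_output_bw=8)
sim.compute_encodings(forward_pass, None)

# Export with encodings embedded as ONNX QuantizeLinear/DequantizeLinear ops.
sim.export('/tmp', 'model_qdq', dummy_input,
           use_embedded_encodings=True)  # assumed keyword name
```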

ONNX

  • Added support for AdaRound.
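
A sketch of an AdaRound call, mirrored from the aimet_torch AdaRound API; the aimet_onnx module path and parameter names are assumptions.

```python
# Sketch only: the aimet_onnx module path and signatures mirror the aimet_torch
# AdaRound API and are assumptions; check the API docs for the exact names.
from aimet_onnx.adaround.adaround_weight import Adaround, AdaroundParameters

# data_loader yields unlabeled calibration batches (user-supplied).
params = AdaroundParameters(data_loader=data_loader, num_batches=4)

# AdaRound learns a per-weight rounding decision (up vs. down) instead of
# using round-to-nearest, which typically recovers accuracy at low bit-widths.
ada_model = Adaround.apply_adaround(onnx_model, params,
                                    path='/tmp', filename_prefix='adaround',
                                    default_param_bw=8)
# Load and freeze the produced encodings in QuantSim before export.
```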

TensorFlow

  • No significant updates

Documentation

User guide: https://quic.github.io/aimet-pages/releases/1.29.0/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.29.0/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html

version 1.28.1

20 Oct 23:54
Release of the AI Model Efficiency Toolkit (AIMET) package
User guide: https://quic.github.io/aimet-pages/releases/1.28.1/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.28.1/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html

version 1.28.0

06 Sep 10:04

What's New

Keras

  • Added support for the Spatial SVD compression feature (see the sketch after this list).
  • [experimental] Debugging APIs have been added for dumping intermediate tensor outputs. This data can be used with current QNN/SNPE tools for debugging accuracy problems.
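
A sketch of what driving the new Spatial SVD compression might look like, based on the existing aimet_tensorflow compression interface; the Keras module path and the parameters object are assumptions.

```python
# Sketch only: the Keras module path below is an assumption based on the
# existing aimet_tensorflow compression API; consult the API docs.
from aimet_common.defs import CompressionScheme, CostMetric
from aimet_tensorflow.keras.compress import ModelCompressor  # assumed path

def eval_callback(model, iterations):
    """Return model accuracy over the evaluation set (user-defined)."""
    ...

# spatial_svd_params: a SpatialSvdParameters object selecting target layers
# and the desired compression ratio (construction omitted, user-supplied).
compressed_model, stats = ModelCompressor.compress_model(
    model,                       # trained tf.keras.Model (user-supplied)
    eval_callback, eval_iterations=10,
    compress_scheme=CompressionScheme.spatial_svd,
    cost_metric=CostMetric.mac,
    parameters=spatial_svd_params)
print(stats)  # per-layer compression ratios achieved
```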

PyTorch

  • Upgraded the AIMET PyTorch default version to 1.13. AIMET remains compatible with PyTorch version 1.9.

ONNX

  • [experimental] Debugging APIs have been added for dumping intermediate tensor outputs. This data can be used with current QNN/SNPE tools for debugging accuracy problems.

TensorFlow

  • No significant updates

Documentation

User guide: https://quic.github.io/aimet-pages/releases/1.28.0/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.28.0/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html

version 1.27.0

28 Jul 19:14

What's New

Keras

  • Updated Batch Norm folding to support TFOpLambda layers with extra call args/kwargs.

PyTorch

  • Upgraded AIMET to support PyTorch version 1.13.0. Only ONNX opset 14 is supported for export.
  • [experimental] Debugging APIs have been added for dumping intermediate tensor data, which can be used with current QNN/SNPE tools for debugging accuracy problems (see the sketch after this list). Known issue: the Layer Output Generation API gives incorrect tensor data for the layer just before a ReLU when used on the original FP32 model.
  • [experimental] Support for embedding AIMET encodings within the graph using ONNX quantize/dequantize operators. Currently, this option is supported only with 8-bit per-tensor quantization.
  • Fixed a bug in AIMET QuantSim for PyTorch models to handle non-contiguous tensors.
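
A sketch of the layer-output debugging flow described above; the LayerOutputUtil name and its signature are assumptions based on the aimet_torch module layout.

```python
# Sketch only: class name and signature are assumptions; check the API docs.
import torch
from aimet_torch.layer_output_utils import LayerOutputUtil  # assumed path

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
input_batch = torch.randn(4, 3, 32, 32)

# Dump every layer's output tensor to disk; the matching dump from QNN/SNPE
# can then be diffed against these files to locate where accuracy diverges.
util = LayerOutputUtil(model, dir_path='/tmp/layer_outputs')
util.generate_layer_outputs(input_batch)
```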

ONNX

  • Added AIMET support for ONNX 1.11.0. However, op support in QNN/SNPE is currently limited; if the model fails to load, continue to use opset 11 for export (see the sketch below).
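
A sketch of pinning the export opset back to 11, assuming the OnnxExportApiArgs helper and the onnx_export_args keyword from aimet_torch; verify the exact names in the API documentation.

```python
# Sketch: pin the ONNX export opset to 11 for QNN/SNPE compatibility.
# OnnxExportApiArgs and the onnx_export_args keyword are assumptions here.
import torch
from aimet_torch.quantsim import QuantizationSimModel
from aimet_torch.onnx_utils import OnnxExportApiArgs

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
dummy_input = torch.randn(1, 3, 32, 32)

sim = QuantizationSimModel(model, dummy_input)
sim.compute_encodings(lambda m, _: m(dummy_input), None)
sim.export('/tmp', 'model_opset11', dummy_input,
           onnx_export_args=OnnxExportApiArgs(opset_version=11))
```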

TensorFlow

  • [experimental] Debugging APIs have been added for dumping intermediate tensor outputs. This data can be used with current QNN/SNPE tools for debugging accuracy problems.

Documentation

User guide: https://quic.github.io/aimet-pages/releases/1.27.0/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.27.0/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html

version 1.26.1

12 Jul 00:30

What's New

TensorFlow

  • Upgraded AIMET to support TensorFlow version 2.10.1 (AIMET remains compatible with TensorFlow 2.4).
  • Several bug fixes

Common

  • Upgraded to Ubuntu 20 base image for all variants.

Documentation

User guide: https://quic.github.io/aimet-pages/releases/1.26.1/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.26.1/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html

version 1.26.0

12 May 22:34

What's New

Keras

  • Added a feature called BN Re-estimation that can improve model accuracy after QAT for INT4 quantization (see the sketch after this list).
  • Updated the AutoQuant feature to automatically choose the optimal calibration scheme and create an HTML report detailing which optimizations were applied.
  • Updated the Model Preparer to replace separable convolutions with depthwise and pointwise conv layers.
  • Fixed the BN fold implementation to account for a subsequent multi-input layer.
  • Fixed a bug where min/max encoding values were not aligned with scale/offset during QAT.
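
A sketch of where BN Re-estimation fits in the QAT flow; the Keras module path and argument names are assumptions mirrored from the PyTorch variant.

```python
# Sketch only: module path and signature are assumptions mirrored from the
# PyTorch bn_reestimation API; check the Keras API docs for the exact names.
from aimet_tensorflow.keras.bn_reestimation import reestimate_bn_stats

# sim: a QuantizationSimModel already fine-tuned with QAT, and bn_dataset:
# a small tf.data.Dataset of training batches (both user-supplied).
# Re-estimating BatchNorm statistics before folding/export typically recovers
# accuracy that INT4 QAT would otherwise lose.
reestimate_bn_stats(sim.model, bn_dataset, bn_num_batches=100)  # assumed args
```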

PyTorch

  • Several bug fixes

TensorFlow

  • Added a feature called BN Re-estimation that can improve model accuracy after QAT for INT4 quantization.
  • Updated the AutoQuant feature to automatically choose the optimal calibration scheme and create an HTML report detailing which optimizations were applied.
  • Fixed a bug where min/max encoding values were not aligned with scale/offset during QAT.

Common

  • Documentation updates for taking AIMET models to target.
  • Converted standalone BatchNorm layers' parameters so that they behave as linear/dense layers.

Experimental

  • Added new Architecture Checker feature to identify and report model architecture constructs that are not ideal for quantized runtimes. Users can utilize this information to change their model architectures accordingly.
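
A sketch of invoking the checker on a PyTorch model; the module path and method name are assumptions based on the aimet_torch layout, and the checks listed in the comment are illustrative.

```python
# Sketch only: module path and method name are assumptions; check the API docs.
import torch
from aimet_torch.arch_checker.arch_checker import ArchChecker  # assumed path

model = torch.nn.Sequential(torch.nn.Conv2d(3, 5, 3), torch.nn.ReLU()).eval()
dummy_input = torch.randn(1, 3, 32, 32)

# Scans the model graph and logs constructs that are suboptimal for quantized
# runtimes, e.g. convolutions with very few channels or unfused activations.
ArchChecker.check_model_arch(model, dummy_input)
```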

Documentation

User guide: https://quic.github.io/aimet-pages/releases/1.26.0/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.26.0/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html

version 1.25.0

09 Mar 23:14

What's New

Keras

  • Added the QuantAnalyzer feature (see the sketch after this list).
  • Added Batch Normalization folding for functional Keras models. This allows the default config files to work for supergroups.
  • Resolved an issue with quantizer placement in Sequential blocks in subclassed models.
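
A sketch of running QuantAnalyzer on a Keras model, mirroring the PyTorch QuantAnalyzer API; the Keras module path and the CallbackFunc usage are assumptions.

```python
# Sketch only: the Keras module path and CallbackFunc usage mirror the PyTorch
# QuantAnalyzer API and are assumptions; check the API docs for exact names.
import tensorflow as tf
from aimet_common.utils import CallbackFunc                      # assumed import
from aimet_tensorflow.keras.quant_analyzer import QuantAnalyzer  # assumed path

model = tf.keras.Sequential([tf.keras.layers.Conv2D(8, 3, input_shape=(32, 32, 3)),
                             tf.keras.layers.ReLU()])

def forward_pass(model, _):
    """Calibration pass over representative data (user-defined)."""
    ...

def evaluate(model, _):
    """Return model accuracy on the evaluation set (user-defined)."""
    ...

analyzer = QuantAnalyzer(model,
                         forward_pass_callback=CallbackFunc(forward_pass),
                         eval_callback=CallbackFunc(evaluate))
# Writes per-layer sensitivity data and plots under results_dir, pointing at
# the layers responsible for most of the quantization accuracy drop.
analyzer.analyze(default_param_bw=8, default_output_bw=8,
                 results_dir='/tmp/quant_analyzer')
```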

PyTorch

  • Added AutoQuant V2, which includes advanced features such as out-of-the-box inference, model preparer, quant scheme search, and an improved summary report (see the sketch after this list).
  • Fixes to resolve minor accuracy diffs in the learnedGrid quantizer for per-channel quantization.
  • Fixes to improve EfficientNetB4 accuracy with respect to target.
  • Fixed rare case where quantizer may calculate incorrect offset when generating QAT 2.0 learned encodings
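
A sketch of the AutoQuant V2 flow; the module path shown is an assumption for this release, and the unlabeled data loader and eval callback are user-supplied.

```python
# Sketch only: the module path is an assumption for this release; the V2
# interface shown (run_inference/optimize) follows the documented flow.
import torch
from aimet_torch.auto_quant_v2 import AutoQuant  # assumed module path

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
dummy_input = torch.randn(1, 3, 32, 32)

def eval_callback(model, _):
    """Return model accuracy on a labeled evaluation set (user-defined)."""
    ...

# unlabeled_data_loader: torch DataLoader of calibration inputs (user-supplied).
auto_quant = AutoQuant(model, dummy_input, unlabeled_data_loader, eval_callback)

_, baseline_accuracy = auto_quant.run_inference()  # out-of-the-box quantization
model, accuracy, encoding_path = auto_quant.optimize(allowed_accuracy_drop=0.01)
```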

TensorFlow

  • Added QuantAnalyzer feature
  • Fixed an accuracy issue due to rare cases where the incorrect BN epsilon was being used
  • Fixed an accuracy issue due to Quantsim export incorrectly recomputing QAT2.0 encodings

Common

  • Updated the AIMET Python package version format to support the latest pip
  • Fixed an issue where not all inputs might be quantized properly

Documentation

User guide: https://quic.github.io/aimet-pages/releases/1.25.0/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.25.0/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html

version 1.24.0

20 Jan 00:18

What's New

  • Added the ability to export the quantsim configuration for configuring quantization on downstream targets

PyTorch

  • Fixes to resolve minor accuracy diffs in the learnedGrid quantizer for per-channel quantization.
  • Added support for AMP 2.0, which enables faster automatic mixed precision.
  • Added support for QAT for INT4 quantized models, including a feature for performing BN Re-estimation after QAT (see the sketch after this list).
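
A sketch of INT4 QAT followed by BN Re-estimation; the bn_reestimation and batch_norm_fold module paths are assumptions based on the aimet_torch layout, and the training loop and train_loader are user-supplied.

```python
# Sketch of INT4 QAT with BN Re-estimation; module paths marked below are
# assumptions, so check the API docs for the exact names.
import torch
from aimet_torch.quantsim import QuantizationSimModel
from aimet_torch.bn_reestimation import reestimate_bn_stats            # assumed
from aimet_torch.batch_norm_fold import fold_all_batch_norms_to_scale  # assumed

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3),
                            torch.nn.BatchNorm2d(8),
                            torch.nn.ReLU()).eval()
dummy_input = torch.randn(1, 3, 32, 32)

def calibration(model, _):
    with torch.no_grad():
        model(dummy_input)

sim = QuantizationSimModel(model, dummy_input,
                           default_param_bw=4,   # INT4 weights
                           default_output_bw=8)
sim.compute_encodings(calibration, None)

# ... fine-tune sim.model with the usual training loop (QAT) ...

# Re-estimate BN statistics on a few training batches (train_loader is
# user-supplied), then fold BN layers into quantized scales before export.
reestimate_bn_stats(sim.model, train_loader, num_batches=100)
fold_all_batch_norms_to_scale(sim)
sim.export('/tmp', 'int4_model', dummy_input)
```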

Keras

  • Added support for AMP 2.0, which enables faster automatic mixed precision (see the sketch after this list).
  • Added support for basic transformer networks.
  • Added support for subclassed models. The current subclassing feature includes support for only a single level of subclassing and does not support lambdas.
  • Added QAT per-channel gradient support
  • Minor updates to the quantization configuration
  • Fixed QuantSim bug where layers using dtypes other than float were incorrectly quantized
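
A sketch of an AMP invocation; the names follow the aimet_torch mixed-precision API and are assumptions for the Keras variant, and the candidate list shown is illustrative.

```python
# Sketch only: names follow the aimet_torch mixed-precision API and are
# assumptions for the Keras variant; check the AMP API docs for exact names.
from aimet_common.defs import QuantizationDataType
from aimet_common.utils import CallbackFunc
from aimet_tensorflow.keras.mixed_precision import choose_mixed_precision  # assumed

def evaluate(model, _):
    """Return model accuracy on the evaluation set (user-defined)."""
    ...

# Candidate (activation, parameter) bit-width/dtype pairs that AMP may assign
# per layer group, ordered from cheapest to most accurate.
candidates = [((8, QuantizationDataType.int), (8, QuantizationDataType.int)),
              ((16, QuantizationDataType.int), (8, QuantizationDataType.int))]

# sim: a QuantizationSimModel with computed encodings (user-supplied). AMP
# greedily raises the precision of the most sensitive layer groups until the
# accuracy drop fits the budget.
choose_mixed_precision(sim, candidates,
                       CallbackFunc(evaluate), CallbackFunc(evaluate),
                       allowed_accuracy_drop=0.01, results_dir='/tmp/amp')
```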

TensorFlow

  • Added an additional PReLU mapping pattern to ensure proper folding and quantsim node placement
  • Fixed the per-channel encoding representation to align with PyTorch and Keras

Documentation

User guide: https://quic.github.io/aimet-pages/releases/1.24.0/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.24.0/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html