Releases: IntelLabs/Model-Compression-Research-Package

v0.1.0 Model Compression Research Package Initial Release

23 Nov 18:51

This is the first release of Intel Labs' Model Compression Research Package. The current version includes implementations of model compression methods from previously published papers and from our own research papers:

  • Pruning, quantization and knowledge distillation methods and schedulers that can be applied to a variety of PyTorch models out of the box
  • Integration with the HuggingFace/transformers library for most of the available methods
  • Various examples showing how to use the library
  • Reproduction guide and scripts for the paper "Prune Once for All: Sparse Pre-Trained Language Models"
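
For readers new to the idea of a pruning method paired with a scheduler, the following is a minimal, framework-free sketch of gradual magnitude pruning. It does not use this package's API. The function names `cubic_sparsity` and `magnitude_prune`, the cubic ramp, and the toy weight list are all assumptions chosen for illustration, and the package's own methods and schedulers may work differently.

```python
def cubic_sparsity(step, total_steps, final_sparsity, initial_sparsity=0.0):
    """Hypothetical scheduler: target sparsity rises from initial to final along a cubic curve."""
    progress = min(max(step / total_steps, 0.0), 1.0)
    return final_sparsity + (initial_sparsity - final_sparsity) * (1.0 - progress) ** 3


def magnitude_prune(weights, sparsity):
    """Return a copy of `weights` with the smallest-magnitude fraction set to zero."""
    num_to_prune = int(len(weights) * sparsity)
    # Indices ordered by absolute value, smallest first.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:num_to_prune]:
        pruned[i] = 0.0
    return pruned


# Toy weights standing in for one layer of a model (illustrative values only).
weights = [0.9, -0.05, 0.3, -0.7, 0.01, 0.2, -0.4, 0.6]

for step in (0, 5, 10):
    target = cubic_sparsity(step, total_steps=10, final_sparsity=0.5)
    print(step, round(target, 4), magnitude_prune(weights, target))
```

In a real training loop the scheduler would typically be queried every few optimizer steps and the resulting mask applied to the model's tensors, so that the network can recover accuracy between pruning steps.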