add mx quant #1728

mengniwang95 · 2024-04-11T06:15:04Z

Type of Change

feature

Description

support mx quant

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

UT

Dependency Change?

any library dependency introduced or removed

github-actions · 2024-04-11T06:15:29Z

⛈️ Required checks status: Has failure 🔴

Warning
If you do not have the access to re-run the Probot, please contact XuehaoSun for help. If you push a new commit, all of the workflow will be re-triggered.

Groups summary

🟢 Code Scan Tests workflow

Check ID	Status
Code-Scan	success	✅
Code-Scan (Bandit Code Scan Bandit)	success	✅
Code-Scan (DocStyle Code Scan DocStyle)	success	✅
Code-Scan (Pylint Code Scan Pylint)	success	✅

These checks are required after the changes to neural_compressor/common/utils/constants.py, neural_compressor/torch/algorithms/mx_quant/__init__.py, neural_compressor/torch/algorithms/mx_quant/mx.py, neural_compressor/torch/algorithms/mx_quant/utils.py, neural_compressor/torch/quantization/__init__.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/quantization/config.py.

🟢 Model Tests 3x workflow

Check ID	Status
Model-Test-3x	success	✅
Model-Test-3x (Generate Report GenerateReport)	success	✅
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4)	success	✅
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_bnb)	success	✅
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_ggml)	success	✅

These checks are required after the changes to neural_compressor/common/utils/constants.py, neural_compressor/torch/algorithms/mx_quant/__init__.py, neural_compressor/torch/algorithms/mx_quant/mx.py, neural_compressor/torch/algorithms/mx_quant/utils.py, neural_compressor/torch/quantization/__init__.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/quantization/config.py.

🟢 Unit Tests 3x-TensorFlow workflow

Check ID	Status
UT-3x-TensorFlow	success	✅
UT-3x-TensorFlow (Coverage Compare CollectDatafiles)	success	✅
UT-3x-TensorFlow (Unit Test 3x TensorFlow Unit Test 3x TensorFlow)	success	✅
UT-3x-TensorFlow (Unit Test 3x TensorFlow baseline Unit Test 3x TensorFlow baseline)	success	✅

These checks are required after the changes to neural_compressor/common/utils/constants.py.

🔴 Unit Tests 3x-PyTorch workflow

Check ID	Status	Error details
UT-3x-Torch	failure		❌
UT-3x-Torch (Coverage Compare CollectDatafiles)	failure	download	❌
UT-3x-Torch (Unit Test 3x Torch Unit Test 3x Torch)	success		✅
UT-3x-Torch (Unit Test 3x Torch baseline Unit Test 3x Torch baseline)	success		✅

These checks are required after the changes to neural_compressor/common/utils/constants.py, neural_compressor/torch/algorithms/mx_quant/__init__.py, neural_compressor/torch/algorithms/mx_quant/mx.py, neural_compressor/torch/algorithms/mx_quant/utils.py, neural_compressor/torch/quantization/__init__.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/quantization/config.py, test/3x/torch/quantization/test_mx_quant.py.

🔴 Unit Tests 3x-ONNXRT workflow

Check ID	Status	Error details
UT-3x-ONNXRT	failure		❌
UT-3x-ONNXRT (Coverage Compare CollectDatafiles)	no_status		❓
UT-3x-ONNXRT (Unit Test 3x ONNXRT Unit Test 3x ONNXRT)	failure	download	❌
UT-3x-ONNXRT (Unit Test 3x ONNXRT baseline Unit Test 3x ONNXRT baseline)	failure	download	❌

These checks are required after the changes to neural_compressor/common/utils/constants.py.

Thank you for your contribution! 💜

Note
This comment is automatically generated and will be updates every 180 seconds within the next 6 hours. If you have any other questions, contact chensuyue or XuehaoSun for help.

xin3he

Could you please add a document to introduce the MX workflow and demo the usage?

neural_compressor/torch/algorithms/mx_quant/mx.py

...x_api/pytorch/nlp/huggingface_models/language-modeling/quantization/mx/run_clm_no_trainer.py

neural_compressor/torch/quantization/config.py

neural_compressor/torch/algorithms/mx_quant/mx.py

chensuyue · 2024-04-25T02:18:17Z

@mengniwang95 any update?

chensuyue · 2024-05-09T01:31:07Z

Does this feature target v2.6?