You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Calibrating batch 510
Calibrating batch 511
Quantization done. Total time used: 98.99 s.
Unknown model type Starcoder2ForCausalLM. Continue exporting...
torch.distributed not initialized, assuming single world_size.
torch.distributed not initialized, assuming single world_size.
torch.distributed not initialized, assuming single world_size.
torch.distributed not initialized, assuming single world_size.
torch.distributed not initialized, assuming single world_size.
torch.distributed not initialized, assuming single world_size.
torch.distributed not initialized, assuming single world_size.
torch.distributed not initialized, assuming single world_size.
current rank: 0, tp rank: 0, pp rank: 0
torch.distributed not initialized, assuming single world_size.
torch.distributed not initialized, assuming single world_size.
Cannot export model to the model_config. The AMMO optimized model state_dict (including the quantization factors) is saved to /tmp/checkpoint/ammo_model.0.pth using torch.save for further inspection.
Detailed export error: 'unknown:Starcoder2ForCausalLM'
Traceback (most recent call last):
File "/usr/local/lib/python3.10/dist-packages/ammo/torch/export/model_config_export.py", line 332, in export_tensorrt_llm_checkpoint
for tensorrt_llm_config, weights in torch_to_tensorrt_llm_checkpoint(
File "/usr/local/lib/python3.10/dist-packages/ammo/torch/export/model_config_export.py", line 278, in torch_to_tensorrt_llm_checkpoint
tensorrt_llm_config = convert_to_tensorrt_llm_config(model_config)
File "/usr/local/lib/python3.10/dist-packages/ammo/torch/export/tensorrt_llm_utils.py", line 78, in convert_to_tensorrt_llm_config
"architecture": MODEL_NAME_TO_HF_ARCH_MAP[decoder_type],
KeyError: 'unknown:Starcoder2ForCausalLM'
additional notes
I can provide if anyother information is needed
The text was updated successfully, but these errors were encountered:
System Info
Who can help?
@Tracin
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
python3 ../quantization/quantize.py --model_dir starcoder2
--dtype float16
--qformat fp8
--kv_cache_dtype fp8
--output_dir xxx
Expected behavior
it should output checkpoints with no error
actual behavior
additional notes
I can provide if anyother information is needed
The text was updated successfully, but these errors were encountered: