It just fails with the error:

mlflow.exceptions.MlflowException: The model that is attempting to be saved has been loaded into memory with an incompatible configuration. If you are using the accelerate library to load your model, please ensure that it is saved only after loading with the default device mapping. Do not specify `device_map` and please try again.

If I just remove device_map it works, but without device_map it is hard to train the model, since training is then not distributed. Does anyone have an idea how to fix this? The check is even flagged in a comment in the MLflow source, and it has not been updated:
```python
# Verify that the model has not been loaded to distributed memory
# NB: transformers does not correctly save a model whose weights have been loaded
# using accelerate iff the model weights have been loaded using a device_map that is
# heterogeneous. There is a distinct possibility for a partial write to occur, causing an
# invalid state of the model's weights in this scenario. Hence, we raise.
# We might be able to remove this check once this PR is merged to transformers:
# https://github.com/huggingface/transformers/issues/20072
if _is_model_distributed_in_memory(built_pipeline.model):
    raise MlflowException(
        "The model that is attempting to be saved has been loaded into memory "
        "with an incompatible configuration. If you are using the accelerate "
        "library to load your model, please ensure that it is saved only after "
        "loading with the default device mapping. Do not specify `device_map` "
        "and please try again."
    )
```
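For context, here is a minimal sketch of what a heterogeneity check like `_is_model_distributed_in_memory` could look like. This is an illustrative re-implementation, not the actual MLflow code: it assumes, as accelerate does, that per-module placement is recorded in the model's `hf_device_map` attribute, and it treats any model whose weights span more than one device as "distributed in memory". The `SimpleNamespace` objects stand in for real models.

```python
from types import SimpleNamespace


def is_model_distributed_in_memory(model):
    # Illustrative sketch: accelerate records where each submodule was placed
    # in `hf_device_map` when a model is loaded with device_map=... . If the
    # weights span more than one device, a plain save could produce a partial
    # write, so a check like this raises before saving.
    device_map = getattr(model, "hf_device_map", None)
    if not device_map:
        # No device map at all means the default (single-device) mapping.
        return False
    return len({str(device) for device in device_map.values()}) > 1


# Stand-ins for real models (SimpleNamespace mimics the attribute access):
sharded = SimpleNamespace(hf_device_map={"model.embed_tokens": 0, "lm_head": 1})
single = SimpleNamespace(hf_device_map={"model.embed_tokens": 0, "lm_head": 0})
plain = SimpleNamespace()  # loaded without any device_map

print(is_model_distributed_in_memory(sharded))  # True
print(is_model_distributed_in_memory(single))   # False
print(is_model_distributed_in_memory(plain))    # False
```

This matches the behavior described by the error message: only a heterogeneous `device_map` (weights spread across several devices) trips the check, which is why removing `device_map` makes logging succeed.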
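As for a possible fix: a commonly suggested pattern (a sketch under assumptions, not verified against every MLflow/transformers version) is to keep `device_map` for distributed training, write a checkpoint with `save_pretrained`, reload that checkpoint with the default device mapping, and only then log it. The helper name `log_without_device_map` and the checkpoint layout are hypothetical:

```python
def log_without_device_map(checkpoint_dir, artifact_path="model"):
    # Hypothetical helper: reload the trained checkpoint WITHOUT a device_map
    # argument so accelerate does not shard the weights across devices, then
    # hand the components to MLflow. Assumes `checkpoint_dir` was produced by
    # model.save_pretrained(checkpoint_dir) and
    # tokenizer.save_pretrained(checkpoint_dir) after training finished.
    import mlflow.transformers
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model = AutoModelForCausalLM.from_pretrained(checkpoint_dir)  # default mapping
    tokenizer = AutoTokenizer.from_pretrained(checkpoint_dir)
    return mlflow.transformers.log_model(
        transformers_model={"model": model, "tokenizer": tokenizer},
        artifact_path=artifact_path,
    )
```

Since the reloaded model has no heterogeneous `hf_device_map`, the check quoted above should no longer trigger, while the actual training run can still use `device_map` as before.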