Phi3 support #1826

Closed
martinlyons opened this issue Apr 23, 2024 · 4 comments · Fixed by #1870

Comments

@martinlyons

Feature request

Microsoft's new phi3 model, in particular the 128K context mini model, is not supported by the Optimum ONNX export.

Error is:
"ValueError: Trying to export a phi3 model, that is a custom or unsupported architecture, but no custom export configuration was passed as custom_export_configs. Please refer to https://huggingface.co/docs/optimum/main/en/exporters/onnx/usage_guides/export_a_model#custom-export-of-transformers-models for an example on how to export custom models. Please open an issue at https://github.com/huggingface/optimum/issues if you would like the model type phi3 to be supported natively in the ONNX export."

Motivation

Phi3-mini is potentially very significant: it combines a large context window with a small model size, which could make it useful in many scenarios if its performance holds up.

Your contribution

It's unlikely I could do a PR, as ONNX work is not my forte.

@Whadup

Whadup commented Apr 24, 2024

"phi": NormalizedTextConfig,

Add "phi3": NormalizedTextConfig in this dict and you seem to be all set for phi3-mini

@IlyasMoutawwakil
Member

Patching the TasksManager and NormalizedConfigManager works (until it's added natively):

from transformers import AutoTokenizer

from optimum.exporters import TasksManager
from optimum.exporters.onnx import main_export
from optimum.onnxruntime import ORTModelForCausalLM
from optimum.utils import NormalizedConfigManager

# Reuse phi's export task configs and normalized config for phi3 (temporary workaround until native support lands)
TasksManager._SUPPORTED_MODEL_TYPE["phi3"] = TasksManager._SUPPORTED_MODEL_TYPE["phi"]
NormalizedConfigManager._conf["phi3"] = NormalizedConfigManager._conf["phi"]

# Alternative: export the ONNX model to a local folder with main_export instead of exporting on the fly below
# output = "phi3_onnx"
# main_export(
#     model_name_or_path="microsoft/Phi-3-mini-4k-instruct",
#     task="text-generation-with-past",
#     trust_remote_code=True,
#     output=output,
# )

model = ORTModelForCausalLM.from_pretrained("microsoft/Phi-3-mini-4k-instruct", trust_remote_code=True, export=True)
tokenizer = AutoTokenizer.from_pretrained("microsoft/Phi-3-mini-4k-instruct")
inputs = tokenizer(["Hello, my dog is cute"], return_tensors="pt")

outputs = model.generate(**inputs, max_length=50)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
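The export=True path above converts the checkpoint to ONNX at load time; the commented-out main_export call is the equivalent two-step flow that writes the exported model to the output folder so it can be reloaded later without re-exporting.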

@kwankong-git

Although I have the following patch applied, it still can't export the Phi-3 ONNX model.

commit db6db6f
Author: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
Date: Thu May 9 02:12:55 2024 -0700

Add Phi-3 mini to Optimum (#1841)

* Load config from folder

* Add Phi-3 to normalized config

My command is as follows:
optimum-cli export onnx -m ./phi3_128k_lora_sft --monolith --task text-generation --trust-remote-code --framework pt ./phi3_128k_lora_sft_onnx_3

The output and error are as follows:
Loading checkpoint shards: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:07<00:00, 1.85s/it]
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Traceback (most recent call last):
  File "/usr/local/bin/optimum-cli", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.9/dist-packages/optimum/commands/optimum_cli.py", line 163, in main
    service.run()
  File "/usr/local/lib/python3.9/dist-packages/optimum/commands/export/onnx.py", line 265, in run
    main_export(
  File "/usr/local/lib/python3.9/dist-packages/optimum/exporters/onnx/main.py", line 352, in main_export
    onnx_export_from_model(
  File "/usr/local/lib/python3.9/dist-packages/optimum/exporters/onnx/convert.py", line 1048, in onnx_export_from_model
    raise ValueError(
ValueError: Trying to export a phi3 model, that is a custom or unsupported architecture, but no custom onnx configuration was passed as custom_onnx_configs. Please refer to https://huggingface.co/docs/optimum/main/en/exporters/onnx/usage_guides/export_a_model#custom-export-of-transformers-models for an example on how to export custom models. Please open an issue at https://github.com/huggingface/optimum/issues if you would like the model type phi3 to be supported natively in the ONNX export.
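Commit db6db6f appears to add Phi-3 only to the normalized config, while this error is raised by the export task lookup. Here is a hedged sketch that applies the same runtime workaround shown earlier and then exports the local checkpoint programmatically; the paths and flags simply mirror the failing CLI command, so treat this as an assumption rather than a verified fix:

from optimum.exporters import TasksManager
from optimum.exporters.onnx import main_export
from optimum.utils import NormalizedConfigManager

# Same workaround as in the earlier comment: reuse phi's export and normalized configs for phi3.
TasksManager._SUPPORTED_MODEL_TYPE["phi3"] = TasksManager._SUPPORTED_MODEL_TYPE["phi"]
NormalizedConfigManager._conf["phi3"] = NormalizedConfigManager._conf["phi"]

main_export(
    model_name_or_path="./phi3_128k_lora_sft",   # local fine-tuned checkpoint from the failing command
    output="./phi3_128k_lora_sft_onnx_3",
    task="text-generation",
    monolith=True,
    framework="pt",
    trust_remote_code=True,
)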

@JingyaHuang
Collaborator

With the PR #1841 submitted by the ORT team, we will be able to load ONNX checkpoints of phi3 uploaded by the ORT team (e.g. microsoft/Phi-3-mini-4k-instruct-onnx). But for exporting the model, we would need to add an ONNX config for phi3. I will open a PR today.
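For anyone who only needs inference, a hedged sketch of loading one of those pre-exported checkpoints; the actual layout of microsoft/Phi-3-mini-4k-instruct-onnx may require extra arguments (e.g. subfolder= or file_name=), so this is an assumption rather than a verified recipe:

from transformers import AutoTokenizer
from optimum.onnxruntime import ORTModelForCausalLM

# Load a phi3 ONNX checkpoint published by the ORT team instead of exporting locally.
# Assumption: the ONNX files sit where from_pretrained expects them; adjust subfolder/file_name if not.
model = ORTModelForCausalLM.from_pretrained("microsoft/Phi-3-mini-4k-instruct-onnx")
tokenizer = AutoTokenizer.from_pretrained("microsoft/Phi-3-mini-4k-instruct")

inputs = tokenizer(["Hello, my dog is cute"], return_tensors="pt")
outputs = model.generate(**inputs, max_length=50)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))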

For phi3 small, the team will wait until it becomes a part of a stable transformers release (it is using remote code now).
