-
Notifications
You must be signed in to change notification settings - Fork 190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add gaudi modeling support in itrex #1438
Conversation
Signed-off-by: Clark Chin <xi2.chen@intel.com>
⚡ Required checks status: All passing 🟢Groups summary🟢 Format Scan Tests workflow
These checks are required after the changes to 🟢 Optimize Unit Test workflow
These checks are required after the changes to 🟢 NeuralChat Unit Test
These checks are required after the changes to 🟢 Engine Unit Test workflow
These checks are required after the changes to 🟢 Chat Bot Test workflow
These checks are required after the changes to Thank you for your contribution! 💜
|
for more information, see https://pre-commit.ci
...ion_for_transformers/neural_chat/examples/finetuning/multi_modal/eval/mmmu_eval/run_llava.py
Show resolved
Hide resolved
intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/__init__.py
Show resolved
Hide resolved
Signed-off-by: Chen Xi <xi2.chen@intel.com>
for more information, see https://pre-commit.ci
Signed-off-by: Chen Xi <xi2.chen@intel.com>
for more information, see https://pre-commit.ci
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no requirements updated? At least optimum-habana shall be added
Signed-off-by: Meng, Hengyu <hengyu.meng@intel.com>
Signed-off-by: Chen Xi <xi2.chen@intel.com>
@lkk12014402 kaokao, please take a look on the new commit on measurement of ppl |
Signed-off-by: Chen Xi <xi2.chen@intel.com>
* initial commit Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * mv example Signed-off-by: Yu Zhentao <zhentao.yu@intel.com> * update model dtype Signed-off-by: Yu Zhentao <zhentao.yu@intel.com> * fix multi-round generation without streaming_llm Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * add mem and token num log Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * rebase Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * initial fp8 Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * add ppl eval scripts Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * typo Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * add llama2-13b ppl eval script (align paper) Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * hide kv cache operation inside (v0.1) Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * hide kv cache operation inside (v0.2) Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * hide kv cache operation inside (v0.3) Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * update scripts Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * add README Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * update test scripts Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * remove useless code Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> * update README and rename shell scripts Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> --------- Signed-off-by: Yu, Zhentao <zhentao.yu@intel.com> Signed-off-by: Yu Zhentao <zhentao.yu@intel.com>
for more information, see https://pre-commit.ci
Signed-off-by: Clark Chin <xi2.chen@intel.com>
for more information, see https://pre-commit.ci
Signed-off-by: Chen Xi <xi2.chen@intel.com>
for more information, see https://pre-commit.ci
Signed-off-by: Clark Chin <xi2.chen@intel.com>
Signed-off-by: Chen Xi <xi2.chen@intel.com>
Signed-off-by: Clark Chin <xi2.chen@intel.com>
Signed-off-by: Clark Chin <xi2.chen@intel.com>
Signed-off-by: Clark Chin <xi2.chen@intel.com>
Signed-off-by: VincyZhang <wenxin.zhang@intel.com>
Signed-off-by: Clark Chin <xi2.chen@intel.com>
for more information, see https://pre-commit.ci
Signed-off-by: Clark Chin <xi2.chen@intel.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great start on HPU
Signed-off-by: Clark Chin <xi2.chen@intel.com>
Type of Change
gaudi modeling used in itrex for int4 kv-cache support