You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks, since phi 3 support has been merged will close this issue. But I have another question and do not want to create a separate issue, so asking here.
@bil-ash Hi, AVX512F here means devices without AVX512_VNNI, and I don't implement u8s8 and s8s8 for AVX512. So it's better to use fp32 for computation. AVX2 devices without AVX_VNNI have u8s8 & s8s8 kernels for backup.
Please add support for the phi-3-mini-128k(context length) model in neural-speed.
The text was updated successfully, but these errors were encountered: