Code:
MNN::ScheduleConfig sConfig;
sConfig.type = static_cast<MNNForwardType>(MNN_FORWARD_CPU);
sConfig.numThread = threadNum;
BackendConfig bConfig;
bConfig.precision = static_cast<BackendConfig::PrecisionMode>(precision);
sConfig.backendConfig = &bConfig;
auto session = net->createSession(sConfig);

With precision=2 the inference results are wrong; the other precision levels work correctly. If I use the Module API instead, fp16 inference gives correct results and is faster.

The documentation says: for the CPU backend, when precision is Low, FP16 computation is enabled depending on the device; when precision is Low_BF16, BF16 computation is enabled depending on the device. How should fp16 be enabled with a session?
Version: 2.8.4
Most likely this is a problem in your Session API calling code. With fp16, inputs and outputs must go through copyFromHost / copyToHost; you cannot access the tensor's host pointer directly. We recommend using the Module API for everything.
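A minimal sketch of the host-tensor pattern described above, under fp16 (Precision_Low) with the Session API. The model path and input-filling step are placeholders for your own setup; the key point is staging data through host tensors with copyFromHostTensor / copyToHostTensor rather than touching the session tensor's host pointer:

```cpp
#include <MNN/Interpreter.hpp>
#include <MNN/Tensor.hpp>
#include <memory>

int main() {
    // Hypothetical model path; substitute your own model file.
    std::shared_ptr<MNN::Interpreter> net(
        MNN::Interpreter::createFromFile("model.mnn"));

    MNN::ScheduleConfig sConfig;
    sConfig.type = MNN_FORWARD_CPU;
    MNN::BackendConfig bConfig;
    bConfig.precision = MNN::BackendConfig::Precision_Low;  // FP16 where supported
    sConfig.backendConfig = &bConfig;
    auto session = net->createSession(sConfig);

    auto input = net->getSessionInput(session, nullptr);
    // Stage input data in a separate host tensor; with fp16 enabled,
    // do NOT write through input->host<float>() directly.
    std::unique_ptr<MNN::Tensor> inputHost(
        MNN::Tensor::create<float>(input->shape(), nullptr, MNN::Tensor::CAFFE));
    // ... fill inputHost->host<float>() with your data ...
    input->copyFromHostTensor(inputHost.get());

    net->runSession(session);

    auto output = net->getSessionOutput(session, nullptr);
    // Likewise, copy results out into a host tensor before reading them.
    std::unique_ptr<MNN::Tensor> outputHost(
        new MNN::Tensor(output, MNN::Tensor::CAFFE));
    output->copyToHostTensor(outputHost.get());
    // Read results from outputHost->host<float>().
    return 0;
}
```

The copy calls let the backend convert between its internal fp16 layout and the float32 host tensors, which is why direct host-pointer access breaks only at precision=2.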