[Paddle inference]conv2d_add_act_pass support cutlass #64201
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
PR Category
Inference
PR Types
Others
Description
pcard-71500
1.conv2d_add_act_pass支持调入cutlass kernel
2.ppyoloe_crn_s_300e_coco_upload 模型 精度正常
batch:1 fp16 开启 cutlass2.2 8.28597ms
batch:8 fp16 开启 cutlass2.2 45.29349ms
batch:1 fp16 11.61632ms
batch:8 fp16 34.96974ms
batch:1 fp16 开启cutlass3.0 4.42932
batch:8 fp16 开启cutlass3.0 17.41582ms
a30下 sd1-5 打开cutlass
3.820813s->3.173564s