DOT via GEMM TPP #803
General Q (raised by others): this would not benefit from AMX, as the latter needs matrices/tiles to benefit from reuse etc. Is this a correct assumption?
DOT and GEMV are BW-bound ops, so AMX is irrelevant.
Indeed. However, this came up in the context of "batched dot-products", which in turn can be thought of as a matrix multiplication. I still wonder if there is something useful in it and whether I should get the exact use case.
Well, then one is better off reformulating the algorithm/math to use matmul. This is a standard trick in linear algebra…
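To make the "batched dot-products as matmul" reformulation concrete, here is a minimal NumPy sketch (illustrative only, not libxsmm code): stacking the vector pairs row-wise turns N independent dot products into the diagonal of a single GEMM.

```python
import numpy as np

# Hypothetical example: N independent dot products dots[i] = a_i . b_i,
# with the vectors stacked row-wise into matrices A and B of shape (N, K).
rng = np.random.default_rng(0)
A = rng.standard_normal((4, 8))
B = rng.standard_normal((4, 8))

# Naive: one dot product per pair.
naive = np.array([np.dot(A[i], B[i]) for i in range(A.shape[0])])

# Reformulated via matmul: the batched dots are the diagonal of A @ B^T,
# i.e. one GEMM covers all N dots (a fused kernel would of course skip
# the wasted off-diagonal work).
via_gemm = np.diag(A @ B.T)
assert np.allclose(naive, via_gemm)
```

The waste in the off-diagonal entries is exactly why the exact use case matters: whether a GEMM-shaped kernel pays off depends on how much reuse the batch actually exposes.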
For the record, this can be related to https://github.com/vondele/Stockfish/tree/amx_v1.
For various LLM and GNN operators, we need a fast vector dot. Right now we only have a very slow A^T GEMM for M=1, or we have to run a sequence of unary/binary TPPs.
Plan for improvement: add a fast A^T GEMM for M=1 that uses an inner-product approach plus a vector reduce at the end.
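The proposed scheme (accumulate into vector lanes during the inner-product loop, reduce horizontally only once at the end) can be sketched in NumPy; `dot_lanes` and `width` are illustrative names, not libxsmm identifiers, and `width` stands in for the vector-register length.

```python
import numpy as np

def dot_lanes(a, b, width=16):
    """Dot product with a width-wide accumulator that is reduced once at
    the end, mimicking a SIMD inner-product kernel (hypothetical sketch,
    not the libxsmm API)."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    acc = np.zeros(width)
    main = len(a) - len(a) % width
    for i in range(0, main, width):           # vector body: FMA into the lanes
        acc += a[i:i + width] * b[i:i + width]
    tail = np.dot(a[main:], b[main:])         # scalar remainder loop
    return acc.sum() + tail                   # single horizontal reduce
```

For the M=1 A^T GEMM itself, each output element is one such dot of a column of A with the input vector; deferring the horizontal reduce keeps the hot loop as pure FMAs, which is the point of the inner-product approach.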