[Operators] Add batch support for x86 CPU matrix multiplication + resolve rule #415

BolinSNLHM · 2024-01-11T19:22:08Z

Preliminary performance testing results compared to PyTorch:

deleted redundant file

use the original name

yaoyaoding

Glad to see that you extend it to batch version. But there are still something we can do to make it better. See the comments below.

yaoyaoding · 2024-01-11T20:57:16Z

python/hidet/graph/ops/matmul/matmul_f32_x86.py

            ),
        )

        super().__init__(
            name='matmul_f32_x86',
            inputs=[a, b],
            outputs=[c],
-            attributes={'m_size': a_shape[-2], 'n_size': b_shape[-1], 'k_size': a_shape[-1]},
+            attributes={'batch_size': batch_size, 'm_size': m_size, 'n_size': n_size, 'k_size': k_size},


We can use this function https://github.com/hidet-org/hidet/blob/main/python/hidet/ir/compute/cops/matmul.py#L30 to add computation definition for matmul.

Like

hidet/python/hidet/graph/ops/matmul/matmul.py

Line 20 in 359c5f3

c = cops.matmul(a, b, allow_1d=True)

python/hidet/graph/ops/matmul/matmul_f32_x86.py

yaoyaoding · 2024-01-11T20:59:08Z

python/hidet/graph/ops/matmul/matmul_f32_x86.py

+            and (not is_constant(a.shape[0], b.shape[0]) or a.shape[0] == b.shape[0])
+            and (not is_constant(a.shape[2], b.shape[1]) or a.shape[2] == b.shape[1])


Are you require the shape of a and b constant?

Is it possible to support dynamic shape like https://github.com/hidet-org/hidet/blob/main/python/hidet/graph/ops/matmul/batch_matmul.py

yaoyaoding · 2024-01-11T21:01:19Z

python/hidet/graph/ops/matmul/matmul_f32_x86.py

+        # if not (len(a.shape) == len(b.shape) == 2 and a.shape[1] == b.shape[0]):
+        #     raise ValueError('Matrix multiplication: incompatible sizes: {} and {}'.format(a.shape, b.shape))
+        if not (
+            len(a.shape) == len(b.shape) == 3


Can we use the same template to support matmul like:

[12, 1024, 1024] @ [1024, 1024]

[12, 1024, 1024] @ [1, 1024, 1024]

[1024, 1024] @ [4, 5, 1024, 1024]

You can have a look at https://github.com/hidet-org/hidet/blob/main/python/hidet/graph/ops/matmul/batch_matmul.py as a reference.

python/hidet/graph/ops/matmul/resolve.py

tests/operators/test_matmul.py

Co-authored-by: Yaoyao Ding <dingyaoyao.cs@gmail.com>

BolinSNLHM added 30 commits May 27, 2023 22:01

.

efe3e14

Merge branch 'hidet-org:main' into main

b19a212

.

d7e4043

Merge branch 'main' of github.com:BolinSNLHM/hidet into main

e13af0a

added basic openMP primitives

a7bce75

Merge branch 'main' into omp

bad483c

added those primitives back

d7f6469

let me pretend like it's all good for tonight

f211a48

...

bbb5afc

working on refactoring

569fb49

ready to be tested on the eco server

b32ea73

fix stupid error

dbbb2b6

..

014f5c1

fix more error

2d82325

..

11c9e70

fixing hidet script error

4586e89

...:

65c3b9d

....

286c107

...

bfacaf8

..

8246466

..

7518042

fixing strange error

f8a97b2

more errors

1a87c27

more err

3104473

...

68bc03d

...

9059ca3

global

df5a177

global var

27da1ba

.

fca3694

.

14973b4

BolinSNLHM added 14 commits November 16, 2023 20:40

Merge branch 'fix-zero-init' into main

6f572a4

ready for PR

3fbb635

......

656bbd0

avoid changing function attributes from outside

ebcc78f

Delete python/mat_new.py

fa39456

deleted redundant file

Update matmul_f32_x86.py

b61722d

use the original name

Merge branch 'hidet-org:main' into main

575acaf

Merge branch 'hidet-org:main' into main

ef57171

adding batch support

0e7eb63

Merge branch 'hidet-org:main' into main

d33093e

.

4d9505d

fix conflict

091492c

resolve rule + batch support

170896e

modify test

71fcd6a

yaoyaoding requested changes Jan 11, 2024

View reviewed changes

BolinSNLHM and others added 14 commits January 11, 2024 19:58

Update python/hidet/graph/ops/matmul/matmul_f32_x86.py

60319ca

Co-authored-by: Yaoyao Ding <dingyaoyao.cs@gmail.com>

Update python/hidet/graph/ops/matmul/resolve.py

4a6f641

Co-authored-by: Yaoyao Ding <dingyaoyao.cs@gmail.com>

Update python/hidet/graph/ops/matmul/resolve.py

c4152e2

Co-authored-by: Yaoyao Ding <dingyaoyao.cs@gmail.com>

Update python/hidet/graph/ops/matmul/resolve.py

f1bddb5

Co-authored-by: Yaoyao Ding <dingyaoyao.cs@gmail.com>

Update tests/operators/test_matmul.py

c2ad5de

Co-authored-by: Yaoyao Ding <dingyaoyao.cs@gmail.com>

Update python/hidet/graph/ops/matmul/resolve.py

95dd0fb

Co-authored-by: Yaoyao Ding <dingyaoyao.cs@gmail.com>

Update python/hidet/graph/ops/matmul/resolve.py

8d09697

Co-authored-by: Yaoyao Ding <dingyaoyao.cs@gmail.com>

Merge branch 'hidet-org:main' into main

40b7f51

Merge branch 'hidet-org:main' into main

0cc633a

Merge branch 'main' into gpt2-benchmark

94c97b8

Merge branch 'main' of github.com:BolinSNLHM/hidet into main

b843c57

Merge branch 'main' into gpt2-benchmark

04cc68a

resolve asdfasdf

f9caaf6

commit before fixing matmul for global var

2220e8f

BolinSNLHM closed this Mar 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Operators] Add batch support for x86 CPU matrix multiplication + resolve rule #415

[Operators] Add batch support for x86 CPU matrix multiplication + resolve rule #415

BolinSNLHM commented Jan 11, 2024

yaoyaoding left a comment

yaoyaoding Jan 11, 2024

yaoyaoding Jan 11, 2024

yaoyaoding Jan 11, 2024

		and (not is_constant(a.shape[0], b.shape[0]) or a.shape[0] == b.shape[0])
		and (not is_constant(a.shape[2], b.shape[1]) or a.shape[2] == b.shape[1])

[Operators] Add batch support for x86 CPU matrix multiplication + resolve rule #415

[Operators] Add batch support for x86 CPU matrix multiplication + resolve rule #415

Conversation

BolinSNLHM commented Jan 11, 2024

yaoyaoding left a comment

Choose a reason for hiding this comment

yaoyaoding Jan 11, 2024

Choose a reason for hiding this comment

yaoyaoding Jan 11, 2024

Choose a reason for hiding this comment

yaoyaoding Jan 11, 2024

Choose a reason for hiding this comment