Add sve targets #2886

vorj · 2023-05-31T06:32:27Z

related: #2884

This PR contains the following changes:

Add a new optlevel, sve
- ARM SVE is an extension of ARMv8, so IMO it should be treated similarly to AVX2
Add targets for ARM SVE: faiss_sve and swigfaiss_sve
- These targets are built when you pass -DFAISS_OPT_LEVEL=sve at build time
- Design decision: don't fix the SVE register length.
  - The faiss Python package is a "fat binary" (for example, the package for avx2 contains both _swigfaiss_avx2.so and _swigfaiss.so)
  - SVE is a scalable instruction set (= it doesn't fix the vector length), but the vector length can actually be specified at compile time
    - with the -msve-vector-length= option
    - When this option is specified, the binary doesn't work correctly on a CPU whose vector length differs from the one specified at compile time
  - With fixed vector lengths, an SVE-enabled faiss Python package would contain 7 shared libraries: _swigfaiss.so, _swigfaiss_sve.so, _swigfaiss_sve128.so, _swigfaiss_sve256.so, _swigfaiss_sve512.so, _swigfaiss_sve1024.so, and _swigfaiss_sve2048.so. The package size would explode.
  - For these reasons, I don't specify the vector length at compile time, and faiss_sve detects the vector length at run time.
Add a mechanism that detects ARM SVE in the runtime environment and imports swigfaiss_sve dynamically
- Currently it only supports Linux, but as far as I know there is no SVE environment on a non-Linux OS at the moment
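
To illustrate the runtime detection and dynamic import described above, here is a minimal Python sketch. It is not the code from this PR: the helper names (cpuinfo_has_sve, sve_available, pick_module_names, load_first_available) are hypothetical, and it assumes that on Linux/aarch64 the CPU flags show up on the "Features" lines of /proc/cpuinfo and that the extension modules are importable as faiss.swigfaiss_sve and faiss.swigfaiss.

```python
import importlib
import platform


def cpuinfo_has_sve(cpuinfo_text):
    """Return True if a 'Features' line of /proc/cpuinfo-style text lists 'sve'."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        # Compare whole tokens so that e.g. 'svebf16' alone does not count as 'sve'.
        if sep and key.strip() == "Features" and "sve" in value.split():
            return True
    return False


def sve_available():
    """Hypothetical helper: report SVE only on Linux/aarch64, else False."""
    if platform.system() != "Linux" or platform.machine() != "aarch64":
        return False
    try:
        with open("/proc/cpuinfo") as f:
            return cpuinfo_has_sve(f.read())
    except OSError:
        return False


def pick_module_names(has_sve):
    """Modules to try, in order: the SVE build first, the generic build as fallback."""
    return (["swigfaiss_sve"] if has_sve else []) + ["swigfaiss"]


def load_first_available(names, package="faiss"):
    """Import and return the first module in names that imports successfully."""
    last_error = None
    for name in names:
        try:
            return importlib.import_module(f"{package}.{name}")
        except ImportError as error:
            last_error = error
    raise ImportError(f"none of {names} could be imported") from last_error
```

A loader built this way would call load_first_available(pick_module_names(sve_available())), so a machine without SVE, or a package without the SVE build, still falls back to the generic module.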

NOTE: I plan to open one more PR that adds some SVE implementations after this PR is merged. This PR only adds the sve targets.

mdouze · 2023-05-31T16:54:35Z

Please don't add a faiss/python/swigfaiss_sve.swig file.

vorj · 2023-05-31T19:47:49Z

Oh, sorry. I missed that it is already copied at this line. I removed the file and added its path to .gitignore.

vorj · 2023-06-20T06:44:54Z

environment: line 9: /opt/conda/lib/jvm/languages/python/bin/conda: No such file or directory

🤨

vorj · 2023-06-20T06:45:33Z

Ah, #2917, OK.

vorj · 2023-06-20T07:08:54Z

@mdouze What is the current status of this PR?

mdouze · 2023-06-21T09:37:12Z

So the diff only changes the compilation flags, it does not add SVE-specific SIMD implementations, right?
Do you have hardware to try it on and maybe measure performance improvements?

vorj · 2023-06-21T18:21:37Z

So the diff only changes the compilation flags, it does not add SVE-specific SIMD implementations, right?
Do you have hardware to try it on and maybe measure performance improvements?

In this PR, faiss uses SVE only in auto-vectorized functions such as fvec_L2sqr.
So this PR brings only a small performance improvement for now; my aim is to add the faiss_sve target first.
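
For readers unfamiliar with fvec_L2sqr: it computes the squared Euclidean distance between two float vectors. The sketch below shows only those semantics in plain Python (l2sqr_reference is a hypothetical name, not faiss code). The point is that the C++ equivalent is a simple reduction loop of this shape, which is the kind of loop a compiler can vectorize on its own when SVE code generation is enabled.

```python
def l2sqr_reference(x, y):
    """Squared L2 distance: the sum over i of (x[i] - y[i]) ** 2."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    acc = 0.0
    # One subtract, one multiply, and one add per element, with no
    # branches in the loop body.
    for a, b in zip(x, y):
        diff = a - b
        acc += diff * diff
    return acc
```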

vorj · 2023-06-22T07:30:36Z

As I wrote before,

I plan to open one more PR that adds some SVE implementations after this PR is merged.

It will include SVE implementations of code_distance, exhaustive_L2sqr_blas_cmax, and so on.

vorj · 2023-06-28T09:59:51Z

@mdouze IMO the PRs should be kept separate, but I'm willing to include the performance-improvement commits in this PR if you prefer. Which would you like?

mdouze · 2023-06-29T13:33:01Z

Sorry for being a bit slow to react.
I think that it's fine to land this packaging PR first; let us check the implications in terms of library size.
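
The library-size question ties back to the design decision in the PR description. As a rough sketch (the function names are hypothetical; the file names and vector lengths are the ones listed in the description), the two packaging options can be enumerated like this:

```python
# Vector lengths in bits, as listed in the PR description.
FIXED_SVE_BITS = [128, 256, 512, 1024, 2048]


def fixed_length_libraries(bits=FIXED_SVE_BITS):
    """Shared libraries needed if one build is shipped per fixed vector length."""
    libs = ["_swigfaiss.so", "_swigfaiss_sve.so"]
    libs += [f"_swigfaiss_sve{b}.so" for b in bits]
    return libs


def length_agnostic_libraries():
    """Shared libraries needed when the vector length is detected at run time."""
    return ["_swigfaiss.so", "_swigfaiss_sve.so"]


print(len(fixed_length_libraries()))     # 7
print(len(length_agnostic_libraries()))  # 2
```

This counts files only. The actual size on disk depends on the build and would have to be measured.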

vorj · 2023-06-30T02:50:13Z

@mdouze OK. Whenever you need something from me, for example if you:

need me to make a decision,
need me to change some code, or
want to know my opinion,

please feel free to leave a comment. In any case, I will wait for the check for a while. Thanks.

naveentatikonda · 2023-09-21T18:22:44Z

@mdouze and @vorj is there any update on adding SVE support, and do you still plan to add it? I saw some discussion on the other PR, and there has been no activity for a while. Basically, we are looking for an optimization of Scalar Quantization (specifically SQfp16) on ARM, like the AVX2 one on x86.

Also, please let us know if you need any help to run tests for SVE support. We have bandwidth and resources to run tests. Thanks!

vorj · 2023-09-22T06:22:38Z

@naveentatikonda I am just a contributor, not employed by Meta, so I don't actually know the plans for this (official faiss) repository. However, as I said above, I have further patches that improve performance more, and I will create a PR once this one is merged.

naveentatikonda · 2023-09-25T17:12:45Z

@mdouze Did you get a chance to look into my question?

mdouze · 2023-09-26T13:01:44Z

OK so I think a way to move forward is to accept this PR but not cover it with CI.
Then optimized code for SVE can be contributed. At some point we will probably either:

add SVE to the CI or
remove SVE support if it turns out it is not used too much.

Is there a doc somewhere that shows which current and future ARM implementations support SVE?

Thanks

mdouze · 2023-09-26T14:46:39Z

Would you mind rebasing on the latest Faiss so that I can import it to the internal Faiss version?
Thanks

alexanderguzhva · 2023-09-26T19:07:57Z

I can assist and review the code, if needed

vorj · 2023-09-27T03:24:32Z

@mdouze

Is there a doc somewhere that shows which current and future ARM implementations support SVE?

At the least, current and future CPUs that implement ARMv9 will support SVE, because SVE2 is part of the base ARMv9 instruction set. Cortex-A510, Cortex-X2, Neoverse N2, and Neoverse V2 support ARMv9. However, I don't know which concrete implementations (real CPUs) will have ARMv9 or SVE, as that is decided by the manufacturers.

naveentatikonda · 2023-09-27T16:59:36Z

@naveentatikonda I am just a contributor, not employed by Meta, so I don't actually know the plans for this (official faiss) repository. However, as I said above, I have further patches that improve performance more, and I will create a PR once this one is merged.

@vorj Do you also have plans to add sve support to ScalarQuantization after this PR is merged?

vorj · 2023-09-28T17:34:03Z

@naveentatikonda

Do you also have plans to add sve support to ScalarQuantization after this PR is merged?

Currently I don't have an SVE version of ScalarQuantization, so you are welcome to contribute it. That said, when I have spare time I speed up whatever unoptimized code I find. If there is still no SVE ScalarQuantization code when I next work on optimizing faiss, I will do it.

vorj · 2023-10-06T03:21:43Z

@mdouze

Would you mind rebasing on the latest Faiss so that I can import it to the internal Faiss version?

I did it. Would you review this?

cjnolet · 2023-10-10T16:21:50Z

Just want to add a note here that this change is also very important to Nvidia RAPIDS libraries, as we're gearing up to have more libraries optimized for the Grace architecture.

facebook-github-bot added the CLA Signed label May 31, 2023

vorj force-pushed the support-arm_sve branch from d0c643a to 4def441 Compare May 31, 2023 07:45

vorj force-pushed the support-arm_sve branch from 4def441 to a6d28e4 Compare May 31, 2023 19:47

vorj force-pushed the support-arm_sve branch 3 times, most recently from 091b0f7 to 5a3d8ea Compare June 2, 2023 23:46

vorj force-pushed the support-arm_sve branch 3 times, most recently from 96f35db to d7c27ba Compare June 13, 2023 04:11

vorj force-pushed the support-arm_sve branch from 0ecf934 to 59acf2b Compare June 20, 2023 06:11

vorj force-pushed the support-arm_sve branch from 59acf2b to b0c2296 Compare June 21, 2023 10:48

vorj force-pushed the support-arm_sve branch from b0c2296 to 49578a1 Compare June 26, 2023 14:39

mdouze mentioned this pull request Jun 29, 2023

Add BMI2; Add LTO; Upgrade SQ4, SQ6, SQ8 for AVX2 #2931

Closed

vorj force-pushed the support-arm_sve branch from a48be32 to 28accce Compare September 27, 2023 02:56

vorj force-pushed the support-arm_sve branch from 28accce to 4e921e5 Compare September 27, 2023 11:33

vorj force-pushed the support-arm_sve branch 2 times, most recently from 4832e9a to 1ab6f01 Compare October 3, 2023 12:05

vorj force-pushed the support-arm_sve branch from 1ab6f01 to 941806c Compare October 6, 2023 03:20

add sve targets, faiss_sve and swigfaiss_sve

155e4bd

vorj force-pushed the support-arm_sve branch from 941806c to 155e4bd Compare October 25, 2023 07:09
