Numerical error introduced using ARM64 using clang-14 but not clang-13 #91824

phargogh · 2024-05-10T23:07:23Z

Hello! I am experiencing an issue with clang on ARM64 (reproducible on M1 mac and Raspberry Pi 4B) where chaining a series of mathematical operations together introduces some numerical error. I have a workaround where breaking some of the math onto a second line does not introduce the error. I have tested this on a Raspberry Pi 4B and also on the following debian versions on my M1 mac:

debian buster (arm64, gcc 8.3.0, clang 7.0.1, no numerical issue)
debian bullseye (arm64, gcc 10.2.1, clang 11.0.1-2, no numerical issue)
debian bookworm (arm64, gcc 12.2.0, clang 13.0.1, no numerical issue)
debian bookworm (arm64, gcc 12.2.0, clang 14.0.6, numerical issue present)
debian trixie (arm64, gcc 13.2.0, clang 16.0.6, numerical issue present)
debian sid (arm64, gcc 13.2.0, clang 16.0.6, numerical issue present)

A minimal reproducible sample is here in this gist: https://gist.github.com/phargogh/c4264b37e7f0beed31661eacce53d14a

Thank you!

topperc · 2024-05-10T23:22:49Z

Try passing -ffp-contract=off. The default was changed to -ffp-contract=on in clang 14.

-ffp-contract=on enables the use of FMA instructions for A * B + C if they appear in the same expression. FMA keeps the full precision of the multiply result before doing the addition. Using -ffp-contract=off causes the multiply result to be rounded to a double before doing the addition.

llvmbot · 2024-05-10T23:45:03Z

@llvm/issue-subscribers-backend-aarch64

Author: James Douglass (phargogh)

phargogh · 2024-05-13T21:38:40Z

Thanks @topperc ! I confirm that the behavior is "restored" with -ffp-contract=off. I'll make sure our builds reflect this extra flag.

Although, I'm a little perplexed about why this behavior is creating different results on ARM64 vs X86_64. Any ideas?

arsenm · 2024-05-13T21:54:31Z

> Thanks @topperc ! I confirm that the behavior is "restored" with -ffp-contract=off. I'll make sure our builds reflect this extra flag.
>
> Although, I'm a little perplexed about why this behavior is creating different results on ARM64 vs X86_64. Any ideas?

Because the decision is based on whether the backend thinks FMA is fast. That will always be yes on arm64, and for base x86_64 it will be no. If you use -mfma or target any modernish CPU, you'll probably get the fused result.

phargogh · 2024-05-24T22:45:35Z

Thanks for the information! I think my issue is resolved here, so I'll go ahead and close the issue.

github-actions bot added the new issue label May 10, 2024

EugeneZelenko added backend:AArch64 and removed new issue labels May 10, 2024

EugeneZelenko added the floating-point Floating-point math label May 10, 2024

phargogh added a commit to phargogh/invest that referenced this issue May 13, 2024

Using a compiler flag instead of breaking up math.

cd0871d

See llvm/llvm-project#91824 for the thread with the llvm devs. RE:natcap#1562

phargogh mentioned this issue May 13, 2024

Correcting viewshed behavior on M1 with a compiler flag natcap/invest#1578

Merged

3 tasks

phargogh closed this as completed May 24, 2024

EugeneZelenko added the question A question, not bug report. Check out https://llvm.org/docs/GettingInvolved.html instead! label May 25, 2024
