convert(Int16, ::Float64) does not throw inexact exception when it should #14549

yuyichao · 2016-01-03T16:48:17Z

julia> convert(Int16, 88776.0)
23240

LLVM IR (wrapped in another function)

define i16 @julia_i16_convert_22530(double) #0 {
top:
  %1 = fptosi double %0 to i16
  %2 = sitofp i16 %1 to double
  %3 = fcmp oeq double %2, %0
  br i1 %3, label %pass, label %fail

fail:                                             ; preds = %top
  %4 = load %jl_value_t*, %jl_value_t** @jl_inexact_exception, align 4
  call void @jl_throw(%jl_value_t* %4)
  unreachable

pass:                                             ; preds = %top
  ret i16 %1
}

ASM

Dump of assembler code for function julia_i16_convert_22473:
   0xcce3b000 <+0>:     push    {r11, lr}
   0xcce3b004 <+4>:     mov     r11, sp
   0xcce3b008 <+8>:     vcvt.s32.f64    s2, d0
   0xcce3b00c <+12>:    vmov    r0, s2
   0xcce3b010 <+16>:    vcvt.f64.s32    d16, s2
   0xcce3b014 <+20>:    vcmpe.f64       d16, d0
   0xcce3b018 <+24>:    vmrs    APSR_nzcv, fpscr
   0xcce3b01c <+28>:    popeq   {r11, pc}
   0xcce3b020 <+32>:    movw    r0, #2920       ; 0xb68
   0xcce3b024 <+36>:    movt    r0, #63345      ; 0xf771
   0xcce3b028 <+40>:    ldr     r0, [r0]
   0xcce3b02c <+44>:    bl      0xcce3b050
End of assembler dump.

(interestingly this doesn't happen on AArch64)

yuyichao · 2016-01-03T16:56:06Z

ASM is from gcc disassembly due to #14550
LLVM 3.7.0. Hardware is a Cortex-A57 with a chroot ArchLinuxARM armv7. JULIA_CPU_TARGET is cortex-a7.

Keno · 2016-01-03T17:05:14Z

Isn't this the same issue as #10124? fptosi is undefined for out-of-range floating point values.

yuyichao · 2016-01-03T17:09:31Z

Yeah, seems like it #10124 (comment). I wasn't really sure what exactly are the undefined behaviors.

The PR that closes that issue doesn't seem to cover this one though.

Keno · 2016-01-03T17:11:21Z

Yeah, we need to audit all uses of unsafe_trunc, not just the one in ==.

simonbyrne · 2016-01-04T10:06:49Z

Ah, that is interesting. It seems that undefined behaviour actually allows values outside of the range of the destination type (which in hindsight makes sense, since Int16 isn't a native type).

I guess this means we really need to do the checks before calling unsafe_trunc?

simonbyrne · 2016-01-06T21:05:25Z

So the problem is that adding range checks pre-fptosi adds overhead (the range boundaries need to be loaded from memory to the registers, then 2 cmps and an and) for what is actually a common operation (the nature of being a dynamic technical language is that code will frequently convert between numeric types).

Ideally this would be handled at the LLVM level: either define a checked version of fptosi, or make the undefined behaviour a little less undefined (i.e. that it returns an arbitrary value of the particular type).

c.f. related rust issue: rust-lang/rust#10184

simonbyrne · 2016-01-22T11:17:29Z

It's a bit of a hack, but what if we were to define:

function convert(::Type{Int16},x::Float64)
    u = unsafe_trunc(Int32,x) % Int16
    convert(Float64,u) == x || throw(InexactError())
    u
end

It seems to give the same (valid) instructions on x86, does it fix the issue on ARM?

We're still technically playing with undefined behaviour here, so it would be good to get this clarified upstream.

yuyichao · 2016-01-22T13:46:28Z

It does seem to fix the issue on ARM and is still working on all platforms I have tested. There seems to be a ~20% performance regression on both x64 and aarch64 though.

simonbyrne · 2016-01-22T13:55:58Z

Any idea why it's slower? The code_llvm and code_native output looks basically the same.

yuyichao · 2016-01-22T14:11:45Z

You are right. I was hitting johnmyleswhite/Benchmarks.jl#36 and didn't look at allocation count. There's also a small difference due to inlining (the julia version is harder to inline, which shouldn't be an issue if we simply fix the intrinsics). There's no measurable performance difference once these two issues are fixed.

There's a small difference on x64 with (patched) llvm 3.7.1 in the branch instructions and where the error branch is and IMHO the new version generates slightly better code since the error branch is at the end of the function.

simonbyrne · 2016-01-22T14:20:06Z

Ah good. I was thinking we could just implement these in Julia, since there's no real reason they need to be intrinsics. What if I just wrap them in an @inline?

yuyichao · 2016-01-22T14:23:12Z

What if I just wrap them in an @inline?

It will still make the function that calls them harder to inline. Not sure how big an effect it is though.

simonbyrne · 2016-01-22T14:25:42Z

Does convert(Int8,::Float64) have the same issue?

yuyichao · 2016-01-22T14:27:23Z

~~No because it's defined as an intrinsic?~~

nvm, I see you are refering to the original issue. .........................

simonbyrne · 2016-01-22T14:30:03Z

I meant, the same issue as above, i.e. does convert(Int8, 88776.0) throw an error?

yuyichao · 2016-01-22T14:34:39Z

Yeah realized that and edited my comment above...
Yes, it seems that most of the conversion has this issue.

julia> convert(Int8, 200000.0)
64

julia> convert(Int8, 200000f0)
64

julia> convert(UInt8, 200000.0)
0x40

julia> convert(UInt8, 200000f0)
0x40

julia> convert(Int16, 200000.0)
3392

julia> convert(Int16, 200000f0)
3392

julia> convert(UInt16, 200000.0)
0x0d40

julia> convert(UInt16, 200000f0)
0x0d40

(U)Int32 and up seems fine (on arm at least not sure about UB).

…tations. Fixes #14549.

simonbyrne · 2016-01-22T19:53:12Z

I posted a note on llvm-dev here:
http://lists.llvm.org/pipermail/llvm-dev/2016-January/094405.html

simonbyrne · 2016-01-27T16:59:56Z

I don't know much about Swift, but it seems that they manually check every conversion:
https://github.com/apple/swift/blob/bf969a385f06e9731e4642ace08f42efdf4d6dd8/stdlib/public/core/FloatingPoint.swift.gyb#L657-L690

simonbyrne · 2016-01-27T17:04:50Z

Ah, but it seems that preconditions are removed on -Ounchecked builds.

tkelman · 2016-01-27T17:59:27Z

I had seen somewhere that different levels of -O flags make a surprisingly large difference to swift's performance, this kind of thing is probably part of the reason.

simonbyrne · 2016-06-27T23:33:22Z

Since we still haven't fixed this, here is a recap:

The problem here is that if the input is out of range in Float -> Integer conversion the behaviour is undefined, and LLVM considers returning a 32bit integer instead of a 16bit one acceptable undefined behaviour, which breaks our assumptions in the convert logic (which converts to integer, then back to float and compares this with the original result).

Unless we can convince LLVM to change this, the options here are:

change all float -> integer conversions to include a manual bounds check (which according to LLVM is the correct way). This seems to be about 50% slower, which is a shame as this is not an uncommon operation.
Just fix the breaking cases on ARM only, either by using a manual check, or doing convert(Int16,convert(Int32, x))

toivoh · 2016-06-28T06:08:32Z

If we know that we are going to get a 32 bit result, can we just mask it down to 16 bits before converting back to float? That should be pretty cheap.

simonbyrne · 2016-06-28T20:12:34Z

Actually, I must have been doing something wrong: now it seems to be about the same (or possibly even faster) to do the range check. So maybe we should change this.

…tations. Fixes #14549.

yuyichao · 2016-09-19T12:56:20Z

Seems that LLVM implements more optimizations on aarch64 and now (LLVM-svn) this is a problem there too.

…tations. Fixes #14549.

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implementations. Fixes #14549. Explain logic behind float->integer conversion checking

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implementations. Fixes #14549. Explain logic behind float->integer conversion checking (cherry picked from commit f935a50)

yuyichao added the system:arm ARMv7 and AArch64 label Jan 3, 2016

simonbyrne added a commit that referenced this issue Jan 22, 2016

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implemen…

9734e5d

…tations. Fixes #14549.

simonbyrne mentioned this issue Jan 22, 2016

replace checked_fptosi intrinsics with Julia implementation #14763

Merged

yuyichao mentioned this issue May 1, 2016

rationalize() test failure in the numbers tests on arm #16148

Closed

simonbyrne added a commit that referenced this issue Jun 28, 2016

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implemen…

bf773a0

…tations. Fixes #14549.

yuyichao mentioned this issue Sep 16, 2016

Rationalize test failure on Power + master #18553

Closed

ViralBShah added the system:powerpc PowerPC label Sep 17, 2016

simonbyrne added a commit that referenced this issue Sep 19, 2016

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implemen…

c92f76e

…tations. Fixes #14549.

simonbyrne added a commit that referenced this issue Sep 21, 2016

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implemen…

fea8f26

…tations. Fixes #14549.

vchuravy mentioned this issue Sep 23, 2016

[power] reenable partword atomics #18639

Merged

simonbyrne added a commit that referenced this issue Sep 24, 2016

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implemen…

cec4fa5

…tations. Fixes #14549.

simonbyrne added a commit that referenced this issue Sep 24, 2016

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implemen…

8e2b79a

…tations. Fixes #14549.

simonbyrne added a commit that referenced this issue Sep 24, 2016

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implemen…

70c459a

…tations. Fixes #14549.

simonbyrne added a commit that referenced this issue Sep 29, 2016

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implemen…

ad872cb

…tations. Fixes #14549.

simonbyrne closed this as completed in #14763 Sep 30, 2016

simonbyrne added a commit that referenced this issue Sep 30, 2016

replace checked_fptosi intrinsics with Julia implementation (#14763)

f935a50

Replaces checked_fptosi/checked_fptoui intrinsics with Julia implementations. Fixes #14549. Explain logic behind float->integer conversion checking

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

convert(Int16, ::Float64) does not throw inexact exception when it should #14549

convert(Int16, ::Float64) does not throw inexact exception when it should #14549

yuyichao commented Jan 3, 2016

yuyichao commented Jan 3, 2016

Keno commented Jan 3, 2016

yuyichao commented Jan 3, 2016

Keno commented Jan 3, 2016

simonbyrne commented Jan 4, 2016

simonbyrne commented Jan 6, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

simonbyrne commented Jan 27, 2016

simonbyrne commented Jan 27, 2016

tkelman commented Jan 27, 2016

simonbyrne commented Jun 27, 2016

toivoh commented Jun 28, 2016

simonbyrne commented Jun 28, 2016

yuyichao commented Sep 19, 2016

convert(Int16, ::Float64) does not throw inexact exception when it should #14549

convert(Int16, ::Float64) does not throw inexact exception when it should #14549

Comments

yuyichao commented Jan 3, 2016

yuyichao commented Jan 3, 2016

Keno commented Jan 3, 2016

yuyichao commented Jan 3, 2016

Keno commented Jan 3, 2016

simonbyrne commented Jan 4, 2016

simonbyrne commented Jan 6, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

yuyichao commented Jan 22, 2016

simonbyrne commented Jan 22, 2016

simonbyrne commented Jan 27, 2016

simonbyrne commented Jan 27, 2016

tkelman commented Jan 27, 2016

simonbyrne commented Jun 27, 2016

toivoh commented Jun 28, 2016

simonbyrne commented Jun 28, 2016

yuyichao commented Sep 19, 2016