Add `Expr(:funcinfo, ...` and unsafe verifier #43747

vchuravy · 2022-01-10T19:05:51Z

As discussed in #41616, we need some way of marking a function as being used in an unsafe situation.
"Unsafe" here means executed on a non-Julia thread, where we cannot access any TLS variables. This PR mocks
up the necessary changes so that we can discuss this approach right now:

julia> function f_unsafe()
         Base.@unsafe()
       end
julia> f_unsafe()

julia> function f_unsafe()
         Base.@unsafe()
         return Ref(Int)
       end
julia> f_unsafe()
Verifying unsafe function julia_f_unsafe_43 failed

signal (6): Aborted
in expression starting at REPL[4]:1
gsignal at /usr/lib/libc.so.6 (unknown line)
abort at /usr/lib/libc.so.6 (unknown line)
runOnFunction at /home/vchuravy/src/julia/src/llvm-unsafe-verifier.cpp:66 [inlined]

Changes:

- Add Expr(:funcinfo) for setting LLVM metadata on a function
- Add macro unsafe
- Add nascent unsafe verifier pass

JeffBezanson · 2022-01-11T22:34:42Z

Yes, this is a good idea! I think we'll need some more specific names for various kinds of restrictions that people want in different cases. For example, no-runtime (doesn't call into the julia runtime), maybe no-allocation, foreign-thread (this case). The foreign-thread restriction might be the same as no-runtime, unless we carefully catalog runtime system functions that don't access TLS?

I assume this needs to be recursive as well; storing a flag on each function if it is "unsafe" and checking that unsafe functions only call other unsafe functions?
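The recursive check described here can be illustrated with a minimal sketch over a toy call graph. Everything in it is an assumption made for illustration: the `calls` table, the `flagged` set, and the helper `first_violation` are hypothetical names, and the PR itself does not implement this check.

```julia
# Toy call graph (hypothetical data): function name => names of the functions it calls.
calls = Dict(
    :f_ok  => [:g_ok],
    :g_ok  => Symbol[],
    :f_bad => [:g_ok, :alloc],
    :alloc => Symbol[],
)

# Functions that carry the hypothetical "unsafe" flag.
flagged = Set([:f_ok, :g_ok, :f_bad])

# Return the first call chain from `root` that reaches an unflagged function,
# or `nothing` if every function reachable from `root` is flagged.
function first_violation(root, calls, flagged)
    root in flagged || return [root]
    seen = Set([root])
    stack = [[root]]  # each entry is a call chain ending at the function to expand
    while !isempty(stack)
        chain = pop!(stack)
        for callee in get(calls, chain[end], Symbol[])
            callee in flagged || return vcat(chain, callee)
            if !(callee in seen)
                push!(seen, callee)
                push!(stack, vcat(chain, callee))
            end
        end
    end
    return nothing
end

ok_result  = first_violation(:f_ok, calls, flagged)
bad_result = first_violation(:f_bad, calls, flagged)
```

Returning the offending call chain rather than a plain boolean is a design choice in this sketch; it would let an error message name the call that broke the restriction.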

The really hard part I think is when to throw the error. Some options:

- When the method is defined. In this case it needs to have concrete argument types, so we can compile it right away and do the checks.
- When the function is called: probably impossible, since I guess all we could do is print a message and abort, which is no good.
- When @cfunction is called on it. This is probably the most strictly compatible with julia's normal execution model, but may be too late to be most useful.

vchuravy · 2022-01-11T23:32:09Z

Yeah, right now the check happens when the function is compiled (which may or may not be during @cfunction, as we learned in #43748). We probably also need to propagate it upwards to the capi wrapper. Right now a failed check terminates the program, which is also not the best error message.

> I assume this needs to be recursive as well; storing a flag on each function if it is "unsafe" and checking that unsafe functions only call other unsafe functions?

Yeah this seemed challenging and very invasive. It might be useful to have a "strict" mode and a lenient mode.

We could have multiple flags? And then allow them to be set from the macro.

julia.unsafe.allow_tls = {true|false}
julia.unsafe.allow_alloc = {true|false} # false -> implies allow_tls=false
julia.unsafe.allow_rt = {true|false} # false -> implies allow_alloc=false, allow_tls=false

julia.unsafe.strict = {true|false} # all called functions must be "unsafe" as well
julia.unsafe.nocalls # no function calls
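
The implication rules in the comments above (`allow_rt=false` implies `allow_alloc=false`, which in turn implies `allow_tls=false`) can be sketched as a small normalization step. This is only an illustration under assumptions: `normalize_flags` is a hypothetical helper, and the flags are modeled as a plain named tuple rather than as LLVM metadata.

```julia
# Hypothetical helper: apply the implication chain allow_rt -> allow_alloc -> allow_tls.
# Disabling a broader permission also disables every narrower one.
function normalize_flags(; allow_tls=true, allow_alloc=true, allow_rt=true)
    allow_alloc = allow_rt && allow_alloc    # no runtime implies no allocation
    allow_tls   = allow_alloc && allow_tls   # no allocation implies no TLS access
    return (; allow_tls, allow_alloc, allow_rt)
end

no_rt    = normalize_flags(allow_rt=false)
no_alloc = normalize_flags(allow_alloc=false)
```

Normalizing once, when the flags are set, would let a verifier test each flag independently instead of re-deriving the implications at every check.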

vtjnash · 2022-01-12T01:41:27Z

Since we always support runtime function replacement (aka #265), I am not sure how this could ever be made to work.

vchuravy · 2022-01-12T01:54:07Z

> I am not sure how this could ever be made to work.

Can you expand on this? There are two different properties here that I want.

1. For #41616 (Emit safepoints at function entry), I need a way to annotate methods that are going to be called on foreign threads as not eligible for safepoint insertion.
2. I would like to have a static verification that a @cfunction compilation is not going to contain runtime interactions.

Now I can see how #265 can come and rain on my parade, by invalidating the function (actually a good question what happens with a @cfunction in that case). Precisely because of this I would like to have a verification step that will error, instead of having mysterious failures down the road.

vtjnash · 2022-01-12T15:50:21Z

@cfunction always interacts with the runtime first, to check for method updates (à la #265), and only then decides what method to call.

JeffBezanson · 2022-01-12T18:50:18Z

That makes me think this should really be part of @cfunction then --- you tell cfunction "give me a pointer to code for this function with the following properties" and it errors if it can't fulfill that request.
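
A rough sketch of that "request a pointer with properties" shape, using today's `@cfunction` and `ccall`. The property check is only a stand-in: `inferred_concrete` is a hypothetical helper that asks inference (via the internal `Base.return_types`) whether the return type is concrete. It does not verify TLS-freedom or the absence of runtime calls, which is what the verifier in this PR is aimed at.

```julia
# A plain Julia function with C-compatible argument and return types.
add_one(x::Cint)::Cint = x + Cint(1)

# Hypothetical stand-in for a real verifier: checks only that inference finds
# concrete return types. It says nothing about TLS or runtime interaction.
inferred_concrete(f, argtypes) = all(isconcretetype, Base.return_types(f, argtypes))

# Refuse to hand out a pointer when the requested property does not hold.
inferred_concrete(add_one, (Cint,)) ||
    error("add_one does not satisfy the requested property")

# Obtain a C-callable pointer and call through it, as foreign code would.
ptr = @cfunction(add_one, Cint, (Cint,))
result = ccall(ptr, Cint, (Cint,), Cint(41))
```

Raising the error at the point where the pointer is requested yields an ordinary Julia exception rather than an abort during compilation.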

tkf · 2022-01-16T04:13:06Z

(:+1: My intuition before reading all the discussion was also that it should be done at @cfunction.)

If the compiler checks that the given function does not need ptls etc. at the time @cfunction is invoked, it sounds strange to call this @unsafe. If we conceptualize @unsafe as being like @inbounds, it means that it's the programmer who checks that the given code is safe. It's similar to Rust's unsafe. But then if the compiler can throw at @cfunction, it's the compiler that verifies the ptls-freeness (and other things). In a way, the function annotated by this macro is very safe.

So, how about @runtimefree? I'm thinking of "runtime" as PARTR + GC + dynamic dispatch + (other things accessed through current_task, like threadid).

adkabo · 2022-02-02T04:02:24Z

#43852 provides @assume_effects. @assume_* could be a more general naming convention for these.

vtjnash · 2022-10-25T14:41:50Z

No longer needed. Though we could add a metadata/compiler option that turns off #41616

vchuravy added 3 commits January 9, 2022 19:22

Add Expr(:funcinfo) for setting LLVM metadata on a function

c4ca9dc

Add macro unsafe

d200bee

Add nascent unsafe verifier pass

1b021a2

vchuravy requested review from JeffBezanson and vtjnash January 10, 2022 19:05

vchuravy mentioned this pull request Jan 10, 2022

@cfunctions precompiled into a system image are not executable from a foreign thread due to TLS accesses #43748

Closed

AriMKatz mentioned this pull request Feb 4, 2022

Idea: Integration with JET.jl tshort/StaticCompiler.jl#54

Open

jpsamaroo mentioned this pull request Feb 11, 2022

-O0 flag broken JuliaGPU/AMDGPU.jl#195

Open

vtjnash closed this Oct 25, 2022

vtjnash deleted the vc/funcinfo branch October 25, 2022 14:41
