Request: Store the Expr behind each Method object #24347

cstjean · 2017-10-26T10:59:22Z

If we had a function code_expr(::Method) that returns :(function foo(...) ...), it would make augmenting existing functions (for instrumenting profiler, tracing, ...) vastly easier to do without digging into the internals or parsing source files (which may have been modified). @davidanthoff has another use for it in Query.jl, and it would kinda solve #2625

The text was updated successfully, but these errors were encountered:

StefanKarpinski · 2017-10-26T13:31:58Z

I would propose storing the original source text instead – we can always reparse it on demand.

cstjean · 2017-10-26T14:33:36Z

What would you store for function-defining macro expansions like @with_kw struct ... end?

JeffBezanson · 2017-10-26T14:47:30Z

Are you sure you really want the AST? ASTs are very complicated, with deeply-nested structure and multiple possible representations of many forms. For program transformations the IR is much easier to use.

davidanthoff · 2017-10-26T16:15:32Z

I think I want the AST, but I'm certainly not sure ;) My use case is that I want to translate queries into say SQL, and parts of query specification by users comes in the form of anonymous functions. I think that is probably easier if the representation I get is as close to what the user typed. That is the path the LINQ folks took, so it is well trotted. Might also work with IR, but I'm just not familiar enough with that to really be able to tell.

chakravala · 2017-10-28T10:17:23Z

Yea, the AST would be great, I really want this feature too. If you only provide the string, then when you parse it, there might be unnecessary extra objects inserted, like :line blocks, which I find annoying and also blows up the size of the new AST (especially if function has a lot of lines) in my situation I would have to make an extra call to a recursive function that deletes all the :line objects from the AST, which will take up even more extra time and effort, in addition to parsing. so AST might be more direct and efficient

However, I could see how if you only store the string, it might be smaller storage.

timholy · 2017-10-28T11:36:46Z

Perhaps it's not ideal, but for the record here is where we stand now. I can illustrate this most easily with tools in Revise, but of course you might want a lower-level implementation that wouldn't require reading the whole file. I'll choose a method in Base because those are a bit trickier (for a package, m.file contains the absolute path, but for Base methods the path is truncated).

julia> Revise.track(Base)

julia> m = first(methods(svd))
svd(A::BitArray{2}) in Base.LinAlg at linalg/bitarray.jl:90

# This is just to get the full path to the source file, in a way that's guaranteed to match the key
# in the src text cache (`normpath` on the result from `Base.find_in_path` should presumably
# work too)
julia> fn = first(Iterators.filter(str->endswith(str, String(m.file)), keys(Revise.file2modules)))
"/home/tim/src/julia-1.0/base/linalg/bitarray.jl"

julia> src = split(Revise.read_from_cache(Revise.file2modules[fn], fn), '\n');

julia> src[m.line]
"svd(A::BitMatrix) = svd(float(A))"

Or for the expression:

julia> src = Revise.read_from_cache(Revise.file2modules[fn], fn);

julia> i = 0; for j = 1:m.line-1 i = findnext(x->x=='\n', src, i+1); end; i
2409

julia> ex = parse(src, i)
(:(svd(A::BitMatrix) = begin
            #= none:2 =#
            svd(float(A))
        end), 2444)

These would allow someone to implement code_expr(m::Method) using currently available tools and then see where it starts to encounter problems. One that is fairly predictable is

for T in (Int, Float64)
    @eval foo(x::$T) = T
end

jw3126 · 2017-10-28T12:37:39Z

If one could access the AST of a method, one could build a Cassette like mechanism that overdubs the AST instead of IR of a method. For some transformations AST is better. For example one could build custom @inbounds like macros that do things like:

Remove all assertions from a function
Remove all @argcheck from a function

JeffBezanson · 2017-10-28T18:51:23Z

If you only provide the string, then when you parse it, there might be unnecessary extra objects inserted, like :line blocks

The best way to handle this is to add an option to parse to exclude location info. Our default behavior is to include location info, so if we saved the AST for a method it would include line nodes.

I still strongly recommend against doing program transformations on a representation with probably ~100 forms, instead of basically just assignments, branches, and calls. And even if you're ok with that, how are you going to handle unexpanded macros and nested functions? With nested functions, it's almost incoherent to talk about the "AST of a method", since the outer function contains expressions for multiple methods, and the inner function expressions are missing context needed to interpret them correctly. Do you want to re-implement our scope analysis and closure conversion, or have us hand you all the information on a silver platter via the IR?

chakravala · 2017-10-28T19:10:16Z

The best way to handle this is to add an option to parse to exclude location info. Our default behavior is to include location info, so if we saved the AST for a method it would include line nodes.

Having an option like this would be good.

cstjean · 2017-10-29T00:46:29Z

I still strongly recommend against doing program transformations on a representation with probably ~100 forms, instead of basically just assignments, branches, and calls.

Tracing and profiling just involve wrapping the function body. Whether it's done with the AST or the IR doesn't matter much, but the AST is exported, supported and documented.

JeffBezanson · 2017-10-29T01:14:25Z

The IR is also supported and documented, and it's already possible to do transformations at that level, which e.g. ParallelAccelerator.jl and Casette.jl do.

Whether it's done with the AST or the IR doesn't matter much

Not true. The best example, as I said, is probably inner functions, where the AST doesn't correspond to the actual function in a simple way. But at the IR level it's simple, since everything has been converted to a top-level function with no free variables.

If it's useful to be able to wrap function bodies, it's possible we should have a utility for doing just that, since as you observe it's not even necessary to examine the AST in detail to do it --- letting you see the full AST for that would be overkill.

cstjean · 2017-10-29T01:15:23Z

@chakravala You might find MacroTools.striplines useful.

chakravala · 2017-10-29T01:25:02Z

@cstjean Thanks for the tip, I've already implemented my own Reduce.linefilter actually, since I enjoy writing recursive tree algorithms for fun.

davidanthoff · 2017-11-16T01:43:36Z

@JeffBezanson could you help me understand what IR I would get? For example, say I have a function like

function foo(x::Function)
    ir = ... # Magic!
end

And then I call it like say this: foo(i->2*i + log(i) == 23).

Would the suggestion be that I work with something that looks like what I would get if I call @code_llvm on the anonymous function? Or something else?

JeffBezanson · 2017-11-16T02:02:10Z

It would be the result of @code_lowered.

oxinabox · 2019-10-22T16:11:02Z

The updated version of @timholy 's revise based answer from before is:

Use CodeTracking.jl’s definition function.
https://github.com/timholy/CodeTracking.jl

It is what I do in Arborist
https://github.com/oxinabox/Arborist.jl

cstjean mentioned this issue Mar 26, 2018

Provide BODY JuliaDocs/DocStringExtensions.jl#54

Open

johnnychen94 mentioned this issue Oct 15, 2018

Feature request: support @showprogress for function type timholy/ProgressMeter.jl#116

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Request: Store the Expr behind each Method object #24347

Request: Store the Expr behind each Method object #24347

cstjean commented Oct 26, 2017 •

edited

StefanKarpinski commented Oct 26, 2017

cstjean commented Oct 26, 2017 •

edited

JeffBezanson commented Oct 26, 2017 •

edited

davidanthoff commented Oct 26, 2017

chakravala commented Oct 28, 2017 •

edited

timholy commented Oct 28, 2017

jw3126 commented Oct 28, 2017

JeffBezanson commented Oct 28, 2017

chakravala commented Oct 28, 2017

cstjean commented Oct 29, 2017

JeffBezanson commented Oct 29, 2017

cstjean commented Oct 29, 2017

chakravala commented Oct 29, 2017 •

edited

davidanthoff commented Nov 16, 2017

JeffBezanson commented Nov 16, 2017

oxinabox commented Oct 22, 2019

Request: Store the Expr behind each Method object #24347

Request: Store the Expr behind each Method object #24347

Comments

cstjean commented Oct 26, 2017 • edited

StefanKarpinski commented Oct 26, 2017

cstjean commented Oct 26, 2017 • edited

JeffBezanson commented Oct 26, 2017 • edited

davidanthoff commented Oct 26, 2017

chakravala commented Oct 28, 2017 • edited

timholy commented Oct 28, 2017

jw3126 commented Oct 28, 2017

JeffBezanson commented Oct 28, 2017

chakravala commented Oct 28, 2017

cstjean commented Oct 29, 2017

JeffBezanson commented Oct 29, 2017

cstjean commented Oct 29, 2017

chakravala commented Oct 29, 2017 • edited

davidanthoff commented Nov 16, 2017

JeffBezanson commented Nov 16, 2017

oxinabox commented Oct 22, 2019

cstjean commented Oct 26, 2017 •

edited

cstjean commented Oct 26, 2017 •

edited

JeffBezanson commented Oct 26, 2017 •

edited

chakravala commented Oct 28, 2017 •

edited

chakravala commented Oct 29, 2017 •

edited