Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When is a start function's post-return option called? #100

Open
alexcrichton opened this issue Sep 1, 2022 · 14 comments
Open

When is a start function's post-return option called? #100

alexcrichton opened this issue Sep 1, 2022 · 14 comments

Comments

@alexcrichton
Copy link
Collaborator

In thinking about start functions recently one question I wasn't sure how to answer is when the returned values are "invalidated" or otherwise when the component can be reentered. A better way to phrase this I found was to ask when is the post-return canonical abi option for a start function invoked? Unlike normal functions which have a clearly defined time when the results have been processed (e.g. lifted and then lowered back into the destination) start functions are different where their values are consumed possibly much later during a component's full instantiation process.

Another related but slightly different scenario: if a component exports both a function and a value, if the embedder supports invoking the function before reading the value then this forces the value to be removed from linear memory and copied to the host. Otherwise ideally the embedder would like to leave the value in linear memory but would require that the exported function is not invoked until the value export is read. This seemed like a pretty different model than the current "you get a bag of exports" model that core wasm has and I wanted to make sure this was intentional.

@lukewagner
Copy link
Member

Ooh, great question! So one thing that partially addresses the question is that, as currently written in canon_lift (which is what start must invoke), may_enter will be False until post_return is called and thus, in the "values and functions exported" case you mentioned, the host wouldn't be allowed to call an export before lowering all the exported values and calling post_return. Now, in the case of component-to-component linking, it does seem possible for a parent component to attempt to call an export of a child instance before lowering all the values, so that should definitely trap. But you're right that it is unclear when post_return gets called in component-to-component linking.

Logically, it feels like post_return should run "right after" the last value is lowered, and the lifetimes of values are statically-analyzable enough that that could be clearly-defined, but that does feel kinda weird. Separately, I've been wondering in the back of my mind if it is a problem that, while params/results are lifted/lowered in a strict order during cross-component calls, value imports/exports are not. With the current Canonical ABI, I don't think the ordering is semantically observable (since there is no mutation allowed during the window), but with post-MVP adapters, the order would be quite observable and so in theory this could be a new source of late bustage (if a component impl assumed and was only tested to lift/lower in a fixed order). One fix would be to limit the value index space validation rules to force values in the value index space to be consumed in order. But maybe another is to say that the values of an instance (obtained via import or instantiate) have to be consumed all-at-once. If we did that, then the post_return could be more-easily defined to be called right after the all-at-once. As a rough idea, maybe there could be some sort of (splat <instanceidx>) argument in start definitions that splatted all the value exports of <instanceidx> into the arguments of the start invocation, with post_return called right after the invocation completed?

Definitely open to other ideas, though.

@alexcrichton
Copy link
Collaborator Author

If the order of lowering is worrisome and it's fixed by consuming everything at once, how would that affect a start function producing 2 values which are then the arguments to 2 separate other start functions?

Additionally for consuming everything at once all the results of one start function could go into the parameters of a different start function, but in a different order, so I think the order would be observable there?

For the precise semantics of post-return I think it might be good to have some Python-like pseudo code describing how start functions are invoked. I can sort of see how this all aligns but everything is different enough from the current adapters that something still feels off to me (but it's hard to put my finger on it).

One example question I think that still lingers: if internally within a component there are two start functions which produce two values which are exported from the final root component, in what order are those post-return closures invoked? Is it the order that the embedder consumes the values? Or is there like a queue of pending post-return which gets drained in-order at some point?

@lukewagner
Copy link
Member

Yep, good point; "all at once" doesn't make sense. So another idea, which perhaps lines up with what you're hinting at too, was that the value index space should perhaps work like a value stack instead of like an index space, with start working like core wasm call. In addition to ensuring an ordered lowering of lifted values, each semantic stack element could be defined to contain a pair (lifted-value, post-return-closure), with the semantics of start saying to call the post-return-closure for all popped values and push a post-return-closure of its own for the last of its own return values. Could that make more sense?

And you're right, it'd be useful to cover this in the Python pseudo-code too; what's tricky (and why I didn't dig into it already) is that it requires taking on full module-linking (ultimately defining a little instantiation-time "linking interpreter" that executes component-level definitions as its instruction set). @pl-semiotics is actively working on a reference interpreter (that embeds the core wasm reference interpreter and is meant to correspond directly to the formal spec) which would include all this and execute proper .wast tests, so that should cover this gap before too long.

@alexcrichton
Copy link
Collaborator Author

Oh I wasn't thinking of a stack myself, I'm not actually sure what to do about this so I'm just asking questions!

If there's only a stack, though, that would prevent consuming values in different orders, right? That would mean that if a start function produced 2 values you couldn't use them in the reverse order as arguments to a different start function?

@lukewagner
Copy link
Member

The idea is that the stack would ensure the order that the values are consumed, but one remaining question from what I described above is how this ordering is preserved when passing values between components through imports (and with) and exports (and alias): at the moment these don't imply an order so the order could get scrambled. I'll need to think some more about this to propose an answer. But definitely it's seeming like we'll need some significant tweak to how value imports/exports work; thanks again for highlighting these issues!

@lukewagner
Copy link
Member

So I thought some more about what adopting a fully stack-based approach for values would look like over the weekend and I realized it would break our ability to freely union and intersect world files which is what would allow, e.g., a host to run "any component that targets any of these (unioned) world files". In particular, if two world files had different sets of ordered value imports, unioning them would not in general be able to produced a merged order that could run components targeting either world. That seems unfortunate, especially since I think this is something hosts would want to do and we otherwise would mostly get it for free from unordered+named imports/exports. I think this is also analogous to the classic "fragile base class".

So, setting aside the idea of fixing the order of value lowering, I was able to think of a modest tweak to the existing proposal that (hopefully) gives us a clear semantics for when to run post-return. The idea is to say that every value in the value index space is a pair (value, post-lower), where post-lower is a closure that is semantically called immediately after the value is lowered. Then, all post-lower closures created from a single start would share a single mutable counter that starts at the number-of-results and gets decremented by each post-lower call, with the final post-lower call calling post-return. While that sounds dynamic, I believe the existing definition flow analysis needed to do adapter fusion should be able to statically determine where the longest-living result value is consumed and place the post-return call there. WDYT?

@alexcrichton
Copy link
Collaborator Author

I think that should work, yeah. I'm also considering the case where a component exports a value to the embedding itself where we still don't want a sort of refcounted counter but I believe the solution there is to have some sort of step where the embedder reads all the values from the component and then, afterwards, has a list of post-return functions to run (e.g. instance flags "may enter" to reset). The dynamic-ness there can probably be papered over with a new JIT function/trampoline/etc, though if necessary.


While I'm here and thinking about this, I think this might be a good place to discuss a somewhat tangential but still related question. One thought I had on this is "what to do with an unused value"? This naively isn't possible due to the linearity rules but there's a few cases I think that aren't covered by the current Binary.md (and definitely not wasmparser's implementation of values)

  • What if an exported value is never aliased into another component? When is that value's post-return closure invoked?
  • What if an exported value is aliased twice into another component? I don't think that this should be possible but I don't think that the current Binary.md covers this case.

In general I was under the impression that values were intended to be a small addition to the existing component model that largely fit within the rest of the system seamlessly. I don't think the linearity rule plays well with this, though, and I feel that the linearity on values makes them different enough that it may warrant an integration into components via a different means than the otherwise-standard import/export/alias integration. For example instantiation may want to produce values directly onto the value stack to avoid dealing with aliases. The initial values on the value stack at the start of instantiation could be defined via a new "value section" or something similar. I think things can have a similar flavor where items are named and therefore can be reordered and added to/removed as necessary but I'm becoming more convinced that due to linearity values shouldn't be "just another kind of item"

@lukewagner
Copy link
Member

Those are also good questions not adequately described by the explainers. The "linearity" requirement means that both of those two cases (in bullets) should be validation errors. Practically speaking, I think the way this would be implemented is that both the value index space and the exports of instances in the instance index space would attach a "used" flag to the value type, used to catch double-use and check use before the end of a component scope.

But I agree that these extra complications raise the question of whether it's a bad idea to try to shoehorn values into index spaces or whether values should work like a stack instead (just like core wasm function bodies). I spent some time sketching out how this might look in the limit: e.g., components could have top-level params and results (with valtypes), so that components were essentially functions + extra named imports/exports and instantiate was like a souped-up function call. The problem I ran into with this approach is what I described above, and it stems from making the precise order of value imports/exports a part of the component's public signature.

But I suppose another point on the design spectrum would be to keep the order of value imports/exports out of the public component interface, but have the component's internal validation push its value imports onto a value stack that is pushed/popped by instantiate, start and export. I didn't think much about this option (I was hung up on the order-of-lifting question, which implied making order a part of the public interface), but it seems feasible on first glance. Does it sound better to you?

@alexcrichton
Copy link
Collaborator Author

Sorry I think I may not be being very clear. I'm not trying to say we should have value index spaces or value stacks, I'm instead trying to come at this from a different angle of use cases that seem reasonable given the structure of components and ensuring that the design fits the use cases. Some use cases I'm thinking of are:

  • It's intended to be backwards-compatible to add exports to a component. This means that I can add a value export to a component, but that means preexisting consumers aren't actually using that value. This is what I meant in my first bullet where a component has exported values that aren't consumed/aliased. You mention that this should be a validation error but I believe that would go against the idea that adding exports is always backwards-compatible (and compatible from a subtyping perspective as well).

  • With the subtyping relationship expected from imports/exports components expect to be able to reorder the names that are referenced without affecting external consumers. This has implications on a stack-based representation if the stack-based representation doesn't have the ability to shuffle/move things around on the stack itself.

  • Similar to the above two I'd expect to be able to remove a value import from a component and have it still be instantiable in prior components which gave it a value during instantiation. This means that the value given isn't actually consumed or ever lowered, it's just "forgotten" in a sense.

  • My second bullet point was pointing out that while some error needs to happen somewhere it's not obvious where it should happen. You mention that aliasing a value twice should be a validation error, but this I don't think can be done because you could instantiate a component, alias its exported value, then export the whole component instance itself. Somewhere far later down the line someone else could pick up that component and alias the same exported value. Validation is a local operation though that (afaik) is intended to not require the dataflow analysis and such that Wasmtime currently implements. Making a validation error here, however, I think would require this analysis. An alternative solution to my second bullet would be to have a dynamic flag that's checked, but I'm presuming that we want to avoid this where possible since it's an instantiation error rather than a validation error.

I have no horse in the race for index-space-vs-stack or how exactly to represent all this. I think as long as everything lines up in a way that is satisfactory to everyone then I don't really mind how the precise design all ends up.

I believe the initial motivation for a stack as opposed to the index space that exists today was to solve the issue related to when to call post-return, but the given conclusion now I believe is that values are actually a (value, post-return) pair where the post-return has a shared mutable counter which is decremented when called. Given that adjustment I'm not sure that a stack is even necessary any more over the flexibility of an index space which allows consuming things in whatever order is necessary as opposed to maintaining a stack discipline.

To me the major remaining design question is how you get a value out of a component instance. If it's with an alias statement of some kind then I believe this will force a dynamic runtime flag to get checked whenver it's accessed. With the full dataflow-analysis that Wasmtime does, though, this flag can probably go away except for the root level of a component. An alternative design I didn't describe very thoroughly above is one where alias directives don't support value types at all, instead the instantiate directive directly either pushes onto a stack or defines new entries in the index space. This does not require a flag I believe but would remove the ability to export a whole instance which can then later have a value extracted from it.

@lukewagner
Copy link
Member

Ah, that's a great point and indeed independent of stack-vs-index-space: if subtyping says I can compatibly ignore values, then not using an imported/exported value can't be a validation error. If we want to attach an observable effect to when the last value is consumed, I think this means that we need to associate an explicit point when unused values are implicitly "dropped" (which possibly calls the final post-return), keeping the values "linear". Independently, I had been wondering if, as an optimization for unnused values, we should have a drop top-level definition that marks a value as "used" without actually performing a lower. Since our subtyping is coercive, subtyping implies a coercion function that takes values of the subtype and produces values of the supertype, and so this could be the natural place to specify where the drop semantically happens. So I think that could address bullets 1-3 above?

For bullet 4, if we stick with the index-space approach, then the way I was imagining this working as a purely static validation constraint is that, when you alias a value export of an instance, the instance type (used to validate all subsequent definitions, including an export definition that re-exports the instance) ceases to have that export. Thus aliasing mutates the instance type "in place" during the linear validation pass (although I think one can formally think of as saying that validation of each instruction has an input type-state and an output type-state, just like with stack-based validation).

Lastly, I agree with your analysis that the index-space is probably a win over the stack approach; I was just opening up the possibility for discussion.

@alexcrichton
Copy link
Collaborator Author

Yeah I think that drop-ing values during coercions for imports would work to handle the import/export problem. I wondered about an explicit drop operation myself but I think it's reasonable to omit originally because if you want to drop something you could instead either not import it or not alias it.

I did not think of mutating the type but I'm wary of doing that since AFAIK this would be the first time that a type could be mutated after being introduced. One question that initially comes to mind is how to handle the case where a component is instnatiated and then that component instance is passed to two separate instantiations. Does the first instantiation "erase" all value exports in the instance type, leaving the rest of the instance type available for the next instantiation? In some sense a change like this would require instance types to become linear themselves, but only until they have no value exports I guess?

@lukewagner
Copy link
Member

Yeah, it's definitely a new thing and worth considering carefully. In programming languages, this sort of "type of a thing changing in response to that thing being used" shows up with, e.g., typestate analysis and session types, so you could think of what we're doing here as in that spirit. And again, stack-machine validation is doing this too, with each instruction transforming the "stack type", so this seems to be in the nature of linear types. The parallel work on formalizing the component model will be useful for precisely defining the validation rules and proving that they ensure what we think they ensure.

To your question: after an instance $i was passed to a first instantiate $C, the validation rules would say that, after the instantiate, $i's type would have the values (mentioned in the type of $C) subtracted. And yeah, I think you're right: an instance type is linear if it contains any linear values; once it has the last linear value subtracted, it ceases to be linear.

@alexcrichton
Copy link
Collaborator Author

At a conceptual level this seems like it could work, but I do want to emphasize again that the implementation will have no similar prior art to draw upon in wasm validators which want to implement this. While sure the stack can be manipulated in wasm functions I don't think that's the same thing because this is the type of a referenceable object. This means that a reference to "instance 1" will have a different type depending on when that reference is queried. I know at least historically in wasmparser we've been able to represent an instance type as "this is an instance of component X" and now that will no longer be possible because it either needs to be copied out or have some sort of list of "it's that component minus these paths-to-some-value-items that are consumed".

I don't think it's unimplementable by any means but I don't want to accidentally trivialize the difficulty implementation. Already values were I think originally billed as "should be easy to implement given everything else" but as this thread has shown there's a lot of tricky and subtle things and I don't believe they're going to be easy to implement by any stretch.

@lukewagner
Copy link
Member

Yes, I think that's fair. It may be a good idea to wait to implement this until after some more pressing features are done and reevaluate then.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants