Add local and global variable support #13199

dcbaker · 2024-05-09T16:10:03Z

This is still very WIP, and it's got a lot of other work it's based on.

Closes: #2607

Instead of storing all state inside the Interpreter itself, store it inside a separate class, and then compose those classes. This has some advantages: - We can pass this state to type convertors and validators, allowing more validation code to be done in shared validators instead of in the bodies of functions - We can share this state with modules, removing the need for repeating ourselves in the ModuleState object - It simplifies the Interpreter class, which is very complicated - It simplifies creating subprojects, as we can re-use a single interpreter, simply replacing it's local state while maintaining the global state

Unfortunately we need to keep the `.subproject` attribute to continue to implement an interface. I have implemented this as a method, which simplifies this, as it simply looks up the state value. I have left the uses of .subproject as they are numerous and we can't remove the attribute anyway.

Also initialize it to a non-null value, as we don't expect it to be null, and it's easy to fix. This also requires a property for interfaces reasons. It is, however, used much less and as such I have replaced all uses that are not part of that interface.

This was only used in a couple of places, and requires extra tracking to ensure it is correct, while we already have `state.local.current_node.lineno`, which is always accurate and up to date.

Summary is shared across all subprojects. It's also mutable state that might need to be turned back if a subproject fails (unlike some other attributes like the holder maps and modules), so it needs to be stored outside the Interpreter itself.

These are renamed to `args_frozen` since `global_args_frozen` will move to the `GlobalState` so they wont conflict, and to reduce some amount of typing.

… use it

Which is built, but never used

Since both the AST Interpreter and the main Interpreter both use this in exactly the same way I put it in the base state.

For the main Interpreter, this is actually a useless second tracking of information in the Build class, but for ASTInterpreter, which doesn't have a Build, it needs to be an attribute. To ensure API compatibility, I've used a `@property` in Interpreter.state.global. I've not used this as properties (methods) have more overhead than an attribute lookup This also uncovered an interesting bug, when a function is called inside of `project()`, like `project(version : files('VERSION'))`, we would use the default subproject_dir for validation, which provides a mechanism for sandbox escape. I'm not entirely happy with my patch changes to make that more robust, but they do make it more robust (and work).

This is, unfortunately, still tracked in two place. One in `Build.dep_manifests` and one in `Interpreter.state.local`. I don't see a good way to get rid of this, unfortunately

And rename to `default_subproject_options`, which help clarify the difference between that and `project_default_options`

Since it's only used to check membership, a set makes more sense. Also add some documentation to the attribute

Since we'll soon be using a single Interpreter with changing State objects instead of whole new Interpreters, we can't pass a static reference to build here. Instead, access it through the Interpreter reference, so that each subproject will use the right state

Otherwise as the build changes the backend will be incorrect. Mypy gets really confused about `@property`s, and module imports, so I've had to change `import build` to `import build as _build`.

The former performs a linear search, which is slower than the set based search.

A second object is used, do the need to attach a few extra attributes. This is partially a nice cleanup, and partially to get to the point that we can get rid of multiple Interpreter instances.

It's not actually useful, we can just add the root meson.build file to the depfiles when we translate the meson.build into ast. If there's a provided ast then we don't go down that path anyway.

Since we're about to stop replacing the interpreter and use a single interpreter for the entire run.

Instead, replace the state with new state. This means we don't have to re-calculate any of the methods or tell the sub-interpreter about the state of the super-interpreter, it's already there. Since all of the state that is not an implementation detail of the Interpreter itself is no longer bound to the interpreter this is all very trivial. This fixes a number of potential bugs created by side-effects from a failing, optional, subproject (Assume A => B => C, where C configures successfully but B fails after that) 1. If B called `meson_override_dependency()` or `meson.override_find_program()` those overrides would remain, even though the targets would not exist. 2. Targets from C would be built, even though B will not be, and A does not use them. 3. If C called `meson.override_dependency()` or `meson.override_find_program()` these overrides would remain, and could affect A 4. If B is optional, but a subproject D is called if B fails, and D calls C with different `default_options` than B, D will get a copy of C using those different options 5. Summary options for C will be printed, even though it is unused

This is currently unused

I'm not in love with this, because I feel like it really should be part of the AssignmentNode.

This reverts commit cb7e8d212e8177692b28374f0217b5cbf0a4299e.

For what scope a variable is assigned to

TODO: - the operator is incorrect if scoping is used - += shouldn't inherit this operation - the formatter/rewritter needs to be taught about this

It's unused, and with the addition of local variables it becomes a footgun, since there are both local and global variables, and this only returns global variables.

Which is less code, and will properly handle local vs global variables

This allows the writing of code like: ```meson var: global = 'bar' var: local = 'foo' message(var) ``` which will print `foo`, as local variables always shadow global ones. There are a few things here I'm not sure how to handle. 1. Should we allow a variable to be declared global and local in the same file? 2. Should scoping only be allowed on the initial declaration? Ie, should we have a DeclarationNode that is different than an AssignmentNode?

dcbaker added 30 commits May 9, 2024 09:07

interpreter: move source_root to GlobalState

03d9696

interpreter: remove current_lineno

765037f

This was only used in a couple of places, and requires extra tracking to ensure it is correct, while we already have `state.local.current_node.lineno`, which is always accurate and up to date.

interpreter: move argument_depth to LocalState

4d5e5ea

interpreter: move tmp_meson_version to GlobalState

4ddd4ba

interpreter: move subdir and root subdir to LocalState

0eb1bd5

interpreter: move variables to LocalState

3aeb26f

cmake: use typed_pos_args for CMakeSubproject.get_variable

544bdac

interpreter: Move active_project_name to LocalInterpreterState

6059434

interpreter: move rule relaxations to LocalState

7575807

interperter: Remove unused Interpreter.subproject_directory_name

7b21fb5

interpreter: move project_args_frozen to LocalState

f6aef55

These are renamed to `args_frozen` since `global_args_frozen` will move to the `GlobalState` so they wont conflict, and to reduce some amount of typing.

interpreter: move global_args_frozen to GlobalState

82f6414

interperter: move compilers to LocalState

a18bee1

interpreter: move build_def_files to GlobalState

f07eb12

interpreter: only calculate build_filename if we're actually going to…

580db21

… use it

interpreter: remove Interpreter.generators

958a1bc

Which is built, but never used

interpreter: move processed_buildfiles to InterpreterBase.state.local

8b65deb

Since both the AST Interpreter and the main Interpreter both use this in exactly the same way I put it in the base state.

interpreter: move subproject_stack to GlobalState

5e293ef

interpreter: move configure_file_outputs to LocalState

1e812ce

interpreter/dependencyfallbacks: Add copyright header

252bfbe

interpreter: move Build to GlobalState

911c3c7

interpreter: move project_version to LocalState

2513168

This is, unfortunately, still tracked in two place. One in `Build.dep_manifests` and one in `Interpreter.state.local`. I don't see a good way to get rid of this, unfortunately

interpreter: move default_project_options to LocalState

33a0b89

And rename to `default_subproject_options`, which help clarify the difference between that and `project_default_options`

interpreter: move project_default_options to LocalState

74dbeef

build: store Build.modules as a set

5cbe29b

Since it's only used to check membership, a set makes more sense. Also add some documentation to the attribute

dcbaker added 22 commits May 9, 2024 09:08

backend: Access Build through Interpreter

1de8319

Otherwise as the build changes the backend will be incorrect. Mypy gets really confused about `@property`s, and module imports, so I've had to change `import build` to `import build as _build`.

interpreter: move user_defined_options to GlobalState

68dfea6

interpreter: don't use in dict.keys() use in dict

9766a07

The former performs a linear search, which is slower than the set based search.

interpreter: move subprojects to GlobalState

916b801

interpreter: move environment and coredata to GlobalState

aa875a9

interpreter: move backend to GlobalState

80c07cb

interpreter: move ast to LocalState

bfdb9b9

interpreter: Make SubprojectHolder an ObjectHolder of LocalState

ff74756

A second object is used, do the need to attach a few extra attributes. This is partially a nice cleanup, and partially to get to the point that we can get rid of multiple Interpreter instances.

interpreter: remove is_translated

ae3991e

It's not actually useful, we can just add the root meson.build file to the depfiles when we translate the meson.build into ast. If there's a provided ast then we don't go down that path anyway.

interpreter: access subproject name from interpreter state in MesonMain

160070d

Since we're about to stop replacing the interpreter and use a single interpreter for the entire run.

interpreterbase/state: Store local variables

8bfd475

This is currently unused

mparser: add annotation for better type analysis

36f6343

mparser: Add scope to IdNode

b2b1ef1

I'm not in love with this, because I feel like it really should be part of the AssignmentNode.

Revert "mparser: Add scope to IdNode"

2c6b3ae

This reverts commit cb7e8d212e8177692b28374f0217b5cbf0a4299e.

mparser: Add a Scope enum

7e2b361

For what scope a variable is assigned to

mparser: Add scope to AssignNode

a4668a6

TODO: - the operator is incorrect if scoping is used - += shouldn't inherit this operation - the formatter/rewritter needs to be taught about this

interpreter: remove unused get_variables method

2e6f4c3

It's unused, and with the addition of local variables it becomes a footgun, since there are both local and global variables, and this only returns global variables.

interpreter: use self.get_variable() instead of manual lookup

4ed055c

Which is less code, and will properly handle local vs global variables

wip

7667ffa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add local and global variable support #13199

Add local and global variable support #13199

dcbaker commented May 9, 2024

Add local and global variable support #13199

Are you sure you want to change the base?

Add local and global variable support #13199

Conversation

dcbaker commented May 9, 2024