Change `pulumi refresh` to report diff relative to desired state instead of relative to only output changes #16146

lukehoban · 2024-05-08T05:50:05Z

This changes refresh steps to compute a diff similar to what would happen if a preview were run immediately after the refresh using the inputs from the program, but inverted.

This doesn't change what is stored back into state, but does produce a diff that is more aligned with true "change to desired state".

This fixes several corner cases that are unfortunate in the prior implementation:

Changes to output-only properties - like etags - are no longer shown as refresh diffs (but are updated in the state outputs).
IgnoreChanges now works to ignore property additions or changes (the changes are still updated in state so that future updates don't overwrite them again).

We also now store the ignoreChanges resource option into the state file, so that it is visible during a refresh. Unfortunately, applying new ignoreChanges requires an update, which is awkward but unavoidable while refresh doesn't run the program. We should add a state edit command to directly modify this in the CLI.

Notes:

Diff shows same properties being updated and deleted diff: -tags,tagsAll~tags,tagsAll #16144 affects some of these cases - though its technically orthogonal
Resource inputs not populated during reads for import operations #11279 is another technically orthogonal issue that many providers (at least TFBridge ones) - do not report back changes to input properties on Read when the input property (or property path) was missing on the inputs. This is again technically orthogonal - but leads to cases that appear "wrong" in terms of what is stored back into the state still - though the same as before this change.
Azure Native doesn't seem to handle ignoreChanges passed to Diff, so the ability to ignore changes on refresh doesn't currently work for Azure Native.

Fixes #16072.

Makes two changes to the handling of ignoreChanges: 1. Stores the ignoreChanges configuration for each resource in the state file 2. Applies the ignoreChanges configuration from the state file as part of processing a refresh This allows users to use `ignoreChanges` to prevent refresh diffs being reported when changes happen in the cloud provider that they do not want to bring back into their Pulumi state. This can be used to acknowledge that part or all of a resource may change in the cloud provider, but the user is okay with this and doesn't want to be notified about it during refresh. Importantly, this means that the diff won't be reported, but also that the changes won't be applied to state. The ignoreChanges option when applied to a refresh operation applies to *both* input and output properties of the resource. This is different from the behavior during a normal update operation, where ignoreChanges only applies to input properties. The special "*" property path ignores all properties, but also ignores the case where the resource is no longer found in the cloud, and instead keeps the resource with all of it's existing input and output properties. Because the program is not run for refresh operations, users must apply the `ignoreChanges` via a `pulumi up` first, and then future `pulumi refresh` or `pulumi refresh --preview-only` oeprations will ignore the specificd changes. This is unfortunate, but the only option we have unless/until Pulumi fundamentally changes it's approach to refresh and decides to run the program as part of computing the resource state (and options) to use during the refresh.

Change refresh steps to compute a diff similar to what would happen if a `preview` were run immediately after the refresh using the inputs from the program, but inverted. This doesn't change what is stored back into state, but does produce a diff that is more aligned with true "change to desired state". This fixes several corner cases that are unfortuante in the prior implementation: 1. Changes to output-only proeprties - like `etag`s - are no longer shown as refresh diffs (but are updated in the state outputs). 2. IgnoreChanges now works to ignore property additions or changes (the changes are still updated in state so that future updates don't overwrite them again). TODO: - [ ] The diff display isn't updated yet - only the progres row display. - [ ] Tests

pulumi-bot · 2024-05-08T06:02:38Z

Changelog

[uncommitted] (2024-05-20)

Features

[engine] Change pulumi refresh to report diff relative to desired state instead of relative to only output changes
#16146

t0yv0 · 2024-05-08T19:50:44Z

pkg/resource/deploy/step.go

+		// * newInputs where oldInputs are expected
+		// * newOutputs where oldOutputs are expected
+		// * oldInputs where newInputs are expected
+		diff, err := diffResource(s.new.URN, s.new.ID, s.new.Inputs, s.new.Outputs, s.old.Inputs, prov, preview,


Assuming this is going to call Diff(), I love that because that automatically engages bridge machinery for planning changes suppressing changes and computing detailed diffs, that is going to be very helpful to run that. The downside is that new inputs returned from Read() are not check-clean and Diff expects these to be checked clean inputs so this may uncover some latent issues in bridged providers when rolled out.

new inputs returned from Read() are not check-clean

I think the implied requriement even before these changes is that they are, as they have always been stored directly into the state file, and only "check-clean" inputs are allowed to be stored in state. And that's a reasonable requirement, because these are inputs provided directly by a provider, which is capable of ensuring anything it returns is check-clean. Check is only needed to take an arbitrary bag of inputs from a users and turn it into something the provider knows is valid.

Agreed that's reasonable. I was worried this will promote pulumi/pulumi-aws#2318 to a refresh issue but it's empirically not the case, works the same as before. So that's very good.

t0yv0 · 2024-05-08T22:20:02Z

Doing another pass here, really 👍 on this, "Changes to output-only properties are no longer shown as refresh diffs" is very valuable as well. This accomplishes most of the benefit of #16072 without running the program on pulumi refresh which may be an un-releasable change (too invasive because it would slow down pulumi refresh as well as possibly cause some previously working programs to crash as they do not expect being run during pulumi refresh). Summarizing quick background discussion with @EronWright - there inevitably will be cases where resource outputs interact with the program and full pulumi preview is higher fidelity as to actual drift compared this PR form of just calling Diff per-resource, but it seems to me the tradeoffs are worth it.

The provider previously triggers replaces on every update, because it replaces on `prefix` but doesn't store it into outputs. This *also* triggers the new refresh logic to report an update as part of the refresh, but it seems it is indeed just "broken" even outside of refresh, we just weren't testing multiple consecutive updates to these resources previously.

We now need to render the diff relative to the inputs instead of presenting an "output diff".

Enables setting the `ignoreChanges` resource option on a resource directly in the state. This is important for use cases where `ignoreChanges` must be applied ahead of a `refresh` operation, without being able to do a `pulumi up` in between to add the resource option via code.

lukehoban

A few notes:

Ideally we would offer a way to opt-out of this via env var. The changes are coupled across (a) RefreshStep in the engine (b) diff renderer (c) progress renderer. We would need to incorporate the env var into all three places and maintain old and new behaviour in both. But I think we likely do need to do this.
I am nervous about the wide variety of allowed Diff behaviours, and how few of them are tested generally here, especially relative to the true Diff behaviours in providers. Ideally we would have tf-bridge-like, kubernetes-like, and az-native-like diff implementations we could test against here to validate and baseline refresh (and many other features) against.

lukehoban · 2024-05-20T03:13:51Z

pkg/backend/display/rows.go

@@ -435,19 +435,14 @@ func (data *resourceRowData) getInfoColumn() string {
 }

 func getDiffInfo(step engine.StepEventMetadata, action apitype.UpdateKind) string {
-	diffOutputs := action == apitype.RefreshUpdate


We are no longer treating a refresh as an "output diff". It is now a diff of inferred inputs, preferably from the detailedDiff computed by the step.

lukehoban · 2024-05-20T03:16:18Z

pkg/backend/display/diff.go

+		}
+
+		// If this is the output step for an import or refresh, we actually want to display the diff at this point.
+		if payload.Metadata.Op == deploy.OpImport || (refresh && payload.Metadata.Op == deploy.OpUpdate) {


For refresh, we now render a normal input diff but at ResourceOutputsEvent time (because we don't know the diff until we run the step). Very similar to Import. However, the step gets rewritten into a "OpUpdate" if there are changes, so look at that.

I believe it is right to ignore the OpSame and OpDelete cases that an OpRefresh could also be rewritten to - but it's possible there are cases where those should also go through this path?

lukehoban · 2024-05-20T03:17:45Z

pkg/cmd/pulumi/state_setignorechanges.go

+	var yes bool
+
+	cmd := &cobra.Command{
+		Use:   "set-ignore-changes [resource URN] --path [property-path-1] ...",


I honestly wish we had a more general command or set of commands for modifying components of state directly like this, and didn't need to invent a new mechanism here. Open to feedback on the CLI UX here - it's not particularly "nice", but seems as well aligned as possible with surrounding CLI UX and with being able to technically express all use cases.

lukehoban · 2024-05-20T03:20:03Z

pkg/engine/lifecycletest/testdata/output/TestCanceledRefresh/diff.stdout.txt

 <{%fg 2%}>  + foo: <{%reset%}><{%fg 2%}>"bar"<{%reset%}><{%fg 2%}>
 <{%reset%}><{%fg 13%}><{%bold%}>Resources:<{%reset%}>
-    <{%fg 3%}>~ 1 updated<{%reset%}>
+    1 unchanged


This and other similar changes are intentional - the only change is in outputs so it doesn't report as a diff.

Note that these tests do the equivalent of --show-sames, so they actually do show the diff for the sames, which wouldn't be "normal" in practice.

lukehoban · 2024-05-20T03:21:59Z

pkg/resource/deploy/step.go


 // ResultOp returns the operation that corresponds to the change to this resource after reading its current state, if
 // any.
 func (s *RefreshStep) ResultOp() display.StepOp {
 	if s.new == nil {
 		return OpDelete
 	}
-	if s.new == s.old || s.old.Outputs.Diff(s.new.Outputs) == nil {
+	if s.new == s.old || s.diff.Changes == plugin.DiffNone {


I'm not 100% certain how DiffUnknown should be treated here.

lukehoban added 2 commits May 7, 2024 23:00

lukehoban force-pushed the lukehoban/ignoreChangesRefresh branch from f3b983d to 9cc0f2c Compare May 8, 2024 06:01

lukehoban requested review from Frassle and pgavlin May 8, 2024 18:47

lukehoban added 2 commits May 8, 2024 11:58

Lint clean

2e24100

Add changelog

18083bd

t0yv0 self-requested a review May 8, 2024 19:40

t0yv0 reviewed May 8, 2024

View reviewed changes

Unblock tests

0c0b0ba

lukehoban force-pushed the lukehoban/ignoreChangesRefresh branch from 26059f7 to d266a15 Compare May 9, 2024 01:39

lukehoban force-pushed the lukehoban/ignoreChangesRefresh branch from d266a15 to dc8439b Compare May 9, 2024 03:35

t0yv0 mentioned this pull request May 10, 2024

Refresh reorders set elements causing spurious diffs pulumi/pulumi-terraform-bridge#1904

Open

lukehoban added 2 commits May 15, 2024 09:42

Add Refresh.Apply tests

2727bd7

Fix up diff display for Refresh

8096bff

We now need to render the diff relative to the inputs instead of presenting an "output diff".

lukehoban mentioned this pull request May 17, 2024

[WIP] Add a ignoreRefreshChanges resource option #16015

Closed

3 tasks

lukehoban added 5 commits May 17, 2024 16:19

Merge branch 'master' into lukehoban/ignoreChangesRefresh

6010b48

Ensure that state Copy includes IgnoreChanges

ed0f136

Fix lint errors

59aad09

Update lifceycle test baselines

5cf91cf

lukehoban marked this pull request as ready for review May 19, 2024 21:13

lukehoban requested a review from a team as a code owner May 19, 2024 21:13

Update TestRefreshBasics test and baselines

7a08bb8

lukehoban commented May 20, 2024

View reviewed changes

t0yv0 self-requested a review May 21, 2024 01:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change `pulumi refresh` to report diff relative to desired state instead of relative to only output changes #16146

Change `pulumi refresh` to report diff relative to desired state instead of relative to only output changes #16146

lukehoban commented May 8, 2024 •

edited

pulumi-bot commented May 8, 2024 •

edited

t0yv0 May 8, 2024

lukehoban May 8, 2024

t0yv0 May 8, 2024

t0yv0 commented May 8, 2024 •

edited

lukehoban left a comment

lukehoban May 20, 2024

lukehoban May 20, 2024

lukehoban May 20, 2024

lukehoban May 20, 2024

lukehoban May 20, 2024

Change pulumi refresh to report diff relative to desired state instead of relative to only output changes #16146

Are you sure you want to change the base?

Change pulumi refresh to report diff relative to desired state instead of relative to only output changes #16146

Conversation

lukehoban commented May 8, 2024 • edited

pulumi-bot commented May 8, 2024 • edited

Changelog

[uncommitted] (2024-05-20)

Features

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

t0yv0 commented May 8, 2024 • edited

lukehoban left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Change `pulumi refresh` to report diff relative to desired state instead of relative to only output changes #16146

Change `pulumi refresh` to report diff relative to desired state instead of relative to only output changes #16146

lukehoban commented May 8, 2024 •

edited

pulumi-bot commented May 8, 2024 •

edited

t0yv0 commented May 8, 2024 •

edited