
Freshness sensor does not run when assets are pending or failed #21949

Closed

mr-mcox opened this issue May 17, 2024 · 3 comments
Labels: area: sensor (Related to Sensors) · type: bug (Something isn't working)


mr-mcox commented May 17, 2024

Dagster version

1.7.5

What's the issue?

I am encountering an issue with Dagster 1.7.5 when defining and running freshness checks. When one of the jobs is running, the freshness check evaluation result is "Running" and the freshness checks sensor fails at every evaluation with the following error:

dagster._core.errors.SensorExecutionError: Error occurred during the execution of evaluation_fn for sensor freshness_checks_sensor

  File "/usr/local/lib/python3.11/site-packages/dagster/_grpc/impl.py", line 388, in get_external_sensor_execution
    with user_code_error_boundary(
  File "/usr/local/lib/python3.11/contextlib.py", line 158, in __exit__
    self.gen.throw(typ, value, traceback)
  File "/usr/local/lib/python3.11/site-packages/dagster/_core/errors.py", line 297, in user_code_error_boundary
    raise new_error from e

The above exception was caused by the following exception:
dagster._check.CheckError: Expected non-None value: Expected the dagster event to be an asset check evaluation.

  File "/usr/local/lib/python3.11/site-packages/dagster/_core/errors.py", line 287, in user_code_error_boundary
    yield
  File "/usr/local/lib/python3.11/site-packages/dagster/_grpc/impl.py", line 394, in get_external_sensor_execution
    return sensor_def.evaluate_tick(sensor_context)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/dagster/_core/definitions/sensor_definition.py", line 795, in evaluate_tick
    result = self._evaluation_fn(context)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/dagster/_core/definitions/sensor_definition.py", line 1101, in _wrapped_fn
    raw_evaluation_result = fn(**context_param, **resource_args_populated)
                            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/dagster/_core/definitions/asset_check_factories/freshness_checks/sensor.py", line 98, in the_sensor
    evaluation = check.not_none(
                 ^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/dagster/_check/__init__.py", line 1003, in not_none
    raise CheckError(f"Expected non-None value: {additional_message}")

This behavior is undesirable because it means that if a job is running, freshness cannot be checked, which defeats the purpose of checking freshness.

What did you expect to happen?

I expected that the freshness checks would be executed by the sensor, which would inform me if a job was taking too long to run. Additionally, I assumed that one running job (Job A) would not prevent freshness checks from running on another job (Job B).

How to reproduce?

Here is a simplified version of the code I am using to define freshness checks and the corresponding sensor:

from dagster import AssetKey, build_sensor_for_freshness_checks, build_last_update_freshness_checks
from datetime import timedelta

# Define freshness checks
job_a_complete_freshness_check = build_last_update_freshness_checks(
    assets=[AssetKey(["final_asset", "job_a"])],
    lower_bound_delta=timedelta(days=1),
)

job_b_complete_freshness_check = build_last_update_freshness_checks(
    assets=[AssetKey(["final_asset", "job_b"])],
    lower_bound_delta=timedelta(days=1),
)

# build_last_update_freshness_checks returns a sequence of checks,
# so the two results can be concatenated into one list.
all_freshness_checks = (
    job_a_complete_freshness_check +
    job_b_complete_freshness_check  # type: ignore[operator]
)

# Define sensor for freshness checks
freshness_checks_sensor = build_sensor_for_freshness_checks(
    freshness_checks=all_freshness_checks,
    minimum_interval_seconds=60 * 30,
)

The final_asset asset is at the end of a long-running job. When one of the jobs is running, the freshness check evaluation result shows "Running", and the freshness checks sensor fails with the error mentioned above.

Additionally, if one of the assets fails to materialize in the latest run (e.g., an upstream dependency fails), the sensor fails in exactly the same manner.
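For completeness, here is a minimal sketch of asset definitions that would complete the repro under both scenarios. This is not part of the original report: the asset bodies and sleep duration are illustrative assumptions, chosen only so that the asset keys match the AssetKeys used above.

import time

from dagster import Definitions, asset

# Hypothetical asset whose key is AssetKey(["final_asset", "job_a"]).
@asset(key_prefix=["final_asset"])
def job_a():
    # Simulate a long-running materialization; while this runs, the
    # freshness check evaluation shows "Running".
    time.sleep(60 * 60)

# Hypothetical asset whose key is AssetKey(["final_asset", "job_b"]).
@asset(key_prefix=["final_asset"])
def job_b():
    # Simulate the second failure mode: a materialization that fails.
    raise Exception("simulated upstream failure")

defs = Definitions(
    assets=[job_a, job_b],
    asset_checks=all_freshness_checks,
    sensors=[freshness_checks_sensor],
)

With these definitions loaded, materializing job_a (and letting it run) or job_b (and letting it fail) while the sensor ticks should reproduce the CheckError above.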

Deployment type

Dagster Helm chart

Deployment details

Likely irrelevant deployment details:

  • Jobs are executed using the kubernetes executor
  • Dagster is deployed on Google Kubernetes Engine

Additional information

No response

Message from the maintainers

Impacted by this issue? Give it a 👍! We factor engagement into prioritization.

mr-mcox added the "type: bug (Something isn't working)" label on May 17, 2024.
mr-mcox changed the title from "Freshness Sensor does not run" to "Freshness sensor does not run when assets are pending or failed" on May 17, 2024.
garethbrickman added the "area: sensor (Related to Sensors)" label (and added, then removed, "area: asset-checks (Related to Asset Checks)") on May 20, 2024.
dpeng817 (Contributor) commented May 23, 2024

Should be fixed by this PR: #22013. Once I can get confirmation from a few users that they are no longer running into this, we can close.

mr-mcox (Author) commented May 30, 2024

I can confirm that the undesirable behavior has ceased after deploying this change. From my perspective, this issue is resolved - thank you!

dpeng817 (Contributor) commented

Closing for now!
