Integrate calibration pipeline into atmos #2895

nefrathenrici · 2024-04-11T00:02:19Z

This PR adds the framework for reproducible calibration experiments, starting with one example experiment.

calibration/README.md provides an overview. Most of the actual code is in ClimaCalibrate

Content

calibration/model_interface.jl: Model interface hooks for CalibrateAtmos. This will be used in every atmos calibration.
test/calibration_interface.jl: Basic tests for the model interface file.
calibration/experiments: Folder containing subfolders of calibration experiments. Currently only contains sphere_held_suarez_rhoe_equilmoist

sphere_held_suarez_rhoe_equilmoist/

experiment_config.yml: Stores filepaths for calibration artifacts and other configuration data
model_config.yml: Atmos yaml config for the calibration
pipeline.sbatch: Script to run calibration experiment on central
observation_map.jl: Code to process model output into observation space
postprocessing.jl: Plotting script
prior.toml: Prior distribution file

Sbozzolo · 2024-05-23T17:53:51Z

test/calibration_interface.jl

+file_path = joinpath(member_path, "parameters.toml")
+mkpath(dirname(file_path))
+touch(file_path)
+
+atmos_config = ClimaCalibrate.set_up_forward_model(1, 1, experiment_dir)
+(; parsed_args) = atmos_config
+
+@testset "Atmos Configuration" begin
+    @test parsed_args["moist"] == "equil"
+    @test parsed_args["toml"] == [file_path]
+    @test parsed_args["output_dir"] == member_path
+    @test parsed_args["restart_file"] ==
+          "/groups/esm/ClimaArtifacts/artifacts/atmos_held_suarez_obs/day200.0.hdf5"
+end
+
+rm(file_path)


Use temporary folders instead of relying on creating a new folder and removing it. The folder will not be cleaned in case of failing tests.

Sbozzolo · 2024-05-23T17:54:52Z

test/calibration_interface.jl

+    @test parsed_args["toml"] == [file_path]
+    @test parsed_args["output_dir"] == member_path
+    @test parsed_args["restart_file"] ==
+          "/groups/esm/ClimaArtifacts/artifacts/atmos_held_suarez_obs/day200.0.hdf5"


You should use Artifacts or ClimaArtifacts instead of hardcoding the path.

This test has to be reproducible even outside of the Caltech cluster

Sbozzolo · 2024-05-23T17:56:09Z

I am not too familiar with the calibration pipeline, so it will take me a little bit of time to understand and properly review this PR.

Sbozzolo · 2024-05-23T17:57:43Z

calibration/experiments/sphere_held_suarez_rhoe_equilmoist/Project.toml

+YAML = "ddb6d928-2868-570f-bddf-ab3f9cf99eb6"
+
+[compat]
+ClimaAtmos = "=0.24.0"


If ClimaAtmos is pinned to a specific version, why do we need to have a job in buildkite?

Sbozzolo · 2024-05-23T23:20:15Z

calibration/experiments/sphere_held_suarez_rhoe_equilmoist/Project.toml

+Distributions = "31c24e10-a181-5473-b8eb-7969acd0382f"
+EnsembleKalmanProcesses = "aa8a2aa5-91d8-4396-bcef-d4f2ec43552d"
+JLD2 = "033835bb-8acc-5ee8-8aae-3f567f8a3819"
+NetCDF = "30363a11-5582-574a-97bb-aa9a979735b9"


Is NetCDF needed?

Sbozzolo · 2024-05-23T23:23:02Z

calibration/experiments/sphere_held_suarez_rhoe_equilmoist/observation_map.jl

+        try
+            G_ensemble[:, m] .= process_member_data(simdir)
+        catch err
+            @info "Error during observation map for ensemble member $m" err
+            G_ensemble[:, m] .= NaN
+        end


What are you trying to try-catch here?

Sbozzolo · 2024-05-23T23:23:51Z

calibration/experiments/sphere_held_suarez_rhoe_equilmoist/observation_map.jl

+const config = ExperimentConfig(@__DIR__)
+function observation_map(iteration)
+    (; ensemble_size, output_dir) = config
+    dims = 1


Maybe dims can have a more descrpitive name. dims of what?

Sbozzolo · 2024-05-23T23:25:30Z

calibration/experiments/sphere_held_suarez_rhoe_equilmoist/postprocessing.jl

+            ensemble_error += abs(i - theta_star)^2
+            ensemble_spread += abs(i - ensemble_mean)^2
+
+        end


Suggested change

ensemble_error += abs(i - theta_star)^2

ensemble_spread += abs(i - ensemble_mean)^2

end

ensemble_error += abs(i - theta_star)^2

ensemble_spread += abs(i - ensemble_mean)^2

end

Sbozzolo · 2024-05-23T23:25:39Z

calibration/experiments/sphere_held_suarez_rhoe_equilmoist/postprocessing.jl

+        ensemble_error = 0
+        ensemble_spread = 0


Suggested change

ensemble_error = 0

ensemble_spread = 0

ensemble_error = 0.0

ensemble_spread = 0.0

Sbozzolo · 2024-05-23T23:27:12Z

calibration/model_interface.jl

+    set_up_forward_model(member, iteration, experiment_dir::AbstractString)
+
+Returns an AtmosConfig object for the given member and iteration.
+If given an experiment id string, it will load the config from the corresponding YAML file.


Is this implemented?

Sbozzolo · 2024-05-23T23:28:02Z

calibration/model_interface.jl

+    if haskey(config_dict, "restart_file") &&
+       !isabspath(config_dict["restart_file"])
+        config_dict["restart_file"] =
+            joinpath(experiment_dir, config_dict["restart_file"])
+    end


Shouldn't this be handled internally by Atmos?

Sbozzolo · 2024-05-23T23:30:48Z

What is the ultimate intent of this PR? Is it to test that things work? To teach people how to do calibration? Other reasons?

nefrathenrici force-pushed the ne/calibrate branch from fb79171 to dbee822 Compare April 20, 2024 00:07

nefrathenrici force-pushed the ne/calibrate branch from d8a5a2d to 8154925 Compare May 9, 2024 21:14

nefrathenrici force-pushed the ne/calibrate branch from a2f4074 to 4f8a838 Compare May 20, 2024 20:29

AlexisRenchon mentioned this pull request May 22, 2024

Integrate calibration pipeline into ClimaLand CliMA/ClimaLand.jl#621

Draft

nefrathenrici force-pushed the ne/calibrate branch from e2fc12b to 16c9a61 Compare May 23, 2024 16:52

Add calibration framework and perfect model experiment

9ea6670

nefrathenrici force-pushed the ne/calibrate branch from 16c9a61 to 9ea6670 Compare May 23, 2024 16:54

nefrathenrici marked this pull request as ready for review May 23, 2024 17:37

nefrathenrici requested review from charleskawczynski and Sbozzolo May 23, 2024 17:40

Sbozzolo reviewed May 23, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate calibration pipeline into atmos #2895

Integrate calibration pipeline into atmos #2895

nefrathenrici commented Apr 11, 2024 •

edited

Sbozzolo May 23, 2024

Sbozzolo May 23, 2024

Sbozzolo commented May 23, 2024

Sbozzolo May 23, 2024

Sbozzolo May 23, 2024

Sbozzolo May 23, 2024

Sbozzolo May 23, 2024

Sbozzolo May 23, 2024

Sbozzolo May 23, 2024

Sbozzolo May 23, 2024

Sbozzolo May 23, 2024

Sbozzolo commented May 23, 2024

Integrate calibration pipeline into atmos #2895

Are you sure you want to change the base?

Integrate calibration pipeline into atmos #2895

Conversation

nefrathenrici commented Apr 11, 2024 • edited

Content

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Sbozzolo commented May 23, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Sbozzolo commented May 23, 2024

nefrathenrici commented Apr 11, 2024 •

edited