feat: Raise error for unknown keywords #632

MRVermeulenDeltares · 2024-04-24T07:02:42Z

#622

Throw error when unknown keyword is located in the mdu
When multiple unknown keywords are located in the mdu, have the error contain multiple unknown keywords.

… on config.extra

…tionManager and update notification layout for file loading

…tion

This reverts commit 969ee20.

…of a message.

# Conflicts: # hydrolib/core/dflowfm/ini/models.py

sonarcloud · 2024-05-10T09:29:04Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code

See analysis details on SonarCloud

MRVermeulenDeltares · 2024-05-10T13:32:08Z

After the update to throw an error on an unknown key multiple test started to fail.
I have done some reasearch to try and figure out why.
I came to a few conclusions:

Unknown keywords in the testfiles
Outdated keywords in the testfiles
Not defined keywords in the models (those keywords are in the manual)
Problems with subfiles being validatated and not containing properties in their classes that are defined in their files. (e.g. .bc files and crosssection .ini related files, might be more. )

This is what I have found in regard to the keywords in relation with the tests, I tried to tackle all keywords but I am not 100% sure I have them all.
Manual I used for verification is: content.oss.deltares.nl/dhydro/D-Flow_FM_User_Manual.pdf

Section	Keyword	defined in manual	Mentioned in #634
General	guiversion	Yes, Table A.1
Geometry	pipefile	No	Yes
Geometry	branchfile	No
Geometry	onednetworkfile	No
Geometry	shipdeffile	No	Yes
Geometry	bedlevelfile	Yes, Table A.4, deprecated and removed
VolumeTables	usevolumetablesfile	No
Numerics	jasfer3d	No	Yes
Numerics	jarhoxu	No
Numerics	vertadvtypmom3onbnd	No
Numerics	jposhchk	No	Yes
Numerics	newcorio	Yes, Table A.3, research keyword	Yes
Numerics	jaorgsethu	Yes, but only in examples, not in a table defining the keyword
Numerics	jaupwindsrc	No	Yes
Numerics	eddyviscositybedfacmax	No	Yes
Numerics	icoriolistype	Yes, Table A.3, research keyword	Yes
Numerics	zlayercenterbedvel	Yes, but only once in a text: "10.3 Z-layer modelling"	Yes
Numerics	epshstem	No	Yes
Numerics	zwsbtol	No	Yes
Numerics	horadvtypzlayer	Yes, Table A.3, research keyword	Yes
Numerics	corioadamsbashfordfac	Yes, Table A.3, research keyword	Yes
Numerics	drop3d	Yes, Table A.3, research keyword	Yes
Numerics	transporttimestepping	Yes, Table A.4, deprecated and removed
Numerics	transportmethod	Yes, Table A.4, deprecated and removed
Numerics	noderivedtypes	No
Numerics	fixedweirfrictscheme	No	Yes
Numerics	jbasqbnddownwindhs	Yes, Table A.3, research keyword	Yes
Numerics	logprofkepsbndin	Yes, Table A.3, research keyword	Yes
Physics	selfattractionloading	Yes, But only mentions in Table F.4	Yes
Physics	soiltempthick	No	Yes
Physics	jadelvappos	No
Physics	uniffrictcoef1d2d	No	Yes
Physics	umodlin	No
Physics	effectspiral	Yes, but only in examples, not in a table defining the keyword
Time	timestepanalysis	No	Yes
Time	dtfacmax	No	Yes
Time	autotimestepdiff	No
Wind	windhuorzwsbased	Yes, Table A.3, research keyword	Yes
Waves	wavenikuradse	No
Output	writepart_domain	Yes, but only once in a text: "6.4.2 Partitioning a model"	Yes
Output	wrimap_salinity	~Yes, Table F.5
Output	velocitydirectionclassesinterval	No	Yes
Output	timesplitinterval	No	Yes
Output	wrimap_temperature	~Yes, Table F.5
Output	wrimap_input_dt	No
Output	writebalancefile	Yes, Table A.4, deprecated and removed
Output	wrihis_heatflux	No
Output	wrirst_bnd	No	Yes
Output	writedfminterpretedvalues	No	Yes
Output	s1incinterval	No
Output	velocitymagnitudeclasses	No	Yes

Action points for issue

Wait on implementation of Add mdu keywords (not listed in appendix A of the manual) #634
Determine if this list ^ contains keywords that need to be added which aren't defined in another issue
Remove unknown keywords from testfiles
Further investigate and resolve the issue with .bc files and crosssection .ini related files

tim-vd-aardweg · 2024-05-24T06:53:51Z

In #634 I wanted to add this test:

    def test_load_model_with_research_keywords_as_fmmodel_raises_error(self):
        input_mdu = (
                test_input_dir / "research" / "mdu_with_research_keywords_from_dia_file_2024.03_release.mdu"
        )

        with pytest.raises(ValueError) as e:
            _ = FMModel(filepath=input_mdu)

        expected_error = "Unknown keywords are detected in section"
        assert expected_error in str(e.value)

But it currently fails, since we have Extra.ignore. So I can't yet add that test. That test can only be added after this issue is implemented.

tim-vd-aardweg · 2024-05-24T09:08:35Z

hydrolib/core/dflowfm/ini/models.py

@@ -55,6 +56,15 @@ class Config:
        extra = Extra.ignore
        arbitrary_types_allowed = False

+    def __init__(self, **data):
+        super().__init__(**data)


Wouldn't it be more efficient to first check if there are unknown keywords and then call super().__init__()? It seems we have all the data we need to determine if there are unknown keywords. And if there are unknown keywords there is no reason to let pydantic work all its magic, since we are raising an error anyway?

tim-vd-aardweg · 2024-05-24T09:09:42Z

hydrolib/core/dflowfm/ini/util.py


+from pydantic import Extra


Not used. There are a couple other imports that are not used. Please remove those as well.

tim-vd-aardweg · 2024-05-24T09:11:12Z

hydrolib/core/dflowfm/ini/util.py

+        Notify the user of unknown keywords.
+
+        Args:
+            data (Dict[str, Any])   : Input data containing all set properties which are checked on unknown keywords.


What are set properties? It's a dict, not a set? 😊

tim-vd-aardweg · 2024-05-24T09:14:06Z

hydrolib/core/dflowfm/ini/models.py

+    def __init__(self, **data):
+        super().__init__(**data)
+        self._unknown_keyword_error_manager.raise_error_for_unknown_keywords(
+            data,


It seems we are only using the keys of this dict in the raise_error_for_unknown_keywords() method. So just pass data.keys() and update the types expected by the method as you will now pass a list of strings instead of a dict.

Nevermind, ignore this comment. It seems perfectly fine to just pass the dict :)

tim-vd-aardweg · 2024-05-24T09:22:43Z

hydrolib/core/dflowfm/ini/util.py

+        self, data: Dict[str, Any], fields: Dict[str, Any], excluded_fields: Set
+    ) -> List[str]:
+        list_of_unknown_keywords = []
+        for name, _ in data.items():


You can use: for name in data. There is no need to get the keys and values!

tim-vd-aardweg · 2024-05-24T09:36:20Z

hydrolib/core/dflowfm/ini/util.py

+    def _is_unknown_keyword(
+        self, name: str, fields: Dict[str, Any], excluded_fields: Set
+    ):
+        return name not in fields and name not in excluded_fields


In HYDROLIB-core we use pydantic to turn dictionaries into valid objects. Pydantic can map the key in the dictionary to the appropriate attribute by either looking at the field name or the field alias. This behaviour can be set in the config (config.allow_population_by_field_name). In our BaseModel we set config.allow_population_by_field_name to True. This means that we want to be able to map the data via either the name or the alias.

In your implementation, we say that a keyword is unknown if there is no attribute with the same name as the key. However, this will, for example, fail for the research keywords that are implemented in #642. All research keyword attribute names are prefixed with research. This will fail the mapping and they will be considered an unknown keyword. I therefore think that it is important to not only check the field names, but also their alias!

tim-vd-aardweg · 2024-05-24T09:44:36Z

hydrolib/core/dflowfm/ini/models.py

@@ -55,6 +56,15 @@ class Config:
        extra = Extra.ignore
        arbitrary_types_allowed = False

+    def __init__(self, **data):


Was there a reason we could not put this in a root_validator?

tim-vd-aardweg · 2024-05-24T10:07:18Z

This doesn't work for unknown sections, right? Should you create a follow-up issue for this?

tim-vd-aardweg · 2024-05-24T10:09:10Z

tests/dflowfm/test_mdu.py

@@ -388,3 +392,115 @@ def test_loading_fmmodel_model_with_both_ini_and_xyn_obsfiles(self):
            assert actual_point.x == expected_point.x
            assert actual_point.y == expected_point.y
            assert actual_point.name == expected_point.name
+
+    def test_mdu_unknown_keywords_loading_gives_message_for_missing_keyword(


Suggested change

def test_mdu_unknown_keywords_loading_gives_message_for_missing_keyword(

def test_mdu_unknown_keywords_loading_gives_message_for_unknown_keyword(

tim-vd-aardweg · 2024-05-24T10:10:44Z

tests/dflowfm/test_mdu.py

+        tmp_mdu_path.write_text(tmp_mdu)
+
+        with patch(
+            "hydrolib.core.dflowfm.ini.models.INIBasedModel.Config"


Why do we need this mock?

tim-vd-aardweg · 2024-05-24T10:11:15Z

tests/dflowfm/test_mdu.py

+            )
+            assert name in captured.out
+
+    def test_mdu_unknown_keywords_loading_gives_message_for_missing_keyword2(


Give it an appropriate name please

tim-vd-aardweg · 2024-05-24T10:14:57Z

tests/dflowfm/test_mdu.py

+        FMModel(filepath=tmp_mdu_path)
+        captured = capsys.readouterr()
+
+        excluded_fields = ["comments", "datablock", "_header"]


This test will still pass if we ever change the _exclude_fields of the model. So maybe use: excluded_fields = model._exclude_fields instead? Maybe even assert that the list is not empty?

tim-vd-aardweg · 2024-05-24T10:16:27Z

tests/dflowfm/test_mdu.py

+        for excluded_field in excluded_fields:
+            assert excluded_field not in captured.out
+
+    def test_mdu_unknown_keywords_allow_extra_setting_field_gives_message(self, capsys):


In my opinion there is no need to check all the different options for Extra.<allow/forbid/ignore>. The user should not edit the values we have set, since it's not a public setting. If they decide to change it anyway, we can't offer support/validation.

feat: (622) write message on reading file and setting variables based…

4c64a2d

… on config.extra

MRVermeulenDeltares linked an issue Apr 24, 2024 that may be closed by this pull request

When there are unknown keywords in the mdu, a warning should be given instead of them being silently dropped. #622

Open

MRVermeulenDeltares mentioned this pull request Apr 24, 2024

feat: When there are unknown keywords in the mdu, a warning should be given instead of them being silently dropped. #629

Closed

MRVermeulenDeltares and others added 6 commits April 24, 2024 09:22

feat: (622) resolve failing tests.

4ee0250

autoformat: isort & black

ca14542

feat: (622) move related changes to specific class UnknownKeyNotifica…

0808f17

…tionManager and update notification layout for file loading

feat: (622) Move UnknownKeyNotificationManager to util

5980d31

feat: (622) Add unit tests for UnknownKeyNotificationManager

49ec80f

feat: (622) Add extra spacing at the end of printing the list per sec…

9ae839e

…tion

MRVermeulenDeltares mentioned this pull request Apr 24, 2024

When there are unknown keywords in the mdu, a warning should be given instead of them being silently dropped. #622

Open

MRVermeulenDeltares and others added 4 commits April 24, 2024 15:36

feat: (622) Try to resolve failing tests on python 3.8

969ee20

Revert "feat: (622) Try to resolve failing tests on python 3.8"

2cae2a9

This reverts commit 969ee20.

feat: (622) Resolve problem occuring in python 3.8

b6f2ebc

autoformat: isort & black

060539c

MRVermeulenDeltares changed the title ~~feat: give-warning-for-unknown-keyword~~ feat: give-error-for-unknown-keyword May 10, 2024

MRVermeulenDeltares added 2 commits May 10, 2024 11:20

feat: (622) Update unknown keyword manager to raise an error instead …

790dfe4

…of a message.

Merge branch 'main' into feat/622-give-warning-for-unknown-keywords

4ee0166

# Conflicts: # hydrolib/core/dflowfm/ini/models.py

tim-vd-aardweg changed the title ~~feat: give-error-for-unknown-keyword~~ feat: Raise error for unknown keywords May 13, 2024

tim-vd-aardweg mentioned this pull request May 24, 2024

feat: Add research keywords #642

Open

tim-vd-aardweg reviewed May 24, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Raise error for unknown keywords #632

feat: Raise error for unknown keywords #632

MRVermeulenDeltares commented Apr 24, 2024 •

edited

sonarcloud bot commented May 10, 2024

MRVermeulenDeltares commented May 10, 2024 •

edited

tim-vd-aardweg commented May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg commented May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

tim-vd-aardweg May 24, 2024

	def test_mdu_unknown_keywords_loading_gives_message_for_missing_keyword(
	def test_mdu_unknown_keywords_loading_gives_message_for_unknown_keyword(

feat: Raise error for unknown keywords #632

Are you sure you want to change the base?

feat: Raise error for unknown keywords #632

Conversation

MRVermeulenDeltares commented Apr 24, 2024 • edited

sonarcloud bot commented May 10, 2024

Quality Gate passed

MRVermeulenDeltares commented May 10, 2024 • edited

Action points for issue

tim-vd-aardweg commented May 24, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tim-vd-aardweg commented May 24, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MRVermeulenDeltares commented Apr 24, 2024 •

edited

MRVermeulenDeltares commented May 10, 2024 •

edited