
Make test log parsing robust #3187

Open · wants to merge 2 commits into base: master

Conversation

heplesser (Contributor)

This PR fixes #3155. It catches errors in parsing the XML files generated by tests, displays them in the results table, and exits with a non-zero exit code as if tests had actually failed.

@heplesser heplesser added the labels T: Bug (Wrong statements in the code or documentation), S: Critical (Needs to be addressed immediately), and I: No breaking change (Previously written code will work as before, no one should note anything changing aside from the fix) on Apr 22, 2024
@heplesser heplesser added this to To do in Build system and CI via automation Apr 22, 2024
@heplesser (Contributor, Author)

@terhorstd @lekshmideepu Ping!

@terhorstd terhorstd (Contributor) left a comment

I'd suggest some small refactoring to reduce cross-code dependencies and the code complexity introduced by "another case".

Comment on lines 79 to 82
except Exception as err:
msg = f"ERROR: {pfile} not parsable with error {err}"
results[ph_name] = {"Tests": 0, "Skipped": 0, "Failures": 0, "Errors": 0, "Time": 0, "Failed tests": [msg]}
totals["Failed tests"].append(msg)
Contributor

Why is this exception not handled inside parse_result_file()? If the exception can be handled as is done here, it should be handled where it is first encountered; otherwise every caller of parse_result_file needs to implement it. Seems the results can be handled quite symmetrically to the "normal" return values, which would be even more reason to move the try … except inside.
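A minimal sketch of what moving the handling inside could look like, assuming a parse_result_file() that returns the per-phase dict; the helper name _parse_xml_results is illustrative and not taken from the actual script:

def parse_result_file(pfile):
    """Parse one test-phase XML result file into summary counts.

    Parse errors are handled here, so every caller gets back the same
    dict shape instead of having to wrap the call in try/except.
    """
    try:
        # _parse_xml_results is a hypothetical helper doing the actual XML work
        return _parse_xml_results(pfile)
    except Exception as err:
        msg = f"ERROR: {pfile} not parsable with error {err}"
        return {"Tests": 0, "Skipped": 0, "Failures": 0, "Errors": 0,
                "Time": 0, "Failed tests": [msg]}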

Contributor Author

Done with next push.

totals[k] += v
except Exception as err:
msg = f"ERROR: {pfile} not parsable with error {err}"
results[ph_name] = {"Tests": 0, "Skipped": 0, "Failures": 0, "Errors": 0, "Time": 0, "Failed tests": [msg]}
Contributor

Suggested change
- results[ph_name] = {"Tests": 0, "Skipped": 0, "Failures": 0, "Errors": 0, "Time": 0, "Failed tests": [msg]}
+ results[ph_name] = {"Tests": 0, "Skipped": 0, "Failures": 0, "Errors": 1, "Time": 0, "Failed tests": [msg]}

If we also count the "Failed tests" entry in the "Errors" column, we could keep len(results["Failed tests"]) == results["Errors"] + results["Failures"], which seems like an easy-to-sustain symmetry and follows the principle of least surprise.

Contributor

My preference would be to have the failed and errored tests reported separately.

Contributor Author

When parsing of the XML file fails, we lose all information about an entire phase of the testing process. We do not even know how many tests were in this phase and could therefore be considered "Failed". Indeed, in all cases I have seen so far, the tests actually all passed; we just could not parse the results because the XML file was corrupted.

Therefore, we need to handle parse failures differently and cannot include them in the other statistics.

Contributor

The exact number of failures is not relevant, since the type of failure determines criticality and required actions. So I think it's OK to have a parsing failure recorded as "one failure". The same happens in other cases where we treat one wrong result differently from any number of other errors. If there is at least one failure in a stage, we need to look into it and understand what it means anyway. It's not possible to dismiss a number of failures just because the number is small.

Contributor

@lekshmideepu, yes, that's why there are Errors and Failures. My point is that I think we do not need a third type that requires extra condition checks when interpreting the results. For me it would be intuitively clean to assert correct functioning of the testsuite with

assert results["Failures"] + results["Errors"] == len(results["Failed tests"])

where the list of failed tests contains a message for each of the errors and failures. The level on which something went wrong is not known for each failure anyway. If a test fails, the problem can lie anywhere between external libraries, NEST code, or a broken test or test suite. This is not special to a parsing error.

Contributor

@terhorstd Thanks for the clarification. I had understood it differently. I agree with your point.

Contributor Author

@terhorstd In pytest, "error" and "failure" have specific meanings (see https://stackoverflow.com/questions/21513718/pytest-error-vs-fail): if an assertion fails while a test_...() is executing, this counts as a failure, while a failed assertion outside a test_, especially in fixtures, is counted as an error. The sum of passes, failures, errors, and skips should always equal the total number of tests. This should apply to each row in our table and to the totals at the bottom.

If we cannot parse an XML results file, we have no information, and just counting that as an error would break the logic of the pytest result counts. Therefore, I still think that it is most sensible to treat XML parsing errors separately from pytest result tabulation.
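An illustrative standalone pytest file (not part of this PR) showing the distinction: the failed assertion inside the test body is reported as a failure, while the exception raised during fixture setup turns the dependent test into an error.

import pytest

@pytest.fixture
def broken_fixture():
    # Exception during setup: any test requesting this fixture is reported as an ERROR
    raise RuntimeError("setup went wrong")

def test_assertion_fails():
    # Failed assertion inside the test body: reported as a FAILURE
    assert 1 + 1 == 3

def test_with_broken_fixture(broken_fixture):
    # Test body is never executed; pytest counts this as an ERROR, not a failure
    assert True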

Comment on lines 105 to 107
if pr["Tests"] == 0 and pr["Failed tests"]:
print(f"{'--- XML PARSING FAILURE ---':^{len(cols) * col_w}}")
else:
Contributor

Not necessary if marked as in the previous comment. The extra XML PARSING FAILURE string can be absorbed into msg, and I don't see why centring is necessary in addition to all-caps.

Contributor Author

I have removed the all-caps. As explained above, we need to treat this case specially. And I find the centred message much more visually pleasing :).
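For reference, the centring in the quoted hunk relies on Python's format-spec alignment; here is a small standalone illustration in which the column names and width are made up to mimic the script's table variables:

# cols and col_w stand in for the table layout used by the summary script
cols = ["Tests", "Skipped", "Failures", "Errors", "Time", "Failed tests"]
col_w = 12

# ^ centres the message within the full table width (len(cols) * col_w characters)
print(f"{'--- XML parsing failure ---':^{len(cols) * col_w}}")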

testsuite/summarize_tests.py (resolved conversation)
lekshmideepu — this comment was marked as duplicate.

@lekshmideepu lekshmideepu (Contributor) left a comment

There is a typo in the link for posting issues (LN: 130): it should be nest-simulator instead of nest_simulator in the line below.
print(" https://github.com/nest/nest_simulator/issues")

@heplesser (Contributor, Author)

There is a typo in the link for posting issues (LN: 130): it should be nest-simulator instead of nest_simulator in the line below. print(" https://github.com/nest/nest_simulator/issues")

Thanks, fixed with next push.

@lekshmideepu lekshmideepu (Contributor) left a comment

Thanks @heplesser, looks good to me 👍

@gtrensch gtrensch moved this from To do to Review in Build system and CI May 17, 2024

Successfully merging this pull request may close these issues.

Failing MPI tests not properly detected
3 participants