
Implement check-result: test check failure affects test result by default #3239

Open · wants to merge 5 commits into base: main

Conversation


@martinhoyer martinhoyer commented Sep 25, 2024

Trying to address #3185

Pull Request Checklist

  • implement the feature
  • write the documentation
  • extend the test coverage
  • update the specification
  • modify the json schema
  • mention the version
  • include a release note

@martinhoyer martinhoyer added the status | need help (Extra attention is needed), status | need tests (Test coverage to be added for the affected code), status | need docs (Documentation to be added for the affected code) and area | results (Related to how tmt stores and shares results) labels on Sep 25, 2024
@martinhoyer martinhoyer self-assigned this Sep 25, 2024
@martinhoyer martinhoyer changed the title from "Have failure in check result propagate to the test result" to "Implement check-result: test check failure affects test result by default" on Sep 26, 2024
tmt/result.py Outdated

martinhoyer commented Sep 26, 2024

Rebased onto @happz's result-store-original-result #3147 and tried to use beakerlib for the first time. Feels weird to do all the assertgreps, but I've been merely following the existing examples.

Still a proof of concept, but at least I finally figured out that the "after-test" checks were not available without the change in execute/internal.py.


happz commented Sep 26, 2024

@martinhoyer if you change the base branch to the one from #3147, the diff should reduce a bit.

tmt/result.py
tmt/result.py Outdated
checks_failed = any(check.result == ResultOutcome.FAIL for check in self.check)

if interpret == ResultInterpret.RESPECT:
    if self.respect_checks and checks_failed:
Collaborator

respect_checks feels more like an input; IIUIC it's basically interpret, but with respect to checks. So they should enter the method through the same door, so to speak.

Collaborator Author

I see, so you want me to create

class CheckResults(enum.Enum):
    RESPECT = 'respect'
    IGNORE = 'ignore'
...

?

I'm so lost in all the schemas, specs, et al.

Collaborator

I suppose we could re-use ResultInterpret, it has the same purpose, and skip can serve as ignore.

Collaborator

Would this help?

return _result.interpret_result(
    ResultInterpret(invocation.test.result) if invocation.test.result else ResultInterpret.RESPECT,
    ResultInterpret(invocation.test.check_result) if invocation.test.check_result else ResultInterpret.RESPECT
)

Collaborator

> I'm so lost in all the schemas, specs, et al.

It shouldn't be that messy. If I can help somehow, let me know, maybe the whole area needs more documentation.

Collaborator

Yes, something like this, with one exception: I don't understand why Result.check_result exists. How checks should be interpreted is a test-level metadata key, i.e. it should come from the test, not the result. Isn't it the same story as Test.result?

Collaborator Author

Thanks. Honestly that's a leftover from the previous attempts to make things work that I missed and kept.

Collaborator

In the light of #3185 (comment), no check-result on the test level, I suppose...

This means the new check-level result field would reside in the tmt.checks.Check class (and _RawCheck), and the interpretation of check results needs to match each result with the corresponding check in the test.check list...
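
Roughly, the matching could look like this (a sketch with simplified stand-in classes only, not tmt's actual Check or result containers; field names here are illustrative):

    import dataclasses
    import enum


    class CheckResultInterpret(enum.Enum):
        # Hypothetical check-level interpretation values
        RESPECT = 'respect'
        XFAIL = 'xfail'
        INFO = 'info'


    @dataclasses.dataclass
    class Check:
        # Simplified stand-in for tmt.checks.Check: the check carries its own
        # 'result' key saying how its outcome should be interpreted.
        how: str
        result: CheckResultInterpret = CheckResultInterpret.RESPECT


    @dataclasses.dataclass
    class CheckResult:
        # Simplified stand-in for a stored check result.
        name: str
        outcome: str


    def interpretation_of(check_result: CheckResult, checks: list[Check]) -> CheckResultInterpret:
        # Match the stored result with the corresponding check in test.check by name.
        for check in checks:
            if check.how == check_result.name:
                return check.result
        return CheckResultInterpret.RESPECT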

Collaborator Author

Yep

Collaborator Author

Reworked.

@martinhoyer
Collaborator Author

tmt.utils.GeneralError: Test check 'dmesg' was not found in check registry.

perplexing

tmt/result.py Outdated
docs/releases.rst Outdated
spec/tests/check.fmf Outdated
spec/tests/result.fmf Outdated
tmt/result.py Outdated

# Handle individual check results
for check in self.check:
    check_result = ResultInterpret(check.result)
Collaborator

The field should already be ResultInterpret, as mentioned in a comment above WRT its definition. And I just noticed that Test.result isn't an enum, which is definitely incorrect, and a bad example :/

default='respect',
serialize=lambda result: result.value if isinstance(result, enum.Enum) else str(result),
unserialize=lambda spec: spec,
help='How to interpret the check result.')
Collaborator

I think we can switch from Any to ResultInterpret: if we add a from_spec() method to it, unserialize could use it. Might also need a normalize callback, but I'd pay that price for getting rid of Any. With Any, no user of this field can be sure what the value is.
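
For illustration, a minimal sketch of that direction, assuming a hypothetical CheckResultInterpret enum and reusing the callback names from the snippet above (not tmt's actual code):

    import enum


    class CheckResultInterpret(enum.Enum):
        RESPECT = 'respect'
        XFAIL = 'xfail'
        INFO = 'info'

        @classmethod
        def from_spec(cls, spec: str) -> 'CheckResultInterpret':
            # Convert the raw fmf value into the enum, failing loudly on typos.
            try:
                return cls(spec)
            except ValueError:
                raise ValueError(f"Invalid check result interpretation '{spec}'.")

    # The field definition could then avoid Any altogether, e.g.:
    #     default=CheckResultInterpret.RESPECT,
    #     serialize=lambda result: result.value,
    #     unserialize=CheckResultInterpret.from_spec,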

@psss psss added this to the 1.38 milestone Oct 1, 2024
@martinhoyer martinhoyer removed the status | need help (Extra attention is needed) label on Oct 1, 2024
@martinhoyer martinhoyer removed the status | need tests (Test coverage to be added for the affected code) and status | need docs (Documentation to be added for the affected code) labels on Oct 1, 2024
tmt/result.py Outdated
Comment on lines 337 to 395
if interpret == ResultInterpret.XFAIL:
    if self.result == ResultOutcome.PASS and checks_failed:
        self.result = ResultOutcome.FAIL
Collaborator Author

@psss I got a little entangled in the xfail logic, and the "# Handle individual check results" part should probably be done before this.

I wonder if it will be digestible enough for the users, I mean:
an xfail test fails, a check with the default 'respect' fails = FAIL
i.e. users would have to add xfail also to the check, if expected. Could that be a breaking change for some?


Thanks for doing this. I would also like to petition for the following scenario:
if the before-test portion is the only part of the check that fails, the test should get self.result = ResultOutcome.WARN.

The `result` key can be used to specify how the check result should be
interpreted. It accepts three options:
- 'respect' (default): The check result affects the overall test result.
- 'skip': The check result does not affect the overall test result.
Collaborator

So here we're introducing a new value skip which is currently not present in the Test.result key specification. What about keeping the consistency here as well and supporting pass, info, warn, error, fail in the very same way?

Collaborator

Hmmm, skip is definitely one of the values of https://github.com/teemtee/tmt/blob/main/tmt/result.py#L43, and it's also allowed by the schema, https://github.com/teemtee/tmt/blob/main/tmt/schemas/common.yaml#L481. If it's not supposed to be supported as a result value, we should remove it from these places as well.

Collaborator

Although, I'd argue the specification should be updated instead, because skip is a perfectly valid test outcome, like pass or fail, and it makes sense that one might want to enforce it as a test outcome via result, just like one might want to enforce pass or xfail.

Collaborator

And +1 to accepting all keys allowed by ResultInterpret, i.e. why allow skip and not xfail or pass? Except custom, that one does not make much sense for a test check.

Collaborator

And restraint, restraint is also useless at this point of code :)

@happz happz Oct 4, 2024

> To be honest, I got pretty frustrated with this code. I guess I'm too dumb for mixing custom mixins, uninheritable classes, specifications, schemas, et al. I'm sure there are good reasons for all of this, but idk, somehow I think doing a relatively small change should be easier.

That's sad to hear. I believe that while it might be a relatively small change, it's not a small change given the interaction of test and test check results. Schema, specification, the fact that enum inheritance is limited, well, that's a fact we can't bypass.

> Anyway, should I create some BaseResultInterpretMixin class w/ SerializableContainer?

I for one don't think so. Where do you think it would be needed?

> I tried to go back to using 'ignore', but now we're mixing two things together, results and how to interpret them. :(

We can get on a call and try to get through the parts that cause you trouble, if you want to. Mixing the result and its interpretation is definitely a sign of a wrong turn somewhere.

Collaborator

Yeah, check_result = CheckResultInterpret(check.result.value) is definitely not the right move: check.result.value is the outcome, while the interpretation must come from the check, not from its result. That's also true for the test, and you can see how the test result interpretation gets in, via invocation.test.result in the _result.interpret_result(invocation.test.result) call - but we do not have the same path for test checks, and as I was saying, their interpretation needs to be given to interpret_result(); as soon as you try to extract it from a result, you're on the wrong track.

I don't want to start rewriting the patch for you, but you obviously do not enjoy the coding at this moment, if it would help, we can sit on it.

Collaborator Author

> I for one don't think so. Where do you think it would be needed?

It's just that ResultInterpret is not exactly DRY.
The only thing that came to mind, apart from using str literals instead of enums.

> We can get on a call and try to get through the parts that cause you trouble, if you want to.

Thanks, much appreciated.
It seems like we strayed a bit too far from the original goal, and while we have the check result key, it's used the same way as the test result, and I'm not sure it is the best way to let users define that the check result should be "ignored".

Collaborator Author

> I don't want to start rewriting the patch for you, but you obviously do not enjoy the coding at this moment, if it would help, we can sit on it.

Thanks @happz, I'll get to it with a pair of fresh eyes later. Sorry for letting my depression leak into the comments here.

Collaborator

Ok, as you wish. Just a couple of notes:

  • something like this could work for propagating how check results should be interpreted - check_result = CheckResultInterpret(check.result.value) is definitely not the right way to get this info.
    -        return _result.interpret_result(invocation.test.result)
    +        return _result.interpret_result(
    +            invocation.test.result,
    +            {
    +                check.how: check.result
    +                for check in invocation.test.check
    +            }
    +        )
    
         def interpret_result(
                 self,
    -            interpret: ResultInterpret) -> 'Result':
    +            interpret_test: ResultInterpret,
    +            interpret_checks: dict[str, CheckResultInterpret]) -> 'Result':
  • use a helper function for setting a note; the repetition of if/else consumes four lines for every single note, and a helper function would take care of the ', ...' and the condition, greatly improving the readability of the code (a rough sketch follows after these notes).
  • this looks like a mistake, a check result propagated into the test result - is that correct? We probably don't want a check result to be promoted to the main result; instead, failed checks would turn the main result into a failed one - a skipped check does not mean the main result should turn to SKIP
               elif interpret_check in [CheckResultInterpret.PASS, CheckResultInterpret.FAIL,
                                     CheckResultInterpret.INFO, CheckResultInterpret.WARN,
                                     CheckResultInterpret.ERROR, CheckResultInterpret.SKIP]:
                   self.result = ResultOutcome(check.value)
  • the order should probably be reversed: first we need to sort out check results, because they might be PASS but their result key might be set to fail, turning them into failed checks - any(check.result == ResultOutcome.FAIL for check in self.check) will miss checks that would turn into failed checks because of their interpretation.
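
A minimal sketch of the note helper mentioned above (names are illustrative; the _set_note() call seen in a later commit could wrap something like this around self.note):

    from typing import Optional


    def append_note(note: Optional[str], new: str) -> str:
        # Extend an existing note with ', <new>', or start a fresh one.
        return f'{note}, {new}' if note else new

    # usage: self.note = append_note(self.note, 'check failed')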

Failing 'check' should affect test result by default.
respect_checks option needs to be implemented.
    self.note += ', check failed'
else:
    self.note = 'check failed'

self._set_note('check failed')
Collaborator

This branch does look superfluous to me: check.result should be changed, not self.result - at this point, we're deciding the fate of a single check result, and the test result is beyond the horizon. The effect of this check result on the test result will be evaluated later. Which means, the outcome of the test did not change in any way, therefore no need for a note -> no need to check for RESPECT + FAIL.

Collaborator Author

makes sense

else:
    self.note = f'check result: {check.result.value}'

# We don't promote check result to main result, but failed checks turn the
# main result to FAIL
Collaborator

We don't know that yet: if the test has result: info, it does not matter what the outcome of the checks is, they should never override the main test result, because the user clearly said which value is desired and final. We still need to resolve them, to record their final outcomes, though, therefore we can't skip check results even when result: skip is set.

I have a feeling you are getting too entangled in the problem. My advice would be to ignore the other part of the problem: focus on the interpretation of one single check result, and completely ignore what it might do to the test result. Go over all check results, and interpret every single one of them as needed. Ignore whatever their effect might be on the main result.

Only after that, take self.result and interpret_test, collect the final check results, and resolve their interaction. If test.result is SKIP, PASS, etc., set self.result and exit. If it's RESPECT, compute the effect of check results, switch self.result to FAIL or WARN, exit. And so on. We are not in the business of saving CPU cycles, make it correct first, and then we can make it fast by introducing shortcuts. We can afford crude and readable code :)

@happz happz Oct 4, 2024

Maybe even one more: how about ignoring this method for now, and instead constructing a simple method that would focus on the interpretation only. Feed it with ResultOutcome and ResultInterpret for the test, a map of check name/ResultOutcome and a map of check name/CheckResultInterpret, and let it decide the final test result, final check results, and optionally a note. Add a parametrized unit test for it for various combinations. Do not feed it with Result or CheckResult or any structures, just enums and strings. Get that right, get it working, then we can discuss just how this process works, whether e.g. SKIP has the desired effect. No worrying about other structures. Once we are happy, translate it to this method, i.e. let it write decisions into structures.
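
For illustration, a possible skeleton of such a method, with simplified enums and deliberately crude rules; the exact decision table is what the parametrized unit test would pin down:

    import enum


    class ResultOutcome(enum.Enum):
        PASS = 'pass'
        FAIL = 'fail'
        WARN = 'warn'
        SKIP = 'skip'


    class ResultInterpret(enum.Enum):
        RESPECT = 'respect'
        XFAIL = 'xfail'
        SKIP = 'skip'


    def interpret(
            test_outcome: ResultOutcome,
            test_interpret: ResultInterpret,
            check_outcomes: dict[str, ResultOutcome],
            check_interprets: dict[str, ResultInterpret],
            ) -> tuple[ResultOutcome, dict[str, ResultOutcome], str]:
        notes: list[str] = []

        # Phase 1: resolve every check result on its own, ignoring the test result.
        final_checks: dict[str, ResultOutcome] = {}
        for name, outcome in check_outcomes.items():
            how = check_interprets.get(name, ResultInterpret.RESPECT)
            if how == ResultInterpret.XFAIL and outcome in (ResultOutcome.PASS, ResultOutcome.FAIL):
                # Expected failure: flip pass/fail for this check.
                outcome = ResultOutcome.PASS if outcome == ResultOutcome.FAIL else ResultOutcome.FAIL
                notes.append(f'check {name} interpreted as xfail')
            final_checks[name] = outcome

        # Phase 2: let the resolved check results interact with the test result.
        if test_interpret == ResultInterpret.SKIP:
            # An enforced test result wins over the check outcomes.
            return ResultOutcome.SKIP, final_checks, ', '.join(notes)
        if test_interpret == ResultInterpret.RESPECT and any(
                outcome == ResultOutcome.FAIL for outcome in final_checks.values()):
            notes.append('check failed')
            return ResultOutcome.FAIL, final_checks, ', '.join(notes)
        return test_outcome, final_checks, ', '.join(notes)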

Collaborator Author

Thanks again. I'll get to it tomorrow.

> Once we arr

🏴‍☠️

Collaborator Author

> I'll get to it tomorrow.

hmm, looks like a busy weekend. Monday it is.
