Bats tests don't test for shell code correctness #74

h3xx · 2022-12-19T16:23:48Z

The tests in tests/*.bats don't test for the correctness or usability of the shell code generated.

All of the tests merely compare the output against a cached copy of what it assumes the output should be.

This leads to inflexible tests, and more effort to make trivial changes.

An example of a better test methodology might be:

Ensure the script output is the same across all versions of PHP
- I.e. Run in PHP 8.2, then ensure it matches the output from running in PHP 7.4, then ensure it matches PHP 8.0, etc.
Run shellcheck against the generated output

Excerpt of my original comment in #73 (~43 lines)

It is a little concerning that the test that failed depends on a byte-for-byte comparison against a copy of the code it tests.

I recall an article I read a long time ago about a dev who built a unit test that contained an exact copy of the code being tested, a particularly tricky algorithm. The dev said they were okay with updating it twice (once in the source and once in the unit test) stating it was worth 100% code coverage.

Reasons not to do this:

1. It doesn't actually test the code for correctness.

Since it's just a simple text diff, it doesn't matter how the code works, just that it's the same.

Furthermore, if there's a bug in both the source and the copy, the test won't show it - in fact, it'll pass! Instead of testing for the presence of bugs, this test actually presents a barrier to fixing the bug since it must now be fixed in multiple places.

(I'm sure there are other tests that test for correctness, but my point is that this one doesn't.)

2. It deliberately violates DRY (Don't Repeat Yourself).

The code has to exist in two places, and must be updated in both places.

I suppose this could be fixed by copying the source file into the cache before starting, but that IMO would be equally silly.

3. It's extra development overhead.

It's an extra step to making changes, and can be confusing to new developers (it confused me).

It can also muddy the intent of commits in the git history; it may be harder to grok the change intended by the dev, if the change is made n times instead of just one. It's the same reason I avoid committing auto-generated files alongside the files they were generated from - just having one source of truth is best.

Originally posted by @h3xx in #73 (comment)

The text was updated successfully, but these errors were encountered:

ktomk · 2022-12-21T17:10:17Z

As commented, I have looked into the build now. It also got some updates, also towards building with more ease: we can just make now. If local doesn't work, CI workflow on Github also now with noise removed and PHP 8.2. This is what v1.5.5 has just been released with. Thanks again for your contribution.

The issue description makes sense to me.

I'm already using shellcheck to test bash script generation output in my run .travis.yml project, so I'm quite sure it is viable. I'll add that for starters.

I'm also cleaning out a rough edge for the bats testing so it installs within the project tree for the in-tree build.

Comparing against a master (you suggest from one PHP version) would add the benefit to spare to store the master within the repository. I'll take a look, right now I'm actually scratching my head why there are two master files:

tests/fixtures/bash/cached.txt
tests/fixtures/bash/default.txt

and combined with that, why there are two php files:

resources/bash/cached.php
resources/bash/default.php

The default one could just be build from the cached one if I get this right. And likewise the txt files are only to test for both such cases which really looks redundant.

This just FYI, will now look forward the changes as proposed.

Btw. on which platform are you developing?

h3xx · 2022-12-21T22:36:59Z

Btw. on which platform are you developing?

My main system is Slackware Linux x86_64. I have Docker installed, so potentially I could test on all versions of Ubuntu as well.

ktomk · 2022-12-24T10:54:20Z

Btw. on which platform are you developing?

My main system is Slackware Linux x86_64. I have Docker installed, so potentially I could test on all versions of Ubuntu as well.

I'm on Ubuntu myself, also w/ Docker. Reminds me that next to shellcheck'ing we could also summon a matrix w/ Docker containers. But regardless: Do you have any clue on how to automatically test the autocompletion after sourced?

We already have test application in fixtures (acme), so that part would be covered, I'm merely thinking about how to test the auto-complete. So far I'm doing it manually after sourcing for the changes but this is of a similar burden.

I'd also already see it an improvement if this is for bash only as I can imagine this is different for each shell. Feedback, even rough, appreceated.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bats tests don't test for shell code correctness #74

Bats tests don't test for shell code correctness #74

h3xx commented Dec 19, 2022

1. It doesn't actually test the code for correctness.

2. It deliberately violates DRY (Don't Repeat Yourself).

3. It's extra development overhead.

ktomk commented Dec 21, 2022

h3xx commented Dec 21, 2022

ktomk commented Dec 24, 2022

Bats tests don't test for shell code correctness #74

Bats tests don't test for shell code correctness #74

Comments

h3xx commented Dec 19, 2022

1. It doesn't actually test the code for correctness.

2. It deliberately violates DRY (Don't Repeat Yourself).

3. It's extra development overhead.

ktomk commented Dec 21, 2022

h3xx commented Dec 21, 2022

ktomk commented Dec 24, 2022