IGNORE RDIFF & testing examples in CRAN release #538

jarioksa · 2022-10-05T08:28:38Z

We have had reference output for vegan examples for a decade. In principle, it is useful as it can spot unwanted changes and regression in the results of functions. However, it is of limited use because several results are numerically unstable, and comparison against reference output always gives many screenfuls of output hiding output from regressions. For this reason we have also removed the reference output from CRAN releases. For some time R has enabled marking numerically unstable segments so that these are not compared to the reference output, and this could allow having more useful detection of other changes. Such sections can be marked adding special comments to the examples in man pages:

## IGNORE_RDIFF_BEGIN
scores(cca(dune)) # signs can flip in eigenvector ordination
## IGNORE_RDIFF_END

Numerically unstable sections are mainly in two main groups:

Last decimals or almost-zero values can differ in last bits depending on hardware, operating systems, R version, variant of BLAS etc.
Sings of eigenvectors are arbitrary, and in addition to reasons above, it seems that these can change by phase of moon and positions of trams in Helsinki network (see sporalogy).

In proper tests these would be handled allowing fuzz in numerical comparison or using absolute values of eigenvectors, but the examples are primarily examples, although they can be used as additional tests as well.

I have now started marking these sections in man pages. The current reference output (in vegan/tests/Examples/vegan-Ex.Rout.save) is based on my tests with Apple Silicon M1 & M2 and Apple's native BLAS ("neural engine"). The first markings were based on running tests with the same hardware and software except using R reference BLAS, and testing vegan in WinBuilder strains of R (and it seems that ordisurf.Rd still needs a bit more extensive marking). I am not sure that the marking is yet sufficient to have the reference output in the next CRAN release, but we can start making reference output useful to all developers and mark the unstable sections. Naturally, it is good to understand why the section is unstable to avoid marking sections in examples when we should fix code. It can also make sense to to change example, but we must remember that they are primarily examples and should be user-friendly and readable.

Random number generator is a potential source of numerical differences. However, this is something that we should avoid marking out. In principle, same seed should give the same random number sequences in all versions of R (since 3.6-0) in all platforms, and if there are differences, the reasons should be analysed instead of marking out.

NB., even if we do not include reference output of examples, the examples are run in CRAN checks and in github, and these tests will detect errors.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

IGNORE RDIFF & testing examples in CRAN release #538

IGNORE RDIFF & testing examples in CRAN release #538

jarioksa commented Oct 5, 2022 •

edited

IGNORE RDIFF & testing examples in CRAN release #538

IGNORE RDIFF & testing examples in CRAN release #538

Comments

jarioksa commented Oct 5, 2022 • edited

jarioksa commented Oct 5, 2022 •

edited