
Check models compatibility in Docker rebuild GH Actions workflow #719

Draft: wants to merge 14 commits into main
Conversation

juhoinkinen (Member) commented Jul 6, 2023

The recently added GH Actions workflow for rebuilding Docker images (#715) could also verify that models trained on the previous image build produce identical results in the new image. It would be quite undesirable for models to behave even slightly differently in different Docker image builds of the same Annif version.

These are the steps in the workflow that aim to verify model compatibility and identical results (a rough sketch of the steps follows the list):

  1. Train models with all (trainable) algorithms using the old image (the one in quay.io with the tag being rebuilt)
  2. Evaluate the models using the old image and store the results in an eval.prev.out file
  3. Evaluate the models using the new image and store the results in an eval.out file
  4. Compare eval.prev.out and eval.out with diff, and fail the workflow if they differ, unless the box allowing differences was checked when triggering the workflow
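A minimal sketch of what these steps might look like in the workflow. The image tags, the example project ID (`tfidf-en`), the mount path and the `allow-differences` input are illustrative assumptions, not the actual workflow contents:

```yaml
# Hypothetical sketch of the compatibility-check steps; image tags,
# project ID and the allow-differences input are illustrative only.
- name: Train models using the old image
  run: |
    docker run --rm -v "$(pwd):/annif-projects" \
      quay.io/natlibfi/annif:old-tag \
      annif train tfidf-en tests/corpora/archaeology/fulltext/

- name: Evaluate models using the old image
  run: |
    docker run --rm -v "$(pwd):/annif-projects" \
      quay.io/natlibfi/annif:old-tag \
      annif eval tfidf-en tests/corpora/archaeology/fulltext/ > eval.prev.out

- name: Evaluate models using the new image
  run: |
    docker run --rm -v "$(pwd):/annif-projects" \
      annif:new-build \
      annif eval tfidf-en tests/corpora/archaeology/fulltext/ > eval.out

- name: Compare evaluation results
  run: |
    # Fail on any difference, unless the workflow_dispatch checkbox
    # allowing differences was checked when triggering the workflow.
    if ! diff eval.prev.out eval.out; then
      [ "${{ github.event.inputs.allow-differences }}" = "true" ] || exit 1
    fi
```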

For both training and evaluation the tests/corpora/archaeology/fulltext/ corpus is used, which I think is fine for all algorithms, although there could be dedicated corpora for this.

Also, there could be a similar workflow for checking model compatibility when preparing an Annif release, instead of doing the compatibility checks manually. The compatibility-check steps could then be moved to a separate action for reusability, like the prepare action of the CI/CD workflow; see the sketch below.
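Such a reusable step could look roughly like a composite action. Everything here, including the action name, its inputs and the evaluated project, is a hypothetical sketch rather than the actual implementation:

```yaml
# .github/actions/check-model-compatibility/action.yml
# Hypothetical composite action; name, inputs and paths are illustrative.
name: Check model compatibility
description: Evaluate pre-trained models with two images and compare results
inputs:
  old-image:
    description: Image whose evaluation results serve as the baseline
    required: true
  new-image:
    description: Freshly built image to verify against the baseline
    required: true
runs:
  using: composite
  steps:
    - name: Evaluate with both images and compare results
      shell: bash
      run: |
        docker run --rm -v "$(pwd):/annif-projects" "${{ inputs.old-image }}" \
          annif eval tfidf-en tests/corpora/archaeology/fulltext/ > eval.prev.out
        docker run --rm -v "$(pwd):/annif-projects" "${{ inputs.new-image }}" \
          annif eval tfidf-en tests/corpora/archaeology/fulltext/ > eval.out
        diff eval.prev.out eval.out
```

Both the rebuild workflow and a release workflow could then call it via `uses: ./.github/actions/check-model-compatibility`, passing the two image references as `with:` inputs.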

Note: I've been working on this in my own fork, to avoid accidental image pushes to quay.io.

TODO before merge:

  • Switch the image that the current build is compared against from jinkinen/annif to quay.io/natlibfi/annif

codecov bot commented Jul 6, 2023

Codecov Report

Patch and project coverage have no change.

Comparison is base (320af2b) 99.67% compared to head (07f0af7) 99.67%.

❗ Current head 07f0af7 differs from pull request most recent head 569b367. Consider uploading reports for the commit 569b367 to get more accurate results

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #719   +/-   ##
=======================================
  Coverage   99.67%   99.67%           
=======================================
  Files          89       89           
  Lines        6380     6380           
=======================================
  Hits         6359     6359           
  Misses         21       21           


sonarcloud bot commented Jul 6, 2023

Kudos, SonarCloud Quality Gate passed!

Bugs: 0 (rating A)
Vulnerabilities: 0 (rating A)
Security Hotspots: 0 (rating A)
Code Smells: 0 (rating A)

No Coverage information
No Duplication information

juhoinkinen (Member, Author) commented

When #762 is merged, the upload/download functionality could be utilized for the model compatibility check. By downloading the models (maybe from the GitHub Actions cache?), this first step could be omitted:

  1. Train models with all (trainable) algorithms using the old image (the one in quay.io with the tag being rebuilt)
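A hypothetical sketch of how actions/cache could gate the training step so it only runs on a cache miss. The cache key, mount path, project ID and the `tag` input are assumptions for illustration:

```yaml
# Hypothetical: restore previously trained models from the Actions cache,
# so the training step can be skipped when a cached copy exists.
- name: Restore trained models
  id: model-cache
  uses: actions/cache@v3
  with:
    path: data/
    key: annif-models-${{ github.event.inputs.tag }}

- name: Train models using the old image
  if: steps.model-cache.outputs.cache-hit != 'true'
  run: |
    docker run --rm -v "$(pwd):/annif-projects" \
      quay.io/natlibfi/annif:${{ github.event.inputs.tag }} \
      annif train tfidf-en tests/corpora/archaeology/fulltext/
```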
