Skip to content

afni/afni_ci_test_data

Repository files navigation

Test data management

This repository stores the data required for AFNI's continuous integration testing on CircleCI (and testing run locally with the run_afni_tests.py tool). If you are not using this tool but still wish to use data from this repository you must install datalad if you have not already.

To get the repository:

datalad install https://github.com/afni/afni_ci_test_data.git

Adding more data to the test data repository

N.B. You need to modify your data repository to have push access to the afni server to do this. See git configuration in the trouble shooting tips below.

Adding arbitrary data

Examples of adding data to the repository can be seen in the setup script in this repository. Data can be extracted from archives at URLs, installed via datalad from other online repositories (OpenNeuro and the like), or created on the fly by executing an AFNI command that generates data that is then saved to the repository. For examples, read through the initial setup of this repository

Updating sample test data

When new tests are added, or the expected output of a particular test changes you will need to add some more data to this repository. Follow these instructions to do this...

Be very careful that the behavior from your tested function and the test is what you want. This will create the new "correct" output. It is best to run all of the test commands inside the development container. This will reduce confusing results on circleci (variation in the output due to differences in environment). The best way to start a container is to follow the instructions emitted when you attempt to run container testing in debug mode i.e. run_afni_tests.py -d container

  1. Run all tests (they should all pass except the ones you wish to create data for)
cd /opt/afni/src/tests;                        \
./run_afni_tests.py                            \
    --build-dir /opt/afni/build                \
    -u                                         \
    -e='--runveryslow '                        \
    local
  1. Rerun the failed tests in "create sample output" mode.
./run_afni_tests.py                            \
    --build-dir /opt/afni/build                \
    -u                                         \
    -e='--runveryslow --create-sample-output'  \
    --lf                                       \
    -v verbose                                 \
    local
  1. Copy the sample data into the test data tree. You will need to modify the directory path that contains a datestamp below and be careful with trailing slashes:
rsync -aP                                                               \
    sample_output_of_tests/sample_output_2020_10_28_180130/             \
    afni_ci_test_data/sample_test_output/
  1. Add them to the datalad repo
cd afni_ci_test_data
datalad save -m 'update sample output data'
  1. Publish to the github and afni server. There is a way to configure the repository so that the following is a single command. I could not get it to work robustly across repository installations though:
datalad publish --transfer-data=all --to=origin
datalad publish --transfer-data=all --to=afni_ci_test_data

Note: the afni server stores all the binary blobs (the actual data). These files need to be globally accessible. The git repository doesn't need to look up to date, so for example it is fine to have an old version of master checked-out. The data will look out of date but it will be providing the appropriate files for download when requested.

  1. Fix permissions on the server... I can't find a work around for this right now.

Fixing permissions:

find /fraid/pub/dist/data/afni_ci_test_data -type d  ! -perm -o=x -exec chmod o+x {} \;; find /fraid/pub/dist/data/afni_ci_test_data -type f  ! -perm -o=r -exec chmod o+r {} \;
  1. Test that the data can be fetched:
cd /tmp
datalad install https://github.com/afni/afni_ci_test_data.git
cd afni_ci_test_data
# Insert your actual sample_test_output that you wish to test for
datalad get sample_test_output/3dTproject

If the above doesn't work start working through the trouble-shooting tips below. If it does then celebrate and do the dance of joy.

  1. Update your reference to the data in the main code repository
cd ~/afni # or the path to your afni source code
git add tests/afni_ci_tests_data
git commit -m 'update data for tests'

Troubleshooting tips:

git configuration

The file ./git/config file in the afni_ci_test_data repository does not contain an entry that looks like the following you may have some issues:

[remote "afni_ci_test_data"]
    url = https://afni.nimh.nih.gov/pub/dist/data/afni_ci_test_data/.git
    pushurl = afni.nimh.nih.gov:/fraid/pub/dist/data/afni_ci_test_data
    fetch = +refs/heads/*:refs/remotes/afni_ci_test_data/*
    annex-bare = false
    annex-uuid = c1ce38d5-c2ef-48c6-a1f2-e207215d0717

Specifically, you should have a pushurl configured. If you do not, datalad publish will try and fail to write via https. Also in the annex section the version should likely be 7 not 8.

Consulting the ghosts of the past

Check commits logs in the afni_ci_test_data repo, these provide an excellent guide to commands that were used to add data to the repository.

Fixing browser access

Due to NIH server configuration constraint the files need to be indexed if you wish to access them directly via a browser. This could be implemented as a git hook that is executed on push if that is desirable. Overall I think having browser access is not required though. It can make things confusing to try to support this functionality. The @make.directory.index script is stored in repository if you need it, add it to your path:

@make.directory.index -nested -dirs /fraid/pub/dist/data/afni_ci_test_data

annex remote accessibility

Sometimes when the remote is not accessible the annex remote will be disabled. This can be undone by modifying .git/config or using git-annex enableremote afni_ci_test_data

Potential quick fix

Sometimes the quick and easy fix is to run datalad update (usually in combination with permissions fixes and version changes).