Skip to content

datajoint/datajoint-file-validator

 
 

Repository files navigation

DataJoint File Validator

Test

This repository contains a Python package that validates file sets for DataJoint pipelines.

Installation

Install Locally

pip install git+https://github.com/datajoint/datajoint-file-validator.git

Dev Container

This repository includes a devcontainer configuration for Visual Studio Code. To use it, either:

  1. Install the Remote - Containers extension in VS Code and open the repository in a container.
  2. Open the devcontainer in GitHub Codespaces:

Launch devcontainer in GitHub Codespace instance

Quick Start

Validate a fileset against an existing manifest:

from datajoint_file_validator import validate

my_dataset_path = 'tests/data/filesets/fileset0'
manifest_path = 'datajoint_file_validator/manifests/demo_dlc/v0.1.yaml'
success, report = validate(my_dataset_path, manifest_path, verbose=True, format='json')
# Validation failed with the following errors:
# [
#  {
#    "rule": "Min total files",
#    "rule_description": "Check that there are at least 6 files anywhere in the fileset",
#    "constraint_id": "count_min",
#    "constraint_value": 6,
#    "errors": "constraint `count_min` failed: 4 < 6"
#  }
#]

print(success)
# False

Alternatively, validate using the included command line interface:

$ datajoint-file-validator validate tests/data/filesets/fileset0 datajoint_file_validator/manifests/demo_dlc/v0.1.yaml
❌ Validation failed with 1 errors!
┏━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━┓
┃                ┃ Rule           ┃               ┃ Constraint    ┃                ┃
┃ Rule ID        ┃ Description    ┃ Constraint ID ┃ Value         ┃ Errors         ┃
┡━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━┩
│ Min total      │ Check that     │ count_min     │ 6             │ constraint     │
│ files          │ there are at   │               │               │ `count_min`    │
│                │ least 6 files  │               │               │ failed: 4 < 6  │
│                │ anywhere in    │               │               │                │
│                │ the fileset    │               │               │                │
└────────────────┴────────────────┴───────────────┴───────────────┴────────────────┘

Author

Ethan Ho @ethho

License

MIT

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%