Define validation results JSON format to be used for MVP #49

jcadam14 · 2024-01-29T16:44:43Z

Need to determine JSON format for the validation results. This may just be the format coming from the data-validator, but need to go over that format with the group and decide if it's still relevant or needs to be adjusted. This most likely will end up being a story in the data-validator repo, or could be massaging that data here after retrieving it from the validator.

jcadam14 · 2024-03-19T20:10:38Z

multifield_validation_error.json
validation_failures.json

hkeeler · 2024-03-20T16:45:41Z

A few things we might want to consider...

Do we want to include links to the FIG in the JSON message. There's a couple places where that could make sense.
1. validation - Currently the anchors to the validation ids don't match anything in the validator (Anchor tags for data validation checks sbl-content#12). We've said we now plan to fix that in the CMS, so once that's done, the frontend could build those URLs, but it seems more convenient to just give the URLs to the frontend.
2. fields - This feels like bonus points. If we link to the validations in the FIG, those then direct you to the fields...though they're not links there either. 🤔 If we did decide to do this, though, note that the anchors there dash-cased, not snake_cased like the actual fields, so we'd have to do a little conversion.
We should add a wrapping element around the top-level array. That'll let us add additional metadata about the validation results, such as...
1. Stats like number of errors, warnings, etc.
2. Paginiation info
3. A link to the csv download
record_no is zero-indexed. Do we want to use one-based indexing instead? Seems less confusing to end users. Of course, that could still get them off-by-one since a CSV has a header row. I'm hesitant to put line_no in though since the validator needs to support other formats besides CSV int the future (JSON), and line number has no meaning in that case.
- Also, HMDA uses a "ULI" over line number.
description does not have the same rich formatting as the FIG. For instance, we drop the bullet lists, and it's more like multiple sentences. We could do more there, but I think that'd largely depend on how we want to show that info on the frontend...if at all?
Do we need human-readable field names vs. the snake_cased column names?

jcadam14 · 2024-03-20T18:49:36Z

We should add a wrapping element around the top-level array. That'll let us add additional metadata about the validation results, such as...

Stats like number of errors, warnings, etc.

Paginiation info

A link to the csv download

For the csv download, we were thinking an endpoint like /submissions/latest/result_download or /submissions/{id}/results_download. So including a link in metadata would be odd, in my brain, since that should be static. Unless we want them to be able to download specific chunks, like a paginated csv but I'd question the usefulness of that. The pagination info we could add to the results if we're going with a paginated /submissions/latest/results and/or /submissions/{id}/results paginated endpoint which I think for MVP is desired right?

jcadam14 · 2024-03-20T18:51:56Z

record_no is zero-indexed. Do we want to use one-based indexing instead? Seems less confusing to end users. Of course, that could still get them off-by-one since a CSV has a header row. I'm hesitant to put line_no in though since the validator needs to support other formats besides CSV int the future (JSON), and line number has no meaning in that case.

Also, HMDA uses a "ULI" over line number.

I like using the UID. We validate it's unique for each entry and probably something that makes sense to the FI submitting, and it's easier to search for in their original submitted data than scrolling to find a row number.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define validation results JSON format to be used for MVP #49

Define validation results JSON format to be used for MVP #49

jcadam14 commented Jan 29, 2024

jcadam14 commented Mar 19, 2024 •

edited

hkeeler commented Mar 20, 2024 •

edited

jcadam14 commented Mar 20, 2024

jcadam14 commented Mar 20, 2024 •

edited

Define validation results JSON format to be used for MVP #49

Define validation results JSON format to be used for MVP #49

Comments

jcadam14 commented Jan 29, 2024

jcadam14 commented Mar 19, 2024 • edited

hkeeler commented Mar 20, 2024 • edited

jcadam14 commented Mar 20, 2024

jcadam14 commented Mar 20, 2024 • edited

jcadam14 commented Mar 19, 2024 •

edited

hkeeler commented Mar 20, 2024 •

edited

jcadam14 commented Mar 20, 2024 •

edited