Add tests from JSONTestSuite #140

gavlyukovskiy · 2024-05-04T17:09:38Z

This PR adds all tests from JSONTestSuite. That suite has 3 types of tests:

y_* are valid JSONs
i_* are invalid JSONs that are allowed to be parsed, usually some large numbers and/or invalid UTF-8 sequences
n_* are invalid JSONs, for this the normal parser should fail, however we can process all of them except 2 that have 100k nesting level (P.S. Don't try opening them in IDEA 😄)

I've copied all of them to our repository and established following test cases

mustPassSuiteWithNoopMaskerShouldBeEquivalentToJackson

y_* tests must pass. I'm using ValueMaskers.withRawValueFunction(value -> value) as a more complicated noop masker to make sure that we always correctly track values, transform them to strings and back to bytes. The results of this are equivalent when parsed by Jackson

mayPassSuiteWithNoopMaskerShouldNotFail

i_* tests must not fail with and ideally should be equivalent to how Jackson parses it (if it can). There are 4 exceptions when java.lang.String treats invalid UTF-8 characters differently than Jackson does

mustFailSuiteWithNoopMaskerShouldOnlyFailWithInvalidJsonException

n_* tests must fail for a normal JSON parser, however we allow it not to fail, but if it does fail, then it must be with InvalidJsonException. With ValueMaskers.withRawValueFunction (or default masking options) there are only 2 test cases that are failing due to internal StackOverFlowError.

shouldMaskAllTestCasesPredictably

For all test files (including n_*) I've simply asserted the masker json we currently produce.

mustPassSuiteWithNoopTextFunction

Verifies that all test files that contain correct UTF-8 sequences we properly decode it in withTextValueFunction. There are 26 i_* test files where we fail on converting invalid sequences.

For the withTextValueFunction function the question is whether we should continue to fail on those 26 test cases or take the behavior of java.lang.String of converting most invalid sequences into � and passing that to the user provided function?

Fixes

This PR has couple of fixes for cases when we get out of bounds or throw internal exception instead of InvalidJsonException. All of the fixes only apply to n_* and i_* test cases, nothing critical was found during the testing 😃

Hard decisions time!

The main problem is what to do with n_* test cases?

The main problem is that we're not actually parsing the values so we do not reject things like invalid numbers or invalid escapes (except for withTextValueFunction. We also do not fail on invalid json structures - for example in arrays we don't strictly need to see a comma and ignore anything unknown and mask values if we detect them. With that for most of the test cases we actually mask them in more or less predictable ways, for example

["" -> ["***" (n_array_unclosed.json)
[ , ""] -> [ , "***"] (n_array_missing_value.json)

However some test cases are leaking information or giving weird results

abc -> abc (n_string_single_string_no_double_quotes.json) when string doesn't have quotes then we simply leave it as is
['single quote'] -> ['single quo"&&&" (n_string_single_quote.json) we ignore the value started with a single quote and accidentally think that we've found a boolean to mask

In shouldMaskAllTestCasesPredictably I've asserted the current status, so it would be useful to go over them and decide whether we should do anything special about each one of those.

Closes #121

github-actions · 2024-05-04T17:16:47Z

Note

These results are affected by shared workloads on GitHub runners. Use the results only to detect possible regressions, but always rerun on more stable machine before making any conclusions!

Benchmark results (pull-request, `02e873c`)

Benchmark                                                          (characters)  (jsonPath)  (jsonSize)  (maskedKeyProbability)   Mode  Cnt        Score       Error   Units
BaselineBenchmark.countBytes                                            unicode         N/A         1kb                     0.1  thrpt    4  2597878.760 ± 55214.880   ops/s
BaselineBenchmark.countBytes:gc.alloc.rate.norm                         unicode         N/A         1kb                     0.1  thrpt    4       ≈ 10⁻⁴                B/op
BaselineBenchmark.jacksonParseAndMask                                   unicode         N/A         1kb                     0.1  thrpt    4    29691.626 ±   405.803   ops/s
BaselineBenchmark.jacksonParseAndMask:gc.alloc.rate.norm                unicode         N/A         1kb                     0.1  thrpt    4    65360.007 ±     0.009    B/op
BaselineBenchmark.jacksonParseOnly                                      unicode         N/A         1kb                     0.1  thrpt    4    51922.189 ±  1330.396   ops/s
BaselineBenchmark.jacksonParseOnly:gc.alloc.rate.norm                   unicode         N/A         1kb                     0.1  thrpt    4    24352.004 ±     0.001    B/op
BaselineBenchmark.regexReplace                                          unicode         N/A         1kb                     0.1  thrpt    4     5135.173 ±   158.267   ops/s
BaselineBenchmark.regexReplace:gc.alloc.rate.norm                       unicode         N/A         1kb                     0.1  thrpt    4    61656.036 ±     0.003    B/op
JsonMaskerBenchmark.jsonMaskerBytes                                     unicode       false         1kb                     0.1  thrpt    4   443265.436 ±  7721.031   ops/s
JsonMaskerBenchmark.jsonMaskerBytes:gc.alloc.rate.norm                  unicode       false         1kb                     0.1  thrpt    4     2232.000 ±     0.001    B/op
JsonMaskerBenchmark.jsonMaskerBytes                                     unicode        true         1kb                     0.1  thrpt    4   633252.596 ± 17046.233   ops/s
JsonMaskerBenchmark.jsonMaskerBytes:gc.alloc.rate.norm                  unicode        true         1kb                     0.1  thrpt    4     1280.000 ±     0.001    B/op
JsonMaskerBenchmark.jsonMaskerString                                    unicode       false         1kb                     0.1  thrpt    4   242061.367 ±  2599.519   ops/s
JsonMaskerBenchmark.jsonMaskerString:gc.alloc.rate.norm                 unicode       false         1kb                     0.1  thrpt    4    10136.001 ±     0.001    B/op
JsonMaskerBenchmark.jsonMaskerString                                    unicode        true         1kb                     0.1  thrpt    4   229563.933 ±  2180.684   ops/s
JsonMaskerBenchmark.jsonMaskerString:gc.alloc.rate.norm                 unicode        true         1kb                     0.1  thrpt    4    10312.001 ±     0.001    B/op
ValueMaskerBenchmark.maskWithRawValueFunction                           unicode         N/A         1kb                     0.1  thrpt    4   684465.685 ± 35260.523   ops/s
ValueMaskerBenchmark.maskWithRawValueFunction:gc.alloc.rate.norm        unicode         N/A         1kb                     0.1  thrpt    4     1592.000 ±     0.001    B/op
ValueMaskerBenchmark.maskWithStatic                                     unicode         N/A         1kb                     0.1  thrpt    4   771495.896 ± 31878.538   ops/s
ValueMaskerBenchmark.maskWithStatic:gc.alloc.rate.norm                  unicode         N/A         1kb                     0.1  thrpt    4     1232.000 ±     0.001    B/op
ValueMaskerBenchmark.maskWithTextValueFunction                          unicode         N/A         1kb                     0.1  thrpt    4   603895.151 ± 15617.026   ops/s
ValueMaskerBenchmark.maskWithTextValueFunction:gc.alloc.rate.norm       unicode         N/A         1kb                     0.1  thrpt    4     1880.000 ±     0.001    B/op

Benchmark results (master, `7bc09ac`)

Benchmark                                                          (characters)  (jsonPath)  (jsonSize)  (maskedKeyProbability)   Mode  Cnt        Score        Error   Units
BaselineBenchmark.countBytes                                            unicode         N/A         1kb                     0.1  thrpt    4  2595251.864 ± 116163.735   ops/s
BaselineBenchmark.countBytes:gc.alloc.rate.norm                         unicode         N/A         1kb                     0.1  thrpt    4       ≈ 10⁻⁴                 B/op
BaselineBenchmark.jacksonParseAndMask                                   unicode         N/A         1kb                     0.1  thrpt    4    29792.200 ±    881.748   ops/s
BaselineBenchmark.jacksonParseAndMask:gc.alloc.rate.norm                unicode         N/A         1kb                     0.1  thrpt    4    65184.006 ±      0.004    B/op
BaselineBenchmark.jacksonParseOnly                                      unicode         N/A         1kb                     0.1  thrpt    4    51625.098 ±   2503.166   ops/s
BaselineBenchmark.jacksonParseOnly:gc.alloc.rate.norm                   unicode         N/A         1kb                     0.1  thrpt    4    24352.004 ±      0.001    B/op
BaselineBenchmark.regexReplace                                          unicode         N/A         1kb                     0.1  thrpt    4     5111.192 ±     74.164   ops/s
BaselineBenchmark.regexReplace:gc.alloc.rate.norm                       unicode         N/A         1kb                     0.1  thrpt    4    61656.035 ±      0.009    B/op
JsonMaskerBenchmark.jsonMaskerBytes                                     unicode       false         1kb                     0.1  thrpt    4   447551.817 ±  10121.186   ops/s
JsonMaskerBenchmark.jsonMaskerBytes:gc.alloc.rate.norm                  unicode       false         1kb                     0.1  thrpt    4     2232.000 ±      0.001    B/op
JsonMaskerBenchmark.jsonMaskerBytes                                     unicode        true         1kb                     0.1  thrpt    4   676273.054 ±   9697.752   ops/s
JsonMaskerBenchmark.jsonMaskerBytes:gc.alloc.rate.norm                  unicode        true         1kb                     0.1  thrpt    4     1280.000 ±      0.001    B/op
JsonMaskerBenchmark.jsonMaskerString                                    unicode       false         1kb                     0.1  thrpt    4   241992.606 ±  10176.033   ops/s
JsonMaskerBenchmark.jsonMaskerString:gc.alloc.rate.norm                 unicode       false         1kb                     0.1  thrpt    4    10136.001 ±      0.001    B/op
JsonMaskerBenchmark.jsonMaskerString                                    unicode        true         1kb                     0.1  thrpt    4   260060.687 ±   6548.330   ops/s
JsonMaskerBenchmark.jsonMaskerString:gc.alloc.rate.norm                 unicode        true         1kb                     0.1  thrpt    4    10312.001 ±      0.001    B/op
ValueMaskerBenchmark.maskWithRawValueFunction                           unicode         N/A         1kb                     0.1  thrpt    4   662720.760 ±  11135.643   ops/s
ValueMaskerBenchmark.maskWithRawValueFunction:gc.alloc.rate.norm        unicode         N/A         1kb                     0.1  thrpt    4     1592.000 ±      0.001    B/op
ValueMaskerBenchmark.maskWithStatic                                     unicode         N/A         1kb                     0.1  thrpt    4   743197.178 ±  33867.771   ops/s
ValueMaskerBenchmark.maskWithStatic:gc.alloc.rate.norm                  unicode         N/A         1kb                     0.1  thrpt    4     1232.000 ±      0.001    B/op
ValueMaskerBenchmark.maskWithTextValueFunction                          unicode         N/A         1kb                     0.1  thrpt    4   587460.446 ±  27832.634   ops/s
ValueMaskerBenchmark.maskWithTextValueFunction:gc.alloc.rate.norm       unicode         N/A         1kb                     0.1  thrpt    4     1880.000 ±      0.001    B/op

sonarcloud · 2024-05-04T17:56:15Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

gavlyukovskiy · 2024-05-04T18:37:00Z

@Breus looks like that badge is going to get a little bit more impressive 😄

donavdey

The main problem is what to do with n_* test cases?

I'd suggest to clearly claim in the docs that the masker is not a RFC 8259 compliant JSON parser, so the output for non-valid JSONs may be unpredictable.

Breus

LGTM. Let's address the invalid input handling (? vs throwing exception) in a different PR. Not too big of a deal

gavlyukovskiy force-pushed the json-test-suite branch from c4a582d to db8feaf Compare May 4, 2024 17:30

gavlyukovskiy requested review from Breus and donavdey May 4, 2024 17:33

Add tests from JSONTestSuite

02e873c

gavlyukovskiy force-pushed the json-test-suite branch from db8feaf to 02e873c Compare May 4, 2024 17:44

donavdey approved these changes May 17, 2024

View reviewed changes

Breus approved these changes May 24, 2024

View reviewed changes

Breus merged commit ec96ef1 into master May 24, 2024
4 checks passed

Breus deleted the json-test-suite branch May 24, 2024 13:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add tests from JSONTestSuite #140

Add tests from JSONTestSuite #140

gavlyukovskiy commented May 4, 2024 •

edited

github-actions bot commented May 4, 2024 •

edited

sonarcloud bot commented May 4, 2024

gavlyukovskiy commented May 4, 2024

donavdey left a comment

Breus left a comment

Add tests from JSONTestSuite #140

Add tests from JSONTestSuite #140

Conversation

gavlyukovskiy commented May 4, 2024 • edited

mustPassSuiteWithNoopMaskerShouldBeEquivalentToJackson

mayPassSuiteWithNoopMaskerShouldNotFail

mustFailSuiteWithNoopMaskerShouldOnlyFailWithInvalidJsonException

shouldMaskAllTestCasesPredictably

mustPassSuiteWithNoopTextFunction

Fixes

Hard decisions time!

github-actions bot commented May 4, 2024 • edited

Benchmark results (pull-request, 02e873c)

Benchmark results (master, 7bc09ac)

sonarcloud bot commented May 4, 2024

Quality Gate passed

gavlyukovskiy commented May 4, 2024

donavdey left a comment

Choose a reason for hiding this comment

Breus left a comment

Choose a reason for hiding this comment

gavlyukovskiy commented May 4, 2024 •

edited

github-actions bot commented May 4, 2024 •

edited

Benchmark results (pull-request, `02e873c`)

Benchmark results (master, `7bc09ac`)