Remove the JS runtime from threshold calculations #1443

sniku · 2020-05-11T13:53:20Z

This issue aims to bring 80% of the benefits of #1441 while doing only 20% of the work.

The ideal future state of thresholds is defined in #1441 .
This issue addresses only the most important problem of thresholds - removing the js runtime. This issue doesn't introduce the new concepts from the original specification.

Motivation

This issue addresses the following issues (two first issues from the original spec).

Thresholds in k6 are executed in JavaScript runtime while thresholds in k6 cloud are parsed and executed in Python. This doesn't work for non-trivial cases.
Thresholds can't be validated.

Quick overview of the new thresholds

import { http_req_duration } from "k6/metrics/http";
import { Counter } from "k6/metrics";

let successfulLogins = new Counter("successful_logins");

export let options = {
  thresholds: {  // thresholds are still defined in an object {}
    'http_req_duration{staticAssets:true}': [
      {threshold: "mean < 500"}  // the threshold string is not JS. 
    ],
    'successful_logins': ["count > 1"]
  }
}

Yes, it's the old format, but the threshold expression is no longer executed in a JS runtime. Instead, it's parsed and executed in go.

Parsing

(copied from the original spec).

To provide consistency between local and cloud execution, threshold parsing should happen only in k6.
When the test is executed in the cloud, k6 should parse and provide the agreed-upon format to the k6-cloud backend.

The proposed format is

let thresholds = [{
  metric_name: "http_req_duration|others",
  metric_type: "trend|gauge|counter|rate",
  metric_value_contains: "time|default",  // isTime=true
  metric_initial_value: 0, // 
  filters: [
    {"left-hand": "status", "operator": ">", "right-hand": "200", "right-hand-type": "int"}, // not supported by this proposal.
    {"left-hand": "staticAssets", "operator": "==", "right-hand": "yes", "right-hand-type": "numeric"},
  ],
  conditions: [
    {"left-hand": "mean", "operator": "<", "right-hand": "200", "right-hand-type": "string"},
  ],
  name: "http_req_duration(status>200, staticAssets=yes)", // auto-generate when not provided. 
}];

Validation

Thresholds should be validated and abort the execution in case of validation errors.

`metric_name` validation.

The metric name must be one of:

Built-in k6 metric (http_req_duration, etc)
user-defined metric (successful_logins, etc).

When k6 detects a threshold on a metric that's not defined, the test should be aborted.
The init context must be executed to find user-defined metrics. Metric defined in branches are no longer supported

if (1==2){  // invalid
   let successfulLogins = new Counter("successful_logins");
}

`filters` validation

Example of filtered-threshold

export let options = {
  thresholds: {  
    'http_req_duration{staticAssets:true}': ["mean < 500"] 
  }
}

Should be parsed as:

  filters: [
    {
    "left-hand": "staticAssets",
    "operator": "==",  // only equality operator supported at the moment
    "right-hand": "true", 
    "right-hand-type": "string" // only string supported at the moment
    },
  ]

Spaces between left-hand, operator and right-hand are allowed but should be stripped away by the parser.

left-hand should follow the ECMAScript variable-name restrictions - https://mathiasbynens.be/notes/javascript-identifiers
or more strict restrictions (left for the implementer to decide).

right-hand-type can only be a string at the moment.

`conditions` array validation

Each threshold can have multiple conditions. Conditions can't be combined. Each condition must be a separate string.

Invalid

export let options = {
  thresholds: {  
    'http_req_duration': ["p95 < 500 && p99 < 500"] 
  }
}

Valid

export let options = {
  thresholds: {  
    'http_req_duration': ["p95 < 500', 'p99 < 500"] 
  }
}

"conditions" array has a similar restriction as "filters" and should be parsed in a similar way.

 [{
    "left-hand": "mean", // `aggregation method` name - restricted per metric type
    "operator": "<",    // see operators below
    "right-hand": "200", 
    "right-hand-type": 
    "numeric"
    }]

`aggregation method` validation

Invalid aggregation method should raise an exception and abort the test execution.

The aggregation methods available for Metric types are specified below.

Counter

count (number of times the .add("whatever") has been called )
sum (sum of numeric values passed to .add(5))
rate (count/s)

Gauge

last
min
max
value (undocumented alias for last)

Rate

rate

Trend

min
mean
avg
max
p(N)

Operators

>
>=
<
<=
==
===
!=

The text was updated successfully, but these errors were encountered:

na-- · 2020-05-11T20:54:18Z

Some duplicate/related issues that will have to be combed through and closed/unified with this one (or #1441): #961, #1136, #1312, #1305, #1313, #1321, and #680 (or #870)

na-- · 2020-06-30T16:00:46Z

#1526 demonstrates that we should probably emit a warning for thresholds that don't correspond to a system or custom metric

na-- · 2021-01-26T09:20:38Z

Not sure if this should be here or in #1441, or even in #1313, but in the next version of the threshold submetric definition syntax, we should likely ditch the colon for equality and use == instead. Besides us wanting other expressions (== makes more sense when we also want to have other comparisons like <, >, etc.), it will make filtering by group much better, see #948 (comment)

na-- · 2021-01-26T09:56:24Z

Another benefit for having the thresholds be defined as an array instead of a map, besides distinguishing between the old and new format for specifying them, is that when we have them ordered, we can more easily implement the custom exit code per threshold: #680 (comment)

oleiade · 2021-11-11T16:17:33Z

For the reference, and to support my own understanding, here's a little BNF definition of the proposed threshold expression format I tried to extract from the existing syntax, as well as the proposed one:

assertion           -> aggregation_method whitespace* operator whitespace* float
aggregation_method  -> trend | rate | gauge | counter
counter             -> "count" | "sum" | "rate"
gauge               -> "last" | "min" | "max" | "value"
rate                -> "rate"
trend               -> "min" | "mean" | "avg" | "max" | percentile
percentile          -> "p(" float ")"
operator            -> ">" | ">=" | "<=" | "<" | "==" | "===" | "!="
float               -> digit+ (. digit+)?
digit               -> "0" | "1" | "2" | "3" | "4" | "5" | "6" | "7" | "8" | "9"
whitespace          -> space | tab
tab                 -> "\t"
space               -> " "

N.B I followed the BNF definition syntax presented in Crafting Interpreters, as I find it both more readable/understandable and easy to manipulate (considering we're not intending to produce an actual parser out of it).

oleiade · 2021-12-13T16:07:42Z

Here's a little update on the context and scope of this issue posted by @na-- in #2251 for future reference:

We don't know how the k6 thresholds of the future will look like... #1441 is one suggestion for one aspect of them, there are various improvement ideas in #1136 and some of its linked issues and in this comment (probably others as well), #1831 will almost certainly also affect how we do things...

So, we don't know how thresholds 2.0 will look like, but it's almost certainly going to be different than the current thresholds. Our current syntax is just too limited to support a lot of the things we want. Though, even after we have better thresholds and deprecate the current syntax, we probably won't ever stop supporting it... It's such a major feature that all old k6 scripts should continue to work. We'll just silently (or with a warning) support the old format, internally translating it to the new feature (since its capabilities will be a superset of the current ones), but we probably won't ever develop it or add new features to it.

Again, we don't know how parsing of the future thresholds will look like. We might go with our own custom parser, and then a fancy fully-featured one will be great. But we might decide to adopt PromQL or some other widely-used format for them and use a ready library for parsing it.

oleiade · 2022-01-17T12:53:44Z

We had initially planned for the refactor to ship with v0.36.0. However, we uncovered certain issues with it internally, which pushed us to momentarily revert the change and delay its merge into master. We now plan to ship this, with #2330 by v0.37.0.

oleiade · 2022-03-10T08:26:39Z

The reframed scope as discussed in the comments is implemented as of #2400. The initial scope is still something we'd like to achieve as a longer-term goal. Because the issue is already so old, and there have been numerous iterations on this, we prefer to close this issue in favor of a new one as soon as the initial scope gets prioritized again.

sniku added bug refactor and removed bug labels May 11, 2020

na-- mentioned this issue Jun 30, 2020

Exit code is zero if thresholds are not met #1526

Closed

na-- mentioned this issue Aug 13, 2020

Emit an errors metric and have a default error rate script abort threshold #877

Open

na-- added the evaluation needed proposal needs to be validated or tested before fully implementing it in k6 label Sep 11, 2020

na-- mentioned this issue Sep 11, 2020

Add a Counter.rps sink #1628

Closed

na-- mentioned this issue Oct 22, 2020

Improve errors and warnings #1681

Open

na-- mentioned this issue Jan 26, 2021

Proposal to modify the group tag syntax. #948

Closed

This was referenced Jan 26, 2021

Add explicit tracking and ignoring of metrics and sub-metrics #1321

Open

Threshold enhancements - custom exit codes per threshold #680

Open

This was referenced Jan 29, 2021

Proposal for adding a new standard http_req_failures metric #1828

Closed

Common metrics registry #1832

Closed

na-- mentioned this issue Mar 11, 2021

Dynamic counters #1900

Open

sniku mentioned this issue Apr 8, 2021

Emit the value of the threshold-metric at the end of the test. #1956

Open

na-- changed the title ~~New thresholds: remove the js runtime~~ Remove the JS runtime from threshold calculations Nov 10, 2021

na-- assigned oleiade Nov 10, 2021

oleiade mentioned this issue Nov 19, 2021

Fix/1443 remove the JS runtime from threshold calculations #2251

Merged

This was referenced Jan 10, 2022

Remove the Format method of stats.Sink #2320

Open

Parse and validate thresholds before starting the execution #2330

Closed

Revert "Merge pull request #2251 from grafana/fix/1443_remove_the_js_… #2335

Merged

oleiade mentioned this issue Jan 27, 2022

Parse and validate threshold during init #2356

Closed

na-- added this to the v0.37.0 milestone Mar 6, 2022

oleiade closed this as completed Mar 10, 2022

oleiade mentioned this issue Aug 1, 2022

Forward thresholds execution results when running k6 -o cloud scripts.js #2626

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove the JS runtime from threshold calculations #1443

Remove the JS runtime from threshold calculations #1443

sniku commented May 11, 2020

na-- commented May 11, 2020 •

edited

na-- commented Jun 30, 2020

na-- commented Jan 26, 2021

na-- commented Jan 26, 2021

oleiade commented Nov 11, 2021 •

edited

oleiade commented Dec 13, 2021

oleiade commented Jan 17, 2022

oleiade commented Mar 10, 2022

Remove the JS runtime from threshold calculations #1443

Remove the JS runtime from threshold calculations #1443

Comments

sniku commented May 11, 2020

Motivation

Quick overview of the new thresholds

Parsing

Validation

metric_name validation.

filters validation

conditions array validation

aggregation method validation

na-- commented May 11, 2020 • edited

na-- commented Jun 30, 2020

na-- commented Jan 26, 2021

na-- commented Jan 26, 2021

oleiade commented Nov 11, 2021 • edited

oleiade commented Dec 13, 2021

oleiade commented Jan 17, 2022

oleiade commented Mar 10, 2022

`metric_name` validation.

`filters` validation

`conditions` array validation

`aggregation method` validation

na-- commented May 11, 2020 •

edited

oleiade commented Nov 11, 2021 •

edited