
feat: Create performance tests [WEB-1458] #7741

Merged
merged 26 commits into main from web-1458 on Aug 30, 2023

Conversation

julian-determined-ai (Contributor) commented Aug 25, 2023

Description

The goal of this PR is twofold: introduce load tests to benchmark the current performance of our system, and put a structure in place that makes it easy to add future tests. This description covers the current testing setup, future possibilities and enhancements, and a few notes about k6 that may matter in future updates.

For reference, there is a prototype branch, web-1458-prototype, that contains a more in-depth setup for the load tests.

Current Setup

Running the test

  1. Install k6 and the junit2html Python package.
    (within performance/determined)
  2. npm install to install dependencies.
  3. npm start to build the api_performance_tests.js test file.
  4. k6 run -e DET_MASTER=http://localhost:8080 build/api_performance_tests.js to run the file built in (3); the DET_MASTER env var sets the URL for the test cluster.
  5. A junit.xml file will be generated containing a test report.
  6. junit2html junit.xml to create an HTML report from the generated XML (the full sequence is consolidated in the sketch below).
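
For convenience, the same sequence as shell commands (a sketch assembled from the steps above; the cluster URL is just the local default):

    # within performance/determined
    npm install
    npm start                                  # builds build/api_performance_tests.js
    k6 run -e DET_MASTER=http://localhost:8080 build/api_performance_tests.js
    junit2html junit.xml                       # converts the generated junit.xml into an HTML report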

The results of the test are:

  1. Console output of metrics collected (example shown below).
  2. A junit.xml file with pass/fail test results as well as http request duration statistics for each test.
  3. An HTML file created from (2) with similar information.

To note: I had originally planned on implementing the test schema in web-1458-prototype, but after discussions with @ashtonG we decided that it was a bit excessive for the ultimate goal of this ticket.

Example Results

Console output example:
(screenshot)

jUnit output example (html version):
(screenshot)

Test Logic

The current testing setup benchmarks system performance with a single "average load" test that simulates a ramping number of queries to the master. The test simulates 25 users querying the master endpoint: it ramps up to 25 users over the course of 5 minutes, sustains that request rate for 10 minutes, then ramps down to 0 users over a period of 5 minutes, for a total runtime of 20 minutes. This is meant to simulate an average load on the system. The figure of 25 users was based on the number of users that Recursion has in their system, which is about 20.
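
For reference, a minimal sketch of that stage configuration using k6's standard options export (not necessarily the exact code in this PR):

    import { Options } from 'k6/options';

    export const options: Options = {
        stages: [
            { duration: '5m', target: 25 },   // ramp up to 25 virtual users over 5 minutes
            { duration: '10m', target: 25 },  // hold 25 virtual users for 10 minutes
            { duration: '5m', target: 0 },    // ramp back down to 0 over 5 minutes
        ],
    };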

k6 allows setting thresholds for a given test, and two are currently set. First, I added a request-failure threshold that will abort and fail the test if more than 1 percent of the HTTP requests fail. The idea is that if that many requests are failing, we likely want to investigate the cause and should not allow the test to pass.

Second, I added a threshold on request duration: it expects more than 95% of all HTTP requests to complete in less than 1 second, which is the overall performance goal for our system. The test suite is currently set up so that tests will not fail if this threshold is crossed; however, it gives us the ability to easily view this metric in test reports.
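
A sketch of those two thresholds, assuming k6's standard threshold syntax (the abortOnFail flag here is one way to express the abort behavior; the actual configuration in the PR may differ):

    import { Options } from 'k6/options';

    export const options: Options = {
        thresholds: {
            // Abort and fail the run if more than 1% of HTTP requests fail.
            http_req_failed: [{ threshold: 'rate<0.01', abortOnFail: true }],
            // The overall performance goal: 95% of requests complete in under 1 second.
            http_req_duration: ['p(95)<1000'],
        },
    };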

Sample Extension

Additional tests can be added using the test construct created in this PR. An example extra test to query the telemetry endpoint is:

    // k6 built-ins used below:
    import http from 'k6/http';
    import { check } from 'k6';
    // `test`, `generateEndpointUrl`, and `clusterURL` come from the harness added in this PR.

    test(
        'visit telemetry endpoint',
        () => {
            const res = http.get(generateEndpointUrl('/api/v1/master/telemetry', clusterURL));
            check(res, { '200 response': (r) => r.status === 200 });
        }
    );

Example output after adding the test (the testing stages were made shorter for example purposes):

Console output:
(screenshot)

jUnit output example (html version):
(screenshot)

Test Plan

Commentary

Future Possibilities, Enhancements, and Important Notes

Reference: web-1458-prototype

Test Structure

Load testing often implements different types of test scenarios in order to track system performance under different situations. The most common scenarios are smoke, average-load, stress, soak, spike, and breakpoint tests.

An example setup in k6 would look similar to this:

import { Scenario } from 'k6/options';

const scenarios: { [name: string]: Scenario } = {
    smoke: {
        executor: 'shared-iterations',
        vus: 5,
        iterations: 5,
    },
    average_load: {
        executor: 'ramping-vus',
        stages: [
            { duration: '10s', target: 50 },
            { duration: '60s', target: 50 },
            { duration: '10s', target: 0 },
        ],
        startTime: '5s',
    },
    stress: {
        executor: 'ramping-vus',
        stages: [
            { duration: '10s', target: 175 },
            { duration: '20s', target: 175 },
            { duration: '10s', target: 0 },
        ],
        startTime: '90s',
    },
    soak: {
        executor: 'ramping-vus',
        stages: [
            { duration: '5s', target: 50 },
            { duration: '1m', target: 50 },
            { duration: '1m', target: 0 },
        ],
        startTime: '135s',
    },
    spike: {
        executor: 'ramping-vus',
        stages: [
            { duration: '1m', target: 500 },
            { duration: '15s', target: 0 },
        ],
        startTime: '265s',
    },
};

Example console output:
(screenshot)

Example jUnit output:
(screenshot)

In the above example, virtual users are spun up and down over a specified duration of time to simulate variations in web traffic. You can reference the k6 scenario documentation to learn more about how this configuration works.

In the future we will likely want to move to this sort of test scheme so that we can gather a holistic view of our system performance under different load types. Additionally, we will likely want to implement much longer running tests for some scenarios.

User Initialization

During implementation planning with @stoksc we discussed that we will likely want to be able to track unique users per test. For example, we might want to track performance for RBAC users with differing permissions. k6 has a few utilities for using unique data within tests; however, there are some caveats. The largest is that k6 does not allow making HTTP requests during the test init phase, meaning we cannot implement logic such as:

  1. Before the start of the test suite, query the cluster for the current set of users.
  2. Assign each virtual user to a Determined user from (1).

There are alternative workflows we could implement, but I found it worth calling this out; one possibility is sketched below.
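
One possible workaround (a sketch, not part of this PR): load a pre-generated list of users from a local file in the init phase, since open() and SharedArray are allowed there even though HTTP requests are not. The file name and user shape here are hypothetical.

    import { SharedArray } from 'k6/data';

    // Read once in the init phase and share across virtual users (hypothetical users.json).
    const users = new SharedArray('users', () => JSON.parse(open('./users.json')));

    export default function (): void {
        // __VU is k6's 1-based virtual-user id; use it to give each VU its own user.
        const user = users[(__VU - 1) % users.length];
        // ...log in as `user` and run the per-user test logic here...
    }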

The current setup does not implement any sort of login or unique user configuration.

API Bindings and TypeScript

The k6-recommended k6-template-typescript project was used to generate a TypeScript project for our test suite. The current setup does not use our generated TypeScript API bindings, but in the future we may want to; this was one of the main considerations for making this a TypeScript project. I wanted to add this note since it came up in discussions with @loksonarius.

Result Reporting

There are a few quirks around metric reporting that were found during this implementation that are worth calling out.

Limitations around reporting results within k6

k6 gives the ability to tag and group tests in various ways: tests can easily be tagged via custom tags, endpoint, groups, etc. It also lets you render a custom report via a handleSummary function defined within the test suite. However, all information and details regarding tags are scrubbed from the data that the handleSummary function receives. You can read more in this GitHub issue grafana/k6#1321 as well as this thread about the lack of tag data: https://community.grafana.com/t/show-tag-data-in-output-or-summary-json-without-threshold/99320
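
For context, a minimal sketch of what a handleSummary hook looks like (not the exact implementation in this PR):

    // handleSummary receives the aggregated end-of-test data and returns a map of
    // destinations (file paths, or 'stdout') to report content.
    export function handleSummary(data: any): { [dest: string]: string } {
        return {
            // The aggregated metrics (e.g. data.metrics.http_req_duration.values) carry no per-tag breakdown.
            'summary.json': JSON.stringify(data, null, 2),
            stdout: JSON.stringify(data.metrics.http_req_duration.values, null, 2) + '\n',
        };
    }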

For example, no matter how you tag any metrics, even custom metrics, the data available for writing the report will look as follows:

"http_req_duration":{
         "type":"trend",
         "contains":"time",
         "values":{
            "p(95)":2.2873,
            "avg":1.0244,
            "min":0.353,
            "med":0.5935,
            "max":2.308,
            "p(90)":2.2666
         }
      },

As you can see, there are no mentions of any tags. The workaround is to add thresholds for each tag that you want to follow; this causes k6 to show more information about the tag in the output. A code example can be seen here: https://github.com/determined-ai/determined/compare/web-1458-prototype#diff-31e2b17ee608e49eacf18f0b0b17988d36964f621b588600e701bfce8466649aR69, and you can see in the example outputs above how the tag information becomes available in the test output.
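
For illustration, a sketch of that workaround using k6's tag-filtered sub-metric threshold syntax (the tag name and value here mirror the example test above, but are otherwise illustrative):

    import { Options } from 'k6/options';

    export const options: Options = {
        thresholds: {
            // A threshold on a tagged sub-metric makes that tag's statistics appear in the summary output.
            'http_req_duration{test:visit master endpoint}': ['p(95)<1000'],
        },
    };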

Future Results Reporting in k6

Thankfully, all information regarding tags is kept within the individual data points created during testing. This per-point data is what would be sent to Grafana, for example, when we implement external result viewing, so we will still be able to build custom dashboards and charts once we decide to send results to a time-series DB.

Additionally, the individual data points mentioned above can be written to a JSON or CSV file. In the future we could write custom file-parsing logic to build a more in-depth report from the data in the output file.
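
For example, writing the per-point data out uses k6's standard --out flag (file name illustrative):

    k6 run -e DET_MASTER=http://localhost:8080 --out json=results.json build/api_performance_tests.js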

For reference, here is an example point from the file mentioned above; you will notice that all tag data is present.

{
   "metric":"http_reqs",
   "type":"Point",
   "data":{
      "time":"2023-08-25T13:12:56.940746-05:00",
      "value":1,
      "tags":{
         "expected_response":"true",
         "group":"",
         "method":"GET",
         "name":"http://localhost:8080/api/v1/master",
         "proto":"HTTP/1.1",
         "scenario":"smoke",
         "status":"200",
         "test":"visit master endpoint",
         "url":"http://localhost:8080/api/v1/master"
      }
   }
}

Checklist

  • Changes have been manually QA'd
  • User-facing API changes need the "User-facing API Change" label.
  • Release notes should be added as a separate file under docs/release-notes/.
    See Release Note for details.
  • Licenses should be included for new code which was copied and/or modified from any external code.

Ticket

WEB-1458

cla-bot added the cla-signed label Aug 25, 2023

netlify bot commented Aug 25, 2023

Deploy Preview for determined-ui canceled.

🔨 Latest commit: ce42d07
🔍 Latest deploy log: https://app.netlify.com/sites/determined-ui/deploys/64ef8b8773685f000853dd83

Comment on lines +9 to +22
"@babel/core": "7.13.16",
"@babel/plugin-proposal-class-properties": "7.13.0",
"@babel/plugin-proposal-object-rest-spread": "7.13.8",
"@babel/preset-env": "7.13.15",
"@babel/preset-typescript": "7.13.0",
"@types/k6": "^0.45.3",
"@types/webpack": "5.28.0",
"babel-loader": "8.2.2",
"clean-webpack-plugin": "4.0.0-alpha.0",
"copy-webpack-plugin": "^9.0.1",
"typescript": "4.2.4",
"webpack": "5.76.1",
"webpack-cli": "5.0.1",
"webpack-glob-entries": "^1.0.1"
Contributor:

given that the webui uses vite/esbuild and this has us using webpack/babel, consider tagging some work to migrate this to vite using library mode: https://vitejs.dev/guide/build.html#library-mode

Contributor:

🙏

@hkang1 (Contributor) left a comment:

This looks great! I was able to run through all the steps and see the results and JUNIT export. Left some comments.

How did you generate the JUNIT html pages?

julian-determined-ai (Contributor Author) replied:

> This looks great! I was able to run through all the steps and see the results and JUNIT export. Left some comments.
>
> How did you generate the JUNIT html pages?

Another thing to note: k6 does not really have native support for outputting to different report formats. They do recommend some [helpful libraries](https://github.com/grafana/awesome-k6), but the bulk of them are small repos maintained by a single individual, which I don't think we would want to depend on ourselves. Similar to your comment here: #7741 (comment)

The HTML was actually created using a Python library, junit2html.

I support adding a CI step to create the HTML report from the XML and adding that as an artifact in CircleCI. The report that is generated off the shelf is decent; it does allow us to easily view failures/passes, but I can imagine we might want to make better reports in the future. How does adding an extra CI step to create the HTML file sound, @hkang1?


hkang1 commented Aug 30, 2023

That sounds great, having it as an artifact would be sweet!

@hkang1 (Contributor) left a comment:

Thanks for the updates!

julian-determined-ai merged commit f8caa0e into main Aug 30, 2023
76 of 87 checks passed
julian-determined-ai deleted the web-1458 branch August 30, 2023 21:59
dannysauer added this to the 0.25.1 milestone Feb 6, 2024