[Security Solution] [Attack discovery] Overrides default Attack discovery timeouts #183575

andrew-goldstein · 2024-05-15T23:08:33Z

[Security Solution] [Attack discovery] Overrides default Attack discovery timeouts

Summary

This PR fixes an issue where Attack discovery requests may be retried when responses from the LLM take longer than two minutes.

In LangSmith, the retry looks like the following before screenshot:

Before

Above: Before the fix, a retry, shown in LangSmith, for an LLM call > 2 minutes

After the fix, a single pair of runs is observed in LangSmith for LLM calls > 2 minutes:

After

Above: After the fix, a single pair in LangSmith, for an LLM call > 2 minutes

Details

This PR overrides the following default timeouts:

The attack discovery route's idleSocket timeout in x-pack/plugins/elastic_assistant/server/routes/attack_discovery/post_attack_discovery.ts
The connector timeout (also in x-pack/plugins/elastic_assistant/server/routes/attack_discovery/post_attack_discovery.ts)
The chain timeout in x-pack/plugins/security_solution/server/assistant/tools/attack_discovery/attack_discovery_tool.ts

with the following values:

const ROUTE_HANDLER_TIMEOUT = 10 * 60 * 1000; // 10 * 60 seconds = 10 minutes
const LANG_CHAIN_TIMEOUT = ROUTE_HANDLER_TIMEOUT - 10_000; // 9 minutes 50 seconds
const CONNECTOR_TIMEOUT = LANG_CHAIN_TIMEOUT - 10_000; // 9 minutes 40 seconds
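
The three values above are nested: each inner layer is given 10 seconds less than the layer that wraps it, presumably so that the innermost call gives up first and its error can propagate before the outer layer cuts the request off. A minimal sketch that reproduces the arithmetic and checks the ordering follows. The constants are taken from this description; `formatTimeout` and `timeoutsAreNested` are hypothetical helpers added for illustration and are not part of the PR.

```typescript
// Values copied from the PR description (milliseconds).
const ROUTE_HANDLER_TIMEOUT = 10 * 60 * 1000; // 600_000 ms
const LANG_CHAIN_TIMEOUT = ROUTE_HANDLER_TIMEOUT - 10_000; // 590_000 ms
const CONNECTOR_TIMEOUT = LANG_CHAIN_TIMEOUT - 10_000; // 580_000 ms

// Hypothetical helper: renders a millisecond duration as "<minutes>m <seconds>s".
function formatTimeout(ms: number): string {
  const totalSeconds = Math.floor(ms / 1000);
  const minutes = Math.floor(totalSeconds / 60);
  const seconds = totalSeconds % 60;
  return `${minutes}m ${seconds}s`;
}

// Hypothetical helper: true when connector < chain < route handler,
// i.e. each inner timeout expires before the one that wraps it.
function timeoutsAreNested(): boolean {
  return (
    CONNECTOR_TIMEOUT < LANG_CHAIN_TIMEOUT &&
    LANG_CHAIN_TIMEOUT < ROUTE_HANDLER_TIMEOUT
  );
}
```

Calling `formatTimeout` on each constant gives a quick way to confirm that the comments in the snippet above match the computed values.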

Desk testing

Verify there are ~ 100 open alerts in the last 24 hours in your testing environment
Navigate to Security > Attack discovery
Select an Azure / OpenAI connector
Click Generate

Expected results

LangSmith displays a single pair of LLMChain and AttackDiscovery runs when the LLM responds (with the final answer) in less than 2 minutes
LangSmith displays a single pair of LLMChain and AttackDiscovery runs when the LLM takes longer than two minutes to respond (with the final answer), as illustrated by the before / after screenshots in the description above
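
The expected results above amount to a simple check: exactly one LLMChain run and exactly one AttackDiscovery run per generation, regardless of how long the LLM takes. A small sketch of that check is shown below. It assumes the run names have been copied by hand from the trace view into a plain string array; `hasSinglePair` is a hypothetical helper, and the input shape is not a LangSmith API response.

```typescript
// Hypothetical helper: `runNames` is assumed to be a hand-collected list of
// run names from one trace, not data returned by any LangSmith API.
function hasSinglePair(runNames: string[]): boolean {
  const count = (name: string): number =>
    runNames.filter((n) => n === name).length;
  // One pair means exactly one run of each kind; a retry would add a second pair.
  return count('LLMChain') === 1 && count('AttackDiscovery') === 1;
}
```

For example, `hasSinglePair(['AttackDiscovery', 'LLMChain'])` passes, while a trace that lists each name twice (the retry case from the before screenshot) does not.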


elasticmachine · 2024-05-15T23:08:35Z

Pinging @elastic/security-solution (Team: SecuritySolution)

YulNaumenko

LGTM!

kibana-ci · 2024-05-16T00:18:04Z

💚 Build Succeeded

Buildkite Build
Commit: 17ae288

Metrics [docs]

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

id	before	after	diff
`elasticAssistant`	31	32	+1

Unknown metric groups

API count

id	before	after	diff
`elasticAssistant`	45	46	+1

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

cc @andrew-goldstein

…very timeouts (elastic#183575) (cherry picked from commit 1c96c31)

kibanamachine · 2024-05-16T00:31:50Z

💚 All backports created successfully

Status	Branch	Result
✅	8.14

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

…k discovery timeouts (#183575) (#183581)

# Backport

This will backport the following commits from `main` to `8.14`:

- [[Security Solution] [Attack discovery] Overrides default Attack discovery timeouts (#183575)](#183575)

### Questions ?

Please refer to the [Backport tool documentation](https://github.com/sqren/backport)

Co-authored-by: Andrew Macri <andrew.macri@elastic.co>

andrew-goldstein self-assigned this May 15, 2024

andrew-goldstein requested review from a team as code owners May 15, 2024 23:08

YulNaumenko approved these changes May 15, 2024


andrew-goldstein merged commit 1c96c31 into elastic:main May 16, 2024
52 checks passed

andrew-goldstein deleted the increase_attack_discovery_timeout branch May 16, 2024 00:26

kibanamachine mentioned this pull request May 16, 2024

[8.14] [Security Solution] [Attack discovery] Overrides default Attack discovery timeouts (#183575) #183581

Merged
