Skip to content
This repository has been archived by the owner on Mar 27, 2021. It is now read-only.

Find/Create BigTable Load Simulator #711

Open
sming opened this issue Oct 24, 2020 · 1 comment
Open

Find/Create BigTable Load Simulator #711

sming opened this issue Oct 24, 2020 · 1 comment

Comments

@sming
Copy link
Contributor

sming commented Oct 24, 2020

From #bigtable:

Hi BigTable folks,

I want to analyze how Heroic behaves when BigTable is under severe load. As I understand it, Heroic experiences this as very long response times or time-outs and that's it.

• is that last statement correct or is there another way that BigTable exposes degraded performance to clients (e.g. 5xx errors)?
• what tools (e.g. the BigTable emulator) are available to perform such analysis?
• do you have any tips for responding to degraded BigTable performance?
...
TL;DR: seemingly a couple of hot tablet servers utterly cripple the entire cluster's IO throughput. Cluster CPU also hits the floor, except for the 2 hot tablet servers.

@sming sming created this issue from a note in Observability Kanban (To do) Oct 24, 2020
@sming sming moved this from To do to Inbox in Observability Kanban Oct 24, 2020
@sming sming moved this from Inbox to To do in Observability Kanban Oct 24, 2020
@sming sming moved this from To do to In progress in Observability Kanban Oct 25, 2020
@sming sming self-assigned this Oct 25, 2020
@samfadrigalan samfadrigalan moved this from In progress to To do in Observability Kanban Nov 6, 2020
@sming
Copy link
Contributor Author

sming commented Feb 11, 2021

Q1 2021 Update

So whilst there were some suggestions from #bigtable, nothing usable was surfaced. Then after some time, Adam Steele (adamsteele@google.com) presented his fork of the BT emulator, which supports jittered delayed responses which suits our needs perfectly.

This has been trialled (code is currently in a patch file) and seems to work as advertised, which is great news.

The Heroic API Request Timeouts & Retries Gdoc contains the work that should see this emulator used in unit test/integration test's.

@sming sming moved this from To do to Done in Observability Kanban Feb 11, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
Development

No branches or pull requests

1 participant