
[Tracking/Action] Repair: how broken Kubernetes workloads lead to higher emissions #365

Open
1 of 4 tasks
xamebax opened this issue Apr 3, 2024 · 2 comments

Comments


xamebax commented Apr 3, 2024

(ticket is part of sustainable k8s practices project work)

Description

What is the carbon cost of leaving broken workloads to run on Kubernetes? What is the untapped potential of making sure workloads repair themselves better, or that broken workloads aren't allowed to run for a long time? Is there a good "Kubernetes hygiene" around repairing workloads that can lead to lowering a cluster's carbon cost?

Outcome

A recommendation in our working document that helps the reader decide how to repair their workloads, with an effort estimate (small, medium, large), plus optional extra reading material for readers who want more context.

To-Do

  • add relevant labels to this issue when possible,
  • research if this is a worthy recommendation,
  • if yes, write a recommendation,
  • share it for review, implement feedback.

Comments

  • Only public cloud is in scope here.
  • I'm gonna work on writing this recommendation. 🙂

@mkorbi I'd love your input on this issue description; do you feel it captures the fullness of what we talked about?

(cc @JacobValdemar)


xamebax commented May 14, 2024

Just an update that I did start working on this and should hopefully have a draft by the end of the week.


mkorbi commented May 21, 2024

It's relevant to help the reader identify broken workloads, and we have to differentiate here.
There is sprawl, i.e. workloads that got "lost" and that nobody takes care of anymore, and there are idle workloads that "misbehave".

I think for both there is a fairly easy approach: compare network traffic against resource consumption.

  • No traffic but continuous "high" consumption: something is wrong.

There are also other cases where, for example, old programming-language runtimes or a wrong configuration demand too many resources.
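The "no traffic but high consumption" heuristic above could be sketched roughly as follows. This is a minimal illustration, not part of the recommendation itself: the thresholds, metric names, and `flag_suspect_workloads` helper are all hypothetical, and in practice the per-workload numbers would come from a metrics backend such as Prometheus (e.g. container network and CPU usage rates).

```python
# Illustrative thresholds -- these are assumptions, not recommendations.
TRAFFIC_FLOOR_BPS = 1_000   # below this, treat the workload as receiving "no traffic"
CPU_CEILING_CORES = 0.5     # above this, treat consumption as "high"

def flag_suspect_workloads(samples):
    """Return names of workloads with ~no network traffic but high CPU use.

    `samples` is a list of dicts like:
      {"name": str, "net_bytes_per_s": float, "cpu_cores": float}
    """
    return [
        s["name"]
        for s in samples
        if s["net_bytes_per_s"] < TRAFFIC_FLOOR_BPS
        and s["cpu_cores"] > CPU_CEILING_CORES
    ]

if __name__ == "__main__":
    samples = [
        # Busy and serving traffic: fine.
        {"name": "api", "net_bytes_per_s": 50_000.0, "cpu_cores": 0.8},
        # Almost no traffic but burning CPU (e.g. a hot crash loop): suspect.
        {"name": "zombie", "net_bytes_per_s": 120.0, "cpu_cores": 1.9},
        # Idle but cheap: fine for this particular heuristic.
        {"name": "cron", "net_bytes_per_s": 0.0, "cpu_cores": 0.01},
    ]
    print(flag_suspect_workloads(samples))  # ['zombie']
```

Note the third case: a workload with no traffic *and* no consumption is not caught by this heuristic; that is more the "sprawl" category, which needs a different signal (e.g. ownership or last-deploy metadata).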

Projects
Status: In Progress