Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Karpenter 0.35.2 Grafana dashboards do not work properly #5934

Open
youwalther65 opened this issue Mar 26, 2024 · 0 comments · May be fixed by #5935
Open

Karpenter 0.35.2 Grafana dashboards do not work properly #5934

youwalther65 opened this issue Mar 26, 2024 · 0 comments · May be fixed by #5935
Assignees
Labels
bug Something isn't working good-first-issue Good for newcomers

Comments

@youwalther65
Copy link

youwalther65 commented Mar 26, 2024

Description

Observed Behavior:

Karpenter Grafana dashboard "Karpenter Capacity" contains metric "karpenter_deprovisioning_actions_performed" which is not available anymore according to Metrics.

Karpenter Grafana dashboard "Karpenter Capacity" uses controller-runtime based metrics "controller_runtime_reconcile_time_seconds_bucket" and "controller_runtime_reconcile_total" which are used by other K8s components like "aws-load-balancer-controller", so a job label to filter for job="karpenter" is necessary to avoid ingesting controllers of these K8s components as well.

Expected Behavior:
Usage of metrics "karpenter_disruption_nodes_disrupted_total" and "karpenter_disruption_eligible_nodes" in Capacity dashboard will provide the needed insides into Karpenter action, consolidation_type and method.

Using job="karpenter" in Performance dashboard will show the correct Karpenter related controllers

Reproduction Steps (Please include YAML):
Original Grafana dashboards installed as described in Monitoring with Grafana (optional)

I have submitted PR #5935 for proposed changes.

Versions: 0.35.2

  • Chart Version: karpenter-0.35.2
  • Kubernetes Version (kubectl version): 1.29.2
  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment
@youwalther65 youwalther65 added bug Something isn't working needs-triage Issues that need to be triaged labels Mar 26, 2024
@tzneal tzneal added good-first-issue Good for newcomers and removed needs-triage Issues that need to be triaged labels Mar 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working good-first-issue Good for newcomers
Projects
None yet
2 participants