SRE

Site reliability engineering (SRE) is a set of principles and practices that applies aspects of software engineering to infrastructure and operations problems. Its main goal is to create scalable and highly reliable software systems. SRE is closely related to DevOps, a set of practices that combines software development and IT operations, and has also been described as a specific implementation of DevOps.

Here are 654 public repositories matching this topic...

sharjeelsayed / learn.sharjeelsayed.com

Excoriate / daggerx

nobl9 / sloctl

nobl9 / terraform-provider-nobl9

runatlantis / atlantis

getsavvyinc / savvy-cli

robusta-dev / holmesgpt

cloudprober / cloudprober

kaytu-io / kaytu

christiangalsterer / pg-promise-prometheus-exporter

christiangalsterer / kafkajs-prometheus-exporter

christiangalsterer / node-postgres-prometheus-exporter

antonputra / tutorials

pabpereza / pabpereza

enola-dev / enola

rundeck / rundeck

k8sgpt-ai / k8sgpt

GoogleCloudPlatform / reliable-app-platforms

unskript / Awesome-CloudOps-Automation

Build5Nines / terraform-quickstart-templates