site-reliability-engineering

Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

site-reliability-engineering

Here are 89 public repositories matching this topic...

league3236 / begindevops

jkpl / sre-env

Knighton-Dev / SharpenUp

abrunner94 / maia

manuelcoppotelli / manuelcoppotelli.github.io

jacob-hudson / ideal-enigma

digitalascension / azure-tm-monitor

at15 / sre-handbook

skyzyx / engineering-for-site-reliability

GreggSchofield / terraformed-pagerduty

shantoroy / site-reliability-engineering-101

apolzek / apolzek.github.io

dfwsre / reliabilityengineering.io

luismendes070 / googlesre

ari-hacks / kubernetes-chaos-sandbox

lukebrady / resourced

githubfoam / gremlin-travisci

mauricioabreu / sre-hands-on

Mregojos / Roadmap-Data-ML-AI-Cloud-DevOps-SRE

figwasp / figwasp