SRE
Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.
Here are 653 public repositories matching this topic...
The Kaytu CLI improves the efficiency of cloud workloads by analyzing historical usage and providing tailored recommendations, such as changing instance sizes. This ensures you only pay for the resources you actually need without compromising stability.
Updated May 29, 2024 - Go
A Prometheus exporter exposing metrics for the official MongoDB Node.js driver.
Updated May 29, 2024 - TypeScript
Active monitoring software that detects failures before your customers do.
Updated May 29, 2024 - Go
Terraform Pull Request Automation
Updated May 29, 2024 - Go
🚀🚀 A high-performance, high-concurrency SSH tool written in Go. It is 10 times faster than Ansible. If you need much more performance and better ease of use, you will love it.
Updated May 29, 2024 - Go
Create, share, and run runbooks from your terminal.
Updated May 29, 2024 - Go
Enable self-service operations: give specific users access to your existing tools, services, and scripts.
Updated May 29, 2024 - Groovy
Terraform provider for Nobl9
Updated May 29, 2024 - Go
Cloud-ops automation runbooks that are ready to use. Build your own automations using the hundreds of drag-and-drop actions included in the repository. Built on Jupyter Notebooks, our automation platform jumpstarts your SRE runbook creation. 😎 Published by the unSkript community.
Updated May 29, 2024 - Jupyter Notebook
A collection of Git utilities, extra Git scripts, tutorials, and other useful articles.
Updated May 28, 2024 - Shell
Curated self-study guide for computer science, DevOps, SRE, and sysadmin topics.
Updated May 28, 2024 - HTML
A Prometheus exporter exposing metrics for KafkaJS.
Updated May 29, 2024 - TypeScript
🐒 🔥 Datadog Failure Injection System for Kubernetes
Updated May 28, 2024 - C
Web UI for Jaeger
Updated May 28, 2024 - JavaScript
A curated list of amazingly awesome open-source sysadmin resources.
Updated May 28, 2024
DevOps Tutorials
Updated May 28, 2024 - HCL
113 followers