Technical blogs on topics of Kubernetes, GitOps, CI/CD and SRE in general. Created with ❤️ using Markdown format.
-
Updated
May 29, 2024 - HTML
Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.
Technical blogs on topics of Kubernetes, GitOps, CI/CD and SRE in general. Created with ❤️ using Markdown format.
Web UI for Jaeger
A Chaos Engineering Platform for Kubernetes.
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd.io/a4Zu_sH4TZGeih-xCimi3Q
A chaos engineering platform for supporting the complete fault drill lifecycle.
This project is a proof-of-concept - which is a rewrite of my old college project - to demonstrate my skills as a DevOps Engineer before anything else after earning the Microsoft Certified: DevOps Engineer Expert certification
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
Devopness - Painless essential DevOps to everyone
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
A curated list of Site Reliability and Production Engineering Tools
[FSE'24] BARO: Robust Root Cause Analysis for Microservices via Multivariate Bayesian Online Change Point Detection
👨💻 blog with github pages | About SRE
Site Reliability Engineering Munich Meetup Page
A repo holding the Kubernetes deployment manifests for otel-collector
A collection of my resources for studying for SWE/SRE interviews!
Site Reliability Engineer Quickstarts! 🦾
A role-playing game for incident management training
A simple blog stack to demonstrate Google's Site Reliability Engineering principles.
Welcome To The World of DevOps. An ongoing & curated collection of awesome software, libraries, learning tutorials, tools and resources and cool stuff about DevOps.
I'm a Professional Mistake Avoider, a.k.a. Strategic Advisor.