Trending repositories for topic site-reliability-engineering
A curated list of Site Reliability and Production Engineering resources.
A curated list of Chaos Engineering resources.
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation
On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation
A curated list of Chaos Engineering resources.
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....
A curated list of Site Reliability and Production Engineering resources.
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
A curated list of Site Reliability and Production Engineering resources.
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....
On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation
A curated list of Chaos Engineering resources.
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
A curated list of Site Reliability and Production Engineering Tools
On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation
A curated list of Site Reliability and Production Engineering Tools
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....
A curated list of Site Reliability and Production Engineering resources.
A curated list of Chaos Engineering resources.
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
A curated list of Site Reliability and Production Engineering resources.
On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
A curated list of Chaos Engineering resources.
A curated list of Site Reliability and Production Engineering Tools
Open-source AI copilot that lets you chat with your observability data and code 🧙♂️
Chaos testing, network emulation, and stress testing tool for containers
This repository includes resources which are more than sufficient to prepare for google interview if you are applying for a software engineer position or a site reliability engineer position
On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation
Open-source AI copilot that lets you chat with your observability data and code 🧙♂️
A curated list of Site Reliability and Production Engineering Tools
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....
A curated list of Site Reliability and Production Engineering resources.
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
A curated list of Chaos Engineering resources.
This repository includes resources which are more than sufficient to prepare for google interview if you are applying for a software engineer position or a site reliability engineer position
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Chaos testing, network emulation, and stress testing tool for containers
On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation
Open-source AI copilot that lets you chat with your observability data and code 🧙♂️
[FSE'24 - 🏆 Best Artifact Award] BARO: Robust Root Cause Analysis for Microservice Systems.
A curated list of Site Reliability and Production Engineering resources.
On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
A curated list of Chaos Engineering resources.
Open-source AI copilot that lets you chat with your observability data and code 🧙♂️
A curated list of Site Reliability and Production Engineering Tools
Chaos testing, network emulation, and stress testing tool for containers
A chaos engineering platform for supporting the complete fault drill lifecycle.
This repository includes resources which are more than sufficient to prepare for google interview if you are applying for a software engineer position or a site reliability engineer position
[FSE'24 - 🏆 Best Artifact Award] BARO: Robust Root Cause Analysis for Microservice Systems.
OpenShift Guide. Learn about the Red Hat OpenShift Container Platform, Data Science, Code Ready Containers, Podman, Buildah, and Kubernetes.
On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation
A chaos engineering platform for supporting the complete fault drill lifecycle.
A curated list of Site Reliability and Production Engineering Tools
Welcome To The World of DevOps. An ongoing & curated collection of awesome software, libraries, learning tutorials, tools and resources and cool stuff about DevOps.
OpenShift Guide. Learn about the Red Hat OpenShift Container Platform, Data Science, Code Ready Containers, Podman, Buildah, and Kubernetes.
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....
This repository helps performance testers and engineers who wants to dive into DevOps and SRE world.
This repository includes resources which are more than sufficient to prepare for google interview if you are applying for a software engineer position or a site reliability engineer position
A curated list of Site Reliability and Production Engineering resources.
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
Chaos testing, network emulation, and stress testing tool for containers
A curated list of Chaos Engineering resources.
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Calculate how much downtime should be permitted in your Service Level Agreement or Objective