54 results found Sort:

8.1k
22.6k
apache-2.0
777
A powerful flow control component enabling reliability, resilience and monitoring for microservices. (面向云原生微服务的高可用流控防护组件)
Created 2018-04-04
831 commits to 1.8 branch, last one 7 months ago
1.6k
12.3k
cc0-1.0
503
A curated list of Site Reliability and Production Engineering resources.
Created 2016-04-12
639 commits to master branch, last one 2 years ago
803
9.3k
cc0-1.0
233
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Created 2021-02-14
381 commits to main branch, last one about a month ago
671
7.2k
mit
54
The most reliable AI agent framework that supports MCP.
Created 2024-05-26
1,174 commits to master branch, last one 3 days ago
Compilation of public failure/horror stories related to Kubernetes
This repository has been archived (exclude archived)
Created 2019-01-19
92 commits to master branch, last one 4 years ago
393
2.0k
gpl-3.0
45
It's just fascinating. How is modern software designed? 🤔 Some design-level considerations for scalability, maintainability eventual consistency, availability & reliability. 👨‍💻 Interview Prep. 👨‍...
Created 2020-03-20
683 commits to master branch, last one about a year ago
1.1k
2.0k
apache-2.0
113
Hands on labs and code to help you learn, measure, and build using architectural best practices.
Created 2018-05-11
3,198 commits to main branch, last one 5 days ago
191
1.9k
apache-2.0
44
Chaos Engineering Toolkit & Orchestration for Developers
Created 2017-09-24
445 commits to master branch, last one 11 months ago
A free book about developing secure and robust systems software.
Created 2022-03-25
187 commits to main branch, last one about a year ago
13
1.1k
mit
6
Production-grade retries for Python
Created 2022-09-30
323 commits to main branch, last one 19 days ago
Sample implementations for cloud design patterns found in the Azure Architecture Center.
Created 2016-02-05
413 commits to main branch, last one 6 days ago
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
Created 2023-03-20
156 commits to main branch, last one about a month ago
66
553
apache-2.0
10
A hosted disposable email telegram bot; Extremely privacy friendly; Proudly hosted for community.
Created 2020-05-07
172 commits to master branch, last one 2 years ago
30
394
mit
2.2k
An always-on framework that performs end-to-end functional network testing for reachability, latency, and packet loss
Created 2016-11-16
28 commits to master branch, last one 5 years ago
78
381
apache-2.0
14
An open source Valkey client library that supports Valkey, and Redis open source 6.2, 7.0 and 7.2. Valkey GLIDE is designed for reliability, optimized performance, and high-availability, for Valkey an...
Created 2022-07-06
3,132 commits to main branch, last one a day ago
37
366
other
4
An Open-Source Collection of 230+ Flash Cards to Help You Succeed in Your System Design Interview and More 💯
Created 2022-05-08
26 commits to master branch, last one 5 months ago
106
309
apache-2.0
16
Chaos and resiliency testing tool for Kubernetes with a focus on improving performance under failure conditions. A CNCF sandbox project.
Created 2020-04-19
561 commits to main branch, last one 6 days ago
5
308
mpl-2.0
7
WIP: Next-gen network protocol for reliable data transfer in lossy environments. Outperforms TCP/UDP in high packet loss scenarios.
Created 2024-09-16
9 commits to main branch, last one 6 months ago
Easily run integration tests for your backends
Created 2023-02-16
157 commits to main branch, last one about a year ago
95
255
other
19
Uncertainty treatment library
Created 2015-08-14
6,905 commits to master branch, last one 7 days ago
📚 🐣 软件实践文集。主题不限,思考讨论有趣有料就好,包含如 系统的模型分析/量化分析、开源漫游者指南、软件可靠性设计实践、平台产品的逻辑与执行… 🥤
Created 2014-12-16
98 commits to master branch, last one 2 years ago
🛡️ A module for improving the reliability and fault-tolerance of your NestJS applications
Created 2023-04-15
855 commits to master branch, last one 3 days ago
57
212
other
32
PHP HI-REL SOCKET TCP/UDP/ICMP/Serial .高可靠性PHP通信&控制框架SOCKET-TCP/UDP/ICMP/硬件Serial-RS232/RS422/RS485 AND MORE!
Created 2016-09-06
121 commits to master branch, last one 2 years ago
Notes on Site Reliability Engineering. Leave a 🌟 if you found this useful!
Created 2019-10-09
19 commits to master branch, last one 5 years ago
71
189
unknown
21
A component-based OS
Created 2012-01-09
3,162 commits to main branch, last one 6 months ago
A role-playing game for incident management training
Created 2018-06-02
59 commits to master branch, last one 3 years ago
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
Created 2024-05-19
41 commits to main branch, last one about a month ago
29
154
apache-2.0
13
a general library for fatigue and reliability
Created 2019-11-21
1,408 commits to develop branch, last one 6 days ago
Fast computation of Krippendorff's alpha agreement measure in Python.
Created 2017-09-28
135 commits to main branch, last one 2 months ago