Trending repositories for topic sre
The Open Source DevOps Assistant - solve problems twice as fast with an AI teammate
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
A curated list of amazingly awesome open-source sysadmin resources.
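⭐ [Open-source book] An in-depth look at kernel networking, Kubernetes, ServiceMesh, containers, and other cloud-native technologies. A practice-tested guide to DevOps and SRE. If you find an error, please open an issue.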
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
A Frida-based tool that traces usage of the JNI API in Android apps.
Kaytu's AI platform boosts cloud efficiency by analyzing historical usage and delivering intelligent recommendations—such as optimizing instance sizes—that maintain reliability. Pay for what you need,...
A checklist for anyone practicing Site Reliability Engineering
Enable Self-Service Operations: Give specific users access to your existing tools, services, and scripts
[Moved to cloudprober/cloudprober] An active monitoring software to detect failures before your customers do.
At LinkedIn, we use this curriculum to onboard our entry-level talent into the SRE role.
A curated list of Site Reliability and Production Engineering resources.
The principles that help to deploy safely to the production environment. If you like it:
Cloud-ops automation runbooks that are ready to use. Build your own automations using the hundreds of drag and drop actions included in the repository. Built on Jupyter Notebooks, our automation plat...
Active monitoring software that detects failures before your customers do.
The DevOps/SRE community is for people who want to learn or explore DevOps with the help of experienced professionals. Opportunities are open to share.
NixOS Guide. Learn all about the immutable Nix Operating System and the declarative Nix Expression Language.
A curated list of awesome DevOps platforms, tools, practices and resources
CDN Up and Running - Building a CDN from Scratch to Learn about CDN, Nginx, Lua, Prometheus, Grafana, Load balancing, and Containers.
StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 160 i...
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, ...
A curated list of Platform Engineering Tools
A blazing fast tool for building data pipelines: read, process and output events. Our community: https://t.me/file_d_community
Telegram channels & groups about DevOps, SRE, and Platform Engineering.
A Kubernetes controller that modifies the CPU and/or memory resources of containers depending on whether they're starting up, according to the startup/post-startup settings you supply.
The Linux commands and basic concepts you need to perform essential tasks on a server as a DevOps engineer, SRE, or sysadmin. I'll do my best to explain everything as simply as possible.
Highly scalable and available reference architecture for Terragrunt.
Automatic SRE Superpowers within your Kubernetes cluster
Terraform modules for rapidly building production-grade Kubernetes clusters following SRE practices.
A curated list of Site Reliability and Production Engineering Tools
Layerform helps engineers create reusable environment stacks using plain .tf files. Ideal for multiple "staging" environments.
🪱 Kermoo offers resilience testing with Process Delays, Back-end Failures, CPU Simulations, and Memory Leaks. Boost your system reliability effortlessly.
TerraDagger is a Go package for managing your infrastructure-as-code through containers.
SLOs, error windows, and alerts are complicated. Here is an attempt to make them easy.
(Chinese only) Everything I know: DevOps & CloudNative, Linux, Embedded, Homelab, Music, Blockchain, AI, etc.
🌳 A sustainable Terraform Package which creates Account & IAM resources on AWS