DevOps&SRE Library
18.3K subscribers
456 photos
5 videos
2 files
4.93K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Download Telegram
SLOs and why you should care

Ever wondered what all the fuss over Service Level Objectives (SLOs) is about? Let’s find out.

https://engineering.solarisbank.com/slos-and-why-you-should-care-136f80bf686e
kubermetrics

Kubermetrics is an open-source dev tool that provides Kubernetes cluster monitoring as well as data visualization in a simple and easy to understand user interface. Kubermetrics intergrates both the Prometheus and Grafana Dashboards on one page! Allowing for custominzable dashboards and alerts.

https://github.com/oslabs-beta/kubermetrics
What is expected in the SRE role? We analyzed 30 job postings to find out

https://spike.sh/blog/sre-role-2021-analysed-30-job-postings
Making Kubernetes Operations Easy with kubectl Plugins

https://martinheinz.dev/blog/58
peirates

Peirates, a Kubernetes penetration tool, enables an attacker to escalate privilege and pivot through a Kubernetes cluster. It automates known techniques to steal and collect service accounts, obtain further code execution, and gain control of the cluster.

https://github.com/inguardians/peirates
youki

youki is an implementation of the OCI runtime-spec in Rust, similar to runc.

https://github.com/containers/youki
Reverse Proxy, HTTP Keep-Alive Timeout, and sporadic HTTP 502s

https://iximiuz.com/en/posts/reverse-proxy-http-keep-alive-and-502s
automated-cloud-advisor

Automated Cloud Advisor is an extensible tool that aims at facilitating cost optimization in AWS, by collecting data for resources that are under utilized. In addition, this is a great learning tool for new DevOps/Cloud engineers that want to start automating things in AWS.

https://github.com/disneystreaming/automated-cloud-advisor
The Speed of Time

How long does it take to read the time? How would you time time? These strange questions came to the fore back in 2014 when Netflix was switching services from CentOS Linux to Ubuntu, and I helped debug several weird performance issues including one I'll describe here.

https://www.brendangregg.com/blog/2021-09-26/the-speed-of-time.html
pgmetrics

pgmetrics is an open-source, zero-dependency, single-binary tool that can collect 350+ metrics from a running PostgreSQL server and display it in easy-to-read text format or export it as JSON and CSV for scripting.

https://pgmetrics.io
10 Trends in real-world container use

Updated October 2021.

1. Nearly 90 percent of Kubernetes users leverage cloud-managed services
2. Amazon ECS users are shifting to AWS Fargate
3. The average number of pods per organization has doubled
4. Host density is 3 times higher on Kubernetes than on Amazon ECS
5. Pod auto-scaling is becoming more popular
6. Organizations are deploying more stateful workloads on containers
7. Organizations running container environments create more monitors
8. Organizations are starting to replace Docker with containerd as their preferred runtime for Kubernetes
9. OpenShift adoption is growing rapidly
10. NGINX, Redis, and Postgres are the top three container images

https://www.datadoghq.com/container-report
Deploy without credentials with GitHub Actions and OIDC

https://blog.alexellis.io/deploy-without-credentials-using-oidc-and-github-actions
CloudGraph

An instant GraphQL API to query your cloud infrastructure and configuration so that you can solve a host of complex security, compliance, and governance challenges 10x faster.

https://github.com/cloudgraphdev/cli