Amazon Devops Role Experience
I recently applied for Devops role in Amazon. I'm not looking for switch but I'm targeting MAANG in coming years so I applied for this position to get at least experience of hiring process and surprisingly my resume got shortlisted and I received an assessment link.
There were user experience, work style and Devops related questions. I did good only in the last section but fortunately I received call from HR after 4 days from assessment. 🤞
She took all the basic details and asked me how good I'm at coding. I showed my stupidity here by being brutally honest. I replied that " I am mostly working on kubernetes and AI ML part in my company so In coding I would rate myself 6/10 "
And here we go.... Instant Regret ! 🥹
I never heard back from HR.
But now that my urge for these companies has already increased, I want to give another shot after few months.
I'm sharing this experience just to know how I can prepare myself and what skills I should develop
to stand out from crowd of experienced people. ✨
Happy Learning !!!
https://redd.it/1lnez95
@r_devops
I recently applied for Devops role in Amazon. I'm not looking for switch but I'm targeting MAANG in coming years so I applied for this position to get at least experience of hiring process and surprisingly my resume got shortlisted and I received an assessment link.
There were user experience, work style and Devops related questions. I did good only in the last section but fortunately I received call from HR after 4 days from assessment. 🤞
She took all the basic details and asked me how good I'm at coding. I showed my stupidity here by being brutally honest. I replied that " I am mostly working on kubernetes and AI ML part in my company so In coding I would rate myself 6/10 "
And here we go.... Instant Regret ! 🥹
I never heard back from HR.
But now that my urge for these companies has already increased, I want to give another shot after few months.
I'm sharing this experience just to know how I can prepare myself and what skills I should develop
to stand out from crowd of experienced people. ✨
Happy Learning !!!
https://redd.it/1lnez95
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
How do you handle trusted software delivery at a global scale?
Hey 👋
Right now I’m working on something pretty exciting (and a bit nerve-wracking, not gonna lie):
We have a global customer base, teams spread across Australia, the US, and Europe, and I need to build an infrastructure that ensures they can quickly and securely fetch container images from a registry that’s geographically close to them.
But speed isn’t enough.
I also need to guarantee that what they pull is exactly what I built, no tampering, no surprises, just trust.
So this isn’t just about performance, but it’s about authenticity and integrity.
When a customer deploys my software, I want them to know:
1. It came from us
2. It hasn’t been touched
3. It’s the version they expected
Still brainstorming the best way to approach this (edge replication? verified signatures? something more elegant?), but would love to hear how others tackled similar challenges.
How do you handle trusted software delivery at a global scale?
https://redd.it/1ln9tqb
@r_devops
Hey 👋
Right now I’m working on something pretty exciting (and a bit nerve-wracking, not gonna lie):
We have a global customer base, teams spread across Australia, the US, and Europe, and I need to build an infrastructure that ensures they can quickly and securely fetch container images from a registry that’s geographically close to them.
But speed isn’t enough.
I also need to guarantee that what they pull is exactly what I built, no tampering, no surprises, just trust.
So this isn’t just about performance, but it’s about authenticity and integrity.
When a customer deploys my software, I want them to know:
1. It came from us
2. It hasn’t been touched
3. It’s the version they expected
Still brainstorming the best way to approach this (edge replication? verified signatures? something more elegant?), but would love to hear how others tackled similar challenges.
How do you handle trusted software delivery at a global scale?
https://redd.it/1ln9tqb
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
>8YoE, majority of which at AWS Infra
So here's the thing. I quit from AWS after being abused at work. They keep contacting me to apply at their job postings. Of course, that's never going to happen.
I'm looking at the job market and almost all the postings are for seniors. I match most of the 5+ years of experience, though, I don't match on experience with AWS per se (I worked on internal infrastructure in AWS not on the cloud side - not to say I didn't use S3, DynamoDB, IAM, Cloudformation, SNS/SQS).
I'm at the moment working on DSA after having learned a bit of Kubernetes, Terraform, Docker and OpenAPI3.
Planning to start system design on educative.io this week after wrapping up DSA (arrays, linked lists, sorting). Leaving out BFS, DFS, BST, hash maps, DP - is this a good idea?
I'll get more AWS hands on experience with the labs I'll be doing with educative.io
What do you folks recommend since I don't have experience with Kubernetes/EKS in production and, similarly, using the other tools such as Terraform, Jenkins, Ansible, GitHub Actions and Docker in production?
I'm aiming for a job after 4 years and a half of being unemployed.
https://redd.it/1lni4ug
@r_devops
So here's the thing. I quit from AWS after being abused at work. They keep contacting me to apply at their job postings. Of course, that's never going to happen.
I'm looking at the job market and almost all the postings are for seniors. I match most of the 5+ years of experience, though, I don't match on experience with AWS per se (I worked on internal infrastructure in AWS not on the cloud side - not to say I didn't use S3, DynamoDB, IAM, Cloudformation, SNS/SQS).
I'm at the moment working on DSA after having learned a bit of Kubernetes, Terraform, Docker and OpenAPI3.
Planning to start system design on educative.io this week after wrapping up DSA (arrays, linked lists, sorting). Leaving out BFS, DFS, BST, hash maps, DP - is this a good idea?
I'll get more AWS hands on experience with the labs I'll be doing with educative.io
What do you folks recommend since I don't have experience with Kubernetes/EKS in production and, similarly, using the other tools such as Terraform, Jenkins, Ansible, GitHub Actions and Docker in production?
I'm aiming for a job after 4 years and a half of being unemployed.
https://redd.it/1lni4ug
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Has platform engineering quietly become the “new backend”?
Lately I’ve noticed more companies shifting engineering responsibilities toward platform teams — managing infra, CI/CD, observability, even spinning up internal dev tools and platforms-as-a-product.
Meanwhile, traditional backend roles seem to be getting squeezed between frontend-heavy full-stack positions and infrastructure-heavy platform roles.
Is this just me, or are platform teams slowly absorbing more of what used to be backend territory?
Curious if others are seeing the same trend — and how backend devs or SREs are adapting.
https://redd.it/1lnjsxs
@r_devops
Lately I’ve noticed more companies shifting engineering responsibilities toward platform teams — managing infra, CI/CD, observability, even spinning up internal dev tools and platforms-as-a-product.
Meanwhile, traditional backend roles seem to be getting squeezed between frontend-heavy full-stack positions and infrastructure-heavy platform roles.
Is this just me, or are platform teams slowly absorbing more of what used to be backend territory?
Curious if others are seeing the same trend — and how backend devs or SREs are adapting.
https://redd.it/1lnjsxs
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
The company I work for has made an internal custom Jenkins
Ok, here’s the thing, I work for an IT consultancy here in Spain, and some of the executives had the idea to create a custom Jenkins setup where agents are installed on isolated client nodes (they only have outbound access to a Jenkins job endpoint).
The catch is that the agents send system info or info related to isolated apps to a Jenkins job URL, and Jenkins then tells them to run certain scripts based on rules and input data (for example, if an email with a specific subject arrives and a user is logged in, don’t kick them out).
The thing is, they don’t want to go public with this but I keep telling my boss it’s a great Jenkins mod.
Is this due to corporate strategy? Or just plain ignorance?
https://redd.it/1lnk2xz
@r_devops
Ok, here’s the thing, I work for an IT consultancy here in Spain, and some of the executives had the idea to create a custom Jenkins setup where agents are installed on isolated client nodes (they only have outbound access to a Jenkins job endpoint).
The catch is that the agents send system info or info related to isolated apps to a Jenkins job URL, and Jenkins then tells them to run certain scripts based on rules and input data (for example, if an email with a specific subject arrives and a user is logged in, don’t kick them out).
The thing is, they don’t want to go public with this but I keep telling my boss it’s a great Jenkins mod.
Is this due to corporate strategy? Or just plain ignorance?
https://redd.it/1lnk2xz
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Just graduated – Need project ideas for my resume
Hey! I just finished my engineering degree and I’m looking to build 1–2 solid projects to help land my first job.
I’m thinking of starting with a Website Uptime Monitor. Do you think it’s a good idea for showcasing skills? Any other project suggestions that would stand out to employers?
Thanks!
https://redd.it/1lnkzav
@r_devops
Hey! I just finished my engineering degree and I’m looking to build 1–2 solid projects to help land my first job.
I’m thinking of starting with a Website Uptime Monitor. Do you think it’s a good idea for showcasing skills? Any other project suggestions that would stand out to employers?
Thanks!
https://redd.it/1lnkzav
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
App Support
Hello, i am building a new app, i am a product person and i have a software engineering supporting me. He is mostly familiar with AWS but i am open to any Cloud based platform. Could you please suggest a good stack for an app to be scalable but not massively costly at first ( being a start up) ideally on AWS or any other Cloud provider. Thanks
https://redd.it/1lnoaf4
@r_devops
Hello, i am building a new app, i am a product person and i have a software engineering supporting me. He is mostly familiar with AWS but i am open to any Cloud based platform. Could you please suggest a good stack for an app to be scalable but not massively costly at first ( being a start up) ideally on AWS or any other Cloud provider. Thanks
https://redd.it/1lnoaf4
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Doing labs locally or AWS ?
Hi all,
I'm working on my skills on devops, doing git, CI/CD, ansible etc
Do you use AWS or doing it locally on a local VM ?
https://redd.it/1lnptvi
@r_devops
Hi all,
I'm working on my skills on devops, doing git, CI/CD, ansible etc
Do you use AWS or doing it locally on a local VM ?
https://redd.it/1lnptvi
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
PSA: Crossplane API version migrations can completely brick your cluster (and how I survived it)
Just spent 4 hours recovering from what started as an "innocent" Lambda Permission commit. Thought this might save someone else's Thursday.
What happened: Someone committed a Crossplane resource using `lambda.aws.upbound.io/v1beta1`, but our cluster expected
The death spiral:
Error: conversion webhook failed: cannot convert from spoke version "v1beta1" to hub version "v1beta2":
value at field path loggingConfig must be any, not "mapstringinterface {}"
This error completely locked us out of ALL Lambda Function resources:
`kubectl get functions` → webhook error
Raw API calls → still blocked
ArgoCD stuck in permanent Unknown state
Standard troubleshooting that DIDN'T work:
Disabling validating webhooks
Hard refresh ArgoCD
Patching resources directly
Restarting provider pods
What finally worked (nuclear option):
bash
# Delete the entire CRD - this removes ALL lambda functions
kubectl delete crd functions.lambda.aws.upbound.io --force --grace-period=0
# Wait for Crossplane to recreate the CRD
kubectl get pods -n crossplane-system
# Update your manifests to v1beta2 and fix loggingConfig format:
# OLD: loggingConfig: { applicationLogLevel: INFO }
# NEW: loggingConfig: { applicationLogLevel: INFO }
# Then sync everything back
Key lesson: When Crossplane conversion webhooks fail, they can create a catch-22 where you can't access resources to fix them, but you can't fix them without accessing them. Sometimes nuking the CRD is the only way out.
Anyone else hit this webhook deadlock? What was your escape route?
Edit: For the full play-by-play of this disaster, I wrote it up here if you're into technical war stories.
https://redd.it/1lnor51
@r_devops
Just spent 4 hours recovering from what started as an "innocent" Lambda Permission commit. Thought this might save someone else's Thursday.
What happened: Someone committed a Crossplane resource using `lambda.aws.upbound.io/v1beta1`, but our cluster expected
v1beta2. The conversion webhook failed because the loggingConfig field format changed from a map to an array between versions.The death spiral:
Error: conversion webhook failed: cannot convert from spoke version "v1beta1" to hub version "v1beta2":
value at field path loggingConfig must be any, not "mapstringinterface {}"
This error completely locked us out of ALL Lambda Function resources:
`kubectl get functions` → webhook error
kubectl delete functions → webhook errorRaw API calls → still blocked
ArgoCD stuck in permanent Unknown state
Standard troubleshooting that DIDN'T work:
Disabling validating webhooks
Hard refresh ArgoCD
Patching resources directly
Restarting provider pods
What finally worked (nuclear option):
bash
# Delete the entire CRD - this removes ALL lambda functions
kubectl delete crd functions.lambda.aws.upbound.io --force --grace-period=0
# Wait for Crossplane to recreate the CRD
kubectl get pods -n crossplane-system
# Update your manifests to v1beta2 and fix loggingConfig format:
# OLD: loggingConfig: { applicationLogLevel: INFO }
# NEW: loggingConfig: { applicationLogLevel: INFO }
# Then sync everything back
Key lesson: When Crossplane conversion webhooks fail, they can create a catch-22 where you can't access resources to fix them, but you can't fix them without accessing them. Sometimes nuking the CRD is the only way out.
Anyone else hit this webhook deadlock? What was your escape route?
Edit: For the full play-by-play of this disaster, I wrote it up here if you're into technical war stories.
https://redd.it/1lnor51
@r_devops
Medium
How I Survived a Crossplane Conversion Webhook Apocalypse (And You Can Too)
Or: When one innocent Lambda Permission nearly killed our entire Kubernetes cluster
Can you cut observability bill by 50% with an eBPF-first stack?
Datadog costs. **A lot.**
Companies are paying more for telemetry than some production workloads. I’ve been researching how SaaS teams are quietly cutting 30–70% of their observability costs by replacing per-host agents with kernel-native tooling.
Companies like [EX.CO](https://EX.CO) and open-source adopters using [SigNoz ](https://signoz.io/)are moving away from Datadog + CloudWatch and adopting **eBPF-first architectures** that are leaner, faster, and significantly cheaper.
# Stack shift
**Replace:**
• Datadog APM
• CloudWatch Logs
• CloudWatch Metrics
**With:**
• Cilium + Hubble (network flows)
• Pixie + Parca (profiling/traces)
• ClickHouse or Iceberg (raw storage)
**Result:**
• Zero sidecars
• < 1% CPU overhead
• Usage-based pipelines instead of per-host licenses
# Key takeaways
* eBPF probes run once per node → < 1 % CPU, zero sidecars
* Usage-based pipelines (ClickHouse / Iceberg) beat per-host licences
* Removing duplicate log streams saved another 40 % ingest
# 6-week roadmap & KPIs
1. **Deploy Cilium/Hubble** in a non-prod cluster; export to ClickHouse or S3. *Target: < 1 % node overhead*
2. **Enable eBPF profiling** (Pixie/Parca); compare to language agents. *Target: span parity*
3. **Shadow live traffic**; validate SLOs. *Target: < 2 % trace drop*
4. **Disable Datadog log ingest** for eBPF-covered namespaces. *Target: GB/day ↓ 40 %*
5. **Remove per-pod agents**; right-size node groups. *Target: CPU-hrs ↓*
6. **Pipe trimmed streams** to Iceberg / Redshift streaming for long-term ML/BI. *Target: $/GB storage ↓ 80 %*
https://redd.it/1lnrr6i
@r_devops
Datadog costs. **A lot.**
Companies are paying more for telemetry than some production workloads. I’ve been researching how SaaS teams are quietly cutting 30–70% of their observability costs by replacing per-host agents with kernel-native tooling.
Companies like [EX.CO](https://EX.CO) and open-source adopters using [SigNoz ](https://signoz.io/)are moving away from Datadog + CloudWatch and adopting **eBPF-first architectures** that are leaner, faster, and significantly cheaper.
# Stack shift
**Replace:**
• Datadog APM
• CloudWatch Logs
• CloudWatch Metrics
**With:**
• Cilium + Hubble (network flows)
• Pixie + Parca (profiling/traces)
• ClickHouse or Iceberg (raw storage)
**Result:**
• Zero sidecars
• < 1% CPU overhead
• Usage-based pipelines instead of per-host licenses
# Key takeaways
* eBPF probes run once per node → < 1 % CPU, zero sidecars
* Usage-based pipelines (ClickHouse / Iceberg) beat per-host licences
* Removing duplicate log streams saved another 40 % ingest
# 6-week roadmap & KPIs
1. **Deploy Cilium/Hubble** in a non-prod cluster; export to ClickHouse or S3. *Target: < 1 % node overhead*
2. **Enable eBPF profiling** (Pixie/Parca); compare to language agents. *Target: span parity*
3. **Shadow live traffic**; validate SLOs. *Target: < 2 % trace drop*
4. **Disable Datadog log ingest** for eBPF-covered namespaces. *Target: GB/day ↓ 40 %*
5. **Remove per-pod agents**; right-size node groups. *Target: CPU-hrs ↓*
6. **Pipe trimmed streams** to Iceberg / Redshift streaming for long-term ML/BI. *Target: $/GB storage ↓ 80 %*
https://redd.it/1lnrr6i
@r_devops
EX.CO - the machine-learning video platform
EX.CO is the smarter, machine-learning video technology that maximizes revenue for media companies across web, apps, CTV, and DOOH.
what else?
RHCSA+K8s+AWS cloud practitioner & sysops+azure Az-900+terraform+ansible+git+docker.
what should i do next im still a fresh graduate looking for a job, any advices , what about remotely ?
https://redd.it/1lnu65o
@r_devops
RHCSA+K8s+AWS cloud practitioner & sysops+azure Az-900+terraform+ansible+git+docker.
what should i do next im still a fresh graduate looking for a job, any advices , what about remotely ?
https://redd.it/1lnu65o
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Octopus Deploy Reviews... What's your feedback?
I'm curious about Octopus Deploy in practical DevOps settings... It seems to have great ratings especially for integration and support. While it gets praise for customizable steps and its UI, I’ve seen mentions of permissions headaches. If you've used it, what do you think: love it or hate it? How does it handle complex scaling? Any quirks I should know about? And with all the options out there, is it still worth using in 2025? Looking forward to this communities takes. I've gotten a ton of value as a lurker. Thanks in advance...
https://redd.it/1lnu62b
@r_devops
I'm curious about Octopus Deploy in practical DevOps settings... It seems to have great ratings especially for integration and support. While it gets praise for customizable steps and its UI, I’ve seen mentions of permissions headaches. If you've used it, what do you think: love it or hate it? How does it handle complex scaling? Any quirks I should know about? And with all the options out there, is it still worth using in 2025? Looking forward to this communities takes. I've gotten a ton of value as a lurker. Thanks in advance...
https://redd.it/1lnu62b
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Ansible vs Terraform for idempotency?
This post assumes all of us are familiar with these two tools for infrastructure provisioning and configuration. This has been bugging me for a while. The shop I’m at is in hybrid cloud setup and I’ve been using both of these tools and finding out how terraform is becoming redundant slowly. Both of the tools are sold for their idempotency for provisioning and configuration.
Terraform handles idempotency using statefiles with a persistent data store.
Ansible handles idempotency with “gathering facts” in memory and avoid any drift.
Pardon my ignorance as this might have been ask in another angle in this sub. But why would I choose terraform over ansible for infrastructure provisioning at this point with the hassle of handling persistent statefiles when I can just do a dry run of ansible to see the state of my infrastructure all handled in memory?
https://redd.it/1lnx00o
@r_devops
This post assumes all of us are familiar with these two tools for infrastructure provisioning and configuration. This has been bugging me for a while. The shop I’m at is in hybrid cloud setup and I’ve been using both of these tools and finding out how terraform is becoming redundant slowly. Both of the tools are sold for their idempotency for provisioning and configuration.
Terraform handles idempotency using statefiles with a persistent data store.
Ansible handles idempotency with “gathering facts” in memory and avoid any drift.
Pardon my ignorance as this might have been ask in another angle in this sub. But why would I choose terraform over ansible for infrastructure provisioning at this point with the hassle of handling persistent statefiles when I can just do a dry run of ansible to see the state of my infrastructure all handled in memory?
https://redd.it/1lnx00o
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Cloud SIEM
Irrespective of the costs associated with the tools, why would you choose any other Cloud SIEM tool over Datadog's Cloud SIEM?
https://redd.it/1lnyuy8
@r_devops
Irrespective of the costs associated with the tools, why would you choose any other Cloud SIEM tool over Datadog's Cloud SIEM?
https://redd.it/1lnyuy8
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Best Practices for Prompt Testing — learned from companies like Anthropic and OpenAI
Hey everyone! 👋
After months of research and talking to AI teams at top companies, we've compiled everything we've learned about building robust testing frameworks for LLM applications into one comprehensive guide.
What's covered:
🔬LLM-as-a-Judge evaluation - How to scale quality assessment beyond manual review (with detailed implementation strategies)
📈 Statistical significance testing - Proper hypothesis testing for prompt comparisons (because gut feelings don't cut it in production)
🎯 Comprehensive test set design - Coverage strategies that actually catch edge cases before users do
⚡ Advanced techniques - Adversarial testing, performance testing, and production monitoring
Key insights from the research:
• Systematic prompt evaluation can improve model performance by 40-60%
• Failure rates can be reduced by up to 80% with proper testing
• Most teams are still winging it with manual spot-checks (don't be most teams)
Why this matters: As LLMs move from demos to production systems handling real user traffic, the "move fast and break things" approach becomes... problematic. The companies that are winning are the ones treating prompt engineering like actual engineering.
The guide includes real implementation examples, statistical analysis methods, and a practical roadmap for getting started (even if you're currently doing zero testing).
Link: https://usebanyan.com/news/prompt-testing-best-practices
Would love to hear about your experiences with prompt testing - what's worked, what hasn't, and what challenges you're facing. Always looking to learn from the community!
— The Banyan Team 🌳
https://redd.it/1lo25df
@r_devops
Hey everyone! 👋
After months of research and talking to AI teams at top companies, we've compiled everything we've learned about building robust testing frameworks for LLM applications into one comprehensive guide.
What's covered:
🔬LLM-as-a-Judge evaluation - How to scale quality assessment beyond manual review (with detailed implementation strategies)
📈 Statistical significance testing - Proper hypothesis testing for prompt comparisons (because gut feelings don't cut it in production)
🎯 Comprehensive test set design - Coverage strategies that actually catch edge cases before users do
⚡ Advanced techniques - Adversarial testing, performance testing, and production monitoring
Key insights from the research:
• Systematic prompt evaluation can improve model performance by 40-60%
• Failure rates can be reduced by up to 80% with proper testing
• Most teams are still winging it with manual spot-checks (don't be most teams)
Why this matters: As LLMs move from demos to production systems handling real user traffic, the "move fast and break things" approach becomes... problematic. The companies that are winning are the ones treating prompt engineering like actual engineering.
The guide includes real implementation examples, statistical analysis methods, and a practical roadmap for getting started (even if you're currently doing zero testing).
Link: https://usebanyan.com/news/prompt-testing-best-practices
Would love to hear about your experiences with prompt testing - what's worked, what hasn't, and what challenges you're facing. Always looking to learn from the community!
— The Banyan Team 🌳
https://redd.it/1lo25df
@r_devops
Banyan
Banyan - Visual LLM Workflow Builder | Version Control for AI Prompts
Build LLM workflows like flowcharts. Git-style version control for prompts, A/B testing, and automated evaluation. Professional prompt management for AI teams.
Python learning path
Hey guys wanted to learn python , for quite a while now, could someone please suggest any resources that are useful , I have worked with python a bit tweaking code here and there .
Could someone please share a course that they have found useful.
Also is it worth to put in learning efforts , especially when ai is there?
https://redd.it/1lo31ki
@r_devops
Hey guys wanted to learn python , for quite a while now, could someone please suggest any resources that are useful , I have worked with python a bit tweaking code here and there .
Could someone please share a course that they have found useful.
Also is it worth to put in learning efforts , especially when ai is there?
https://redd.it/1lo31ki
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Certified Kubernetes Administrator (CKA) Exam Guide - V1.32 (2025)
Your ultimate resource for acing the CKA exam on your first attempt! This repo offers detailed explanations, hands-on labs, and essential study materials, empowering aspiring Kubernetes administrators to master their skills and achieve certification success. Unlock your Kubernetes potential today!
https://github.com/techwithmohamed/CKA-Certified-Kubernetes-Administrator
https://redd.it/1lo3aba
@r_devops
Your ultimate resource for acing the CKA exam on your first attempt! This repo offers detailed explanations, hands-on labs, and essential study materials, empowering aspiring Kubernetes administrators to master their skills and achieve certification success. Unlock your Kubernetes potential today!
https://github.com/techwithmohamed/CKA-Certified-Kubernetes-Administrator
https://redd.it/1lo3aba
@r_devops
GitHub
GitHub - techwithmohamed/CKA-Certified-Kubernetes-Administrator: CKA Certification Exam Guide 2026 — study notes, practice questions…
CKA Certification Exam Guide 2026 — study notes, practice questions, kubectl cheat sheet, exam tips, and full Kubernetes v1.35 syllabus breakdown. Covers etcd backup, RBAC, kubeadm, Gateway API, Ne...
Got Amazon Devops 2 interview in a few days!
Got Amazon Devops 2 interview in a few days! Pls if someone can help me with what to prepare and what type of questions I can expect in the interview. Thank you
https://redd.it/1lo4p8n
@r_devops
Got Amazon Devops 2 interview in a few days! Pls if someone can help me with what to prepare and what type of questions I can expect in the interview. Thank you
https://redd.it/1lo4p8n
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
I'm getting an error after certificate renewal please help
Hello,
My Kubernetes cluster was running smoothly until I tried to renew the certificates after they expired. I ran the following commands:
>sudo kubeadm certs renew all
>echo 'export KUBECONFIG=/etc/kubernetes/admin.conf' >> \~/.bashrc
>source \~/.bashrc
After that, some abnormalities started to appear in my cluster. Calico is completely down and even after deleting and reinstalling it, it does not come back up at all.
When I check the daemonsets and deployments in the kube-system namespace, I see:
>kubectl get daemonset -n kube-system
>NAME DESIRED CURRENT READY UP-TO-DATE AVAILABLE NODE SELECTOR AGE
>calico-node 0 0 0 0 0 kubernetes.io/os=linux 4m4s
>
>kubectl get deployments -n kube-system
>NAME READY UP-TO-DATE AVAILABLE AGE
>calico-kube-controllers 0/1 0 0 4m19s
Before this, I was also getting "unauthorized" errors in the kubelet logs, which started after renewing the certificates. This is definitely abnormal because the pods created from deployments are not coming up and remain stuck.
There is no error message shown during deployment either. Please help.
https://redd.it/1lo52hc
@r_devops
Hello,
My Kubernetes cluster was running smoothly until I tried to renew the certificates after they expired. I ran the following commands:
>sudo kubeadm certs renew all
>echo 'export KUBECONFIG=/etc/kubernetes/admin.conf' >> \~/.bashrc
>source \~/.bashrc
After that, some abnormalities started to appear in my cluster. Calico is completely down and even after deleting and reinstalling it, it does not come back up at all.
When I check the daemonsets and deployments in the kube-system namespace, I see:
>kubectl get daemonset -n kube-system
>NAME DESIRED CURRENT READY UP-TO-DATE AVAILABLE NODE SELECTOR AGE
>calico-node 0 0 0 0 0 kubernetes.io/os=linux 4m4s
>
>kubectl get deployments -n kube-system
>NAME READY UP-TO-DATE AVAILABLE AGE
>calico-kube-controllers 0/1 0 0 4m19s
Before this, I was also getting "unauthorized" errors in the kubelet logs, which started after renewing the certificates. This is definitely abnormal because the pods created from deployments are not coming up and remain stuck.
There is no error message shown during deployment either. Please help.
https://redd.it/1lo52hc
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Update: DockedUp v1.0.0 release, check the demo once !!!
Hey r/devops!
Last week I introduced **DockedUp** — a real-time, interactive terminal dashboard for managing Docker containers. Thanks so much for the support and feedback! 🙌
I’ve just pushed a big update with performance improvements, better logs, and smoother UI — plus a new demo to show it off:
**Check out the new demo GIF**
### Install via pip or pipx:
### Then just run:
#### Links:
GitHub: [github.com/anilrajrimal1/dockedup](https://github.com/anilrajrimal1/dockedup)
PyPI: pypi.org/project/dockedup
https://redd.it/1loa4j8
@r_devops
Hey r/devops!
Last week I introduced **DockedUp** — a real-time, interactive terminal dashboard for managing Docker containers. Thanks so much for the support and feedback! 🙌
I’ve just pushed a big update with performance improvements, better logs, and smoother UI — plus a new demo to show it off:
**Check out the new demo GIF**
### Install via pip or pipx:
pipx install dockedup
### or
pip install dockedup
### Then just run:
dockedup
#### Links:
GitHub: [github.com/anilrajrimal1/dockedup](https://github.com/anilrajrimal1/dockedup)
PyPI: pypi.org/project/dockedup
https://redd.it/1loa4j8
@r_devops
GitHub
GitHub - anilrajrimal1/dockedup: A real-time, interactive CLI dashboard for monitoring Docker containers. View status, health,…
A real-time, interactive CLI dashboard for monitoring Docker containers. View status, health, CPU, and memory usage with a clean, color-coded interface. Supports docker-compose grouping and hotkeys...
Suggestions for an innovation sprint project? What useful new concepts or tech is 'trending'?
We are planning an innovation sprint (1 week to create a demo/PoC for a green-field project, 1 week to finalise, prep slides and demonstrate) and are at the ideas stage. I had hard plans of what I wanted to use the time for which were completely trainwrecked by a late directive to fit RnD tax credits.
I'm now in a position where I am absolutely uninterested and would like some help taking back some control of this valuable time - and not get roped in as a 6th person working on a 'support hub chat bot' project.
Any suggestions for things to consider?
\- Is there somewhere I follow for good coverage of new trends and evolution in the DevOps field?
\- We have aks clusters in azure for deployments without any tools like Kubecost implemented. Could be a good way to brush up on my k8s/helm knowledge and deliver something that would look good in my annual review if it manages any costs savings?
Thanks for any advice!
https://redd.it/1loa5w5
@r_devops
We are planning an innovation sprint (1 week to create a demo/PoC for a green-field project, 1 week to finalise, prep slides and demonstrate) and are at the ideas stage. I had hard plans of what I wanted to use the time for which were completely trainwrecked by a late directive to fit RnD tax credits.
I'm now in a position where I am absolutely uninterested and would like some help taking back some control of this valuable time - and not get roped in as a 6th person working on a 'support hub chat bot' project.
Any suggestions for things to consider?
\- Is there somewhere I follow for good coverage of new trends and evolution in the DevOps field?
\- We have aks clusters in azure for deployments without any tools like Kubecost implemented. Could be a good way to brush up on my k8s/helm knowledge and deliver something that would look good in my annual review if it manages any costs savings?
Thanks for any advice!
https://redd.it/1loa5w5
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community