How do you not burn out?
I’ll Try to TLDR - Not in a senior role, under that and brought on with no prior devops experience but definitely a role supporting dev teams pushing through CI/CD implementation.
It seems that now I am the main point of contact for our applications. Which they are a few - For the most part my senior has migrated them to a more stable state. With no previous devops experience, I have been able to swim despite being thrown into the deep end. Now, I’ve run across a few issues which took a LOT longer than i would have liked, (days / weeks) and it turned out to be the silliest of things. Although I’m glad it’s resolved, i feel mentally exhausted lol. I am unofficially the point of contact for our apps. Any discussion on new implementation of anything, has to go through me. I sh*t my pants cause half the time I honestly dont know what or how to implement what they are looking for. Imposter syndrome is real. Have been in the role for sometime now, but its all starting to hit me, and i feel like everyone knows i dont know squat lol.
Implementing new infrastructure requires a lot of trail and error and i may skip things or miss things, much to the annoyance of the team i support. I’ll most likely take a day or two in the next few days or wait till the holiday.
https://redd.it/1kis5mg
@r_devops
I’ll Try to TLDR - Not in a senior role, under that and brought on with no prior devops experience but definitely a role supporting dev teams pushing through CI/CD implementation.
It seems that now I am the main point of contact for our applications. Which they are a few - For the most part my senior has migrated them to a more stable state. With no previous devops experience, I have been able to swim despite being thrown into the deep end. Now, I’ve run across a few issues which took a LOT longer than i would have liked, (days / weeks) and it turned out to be the silliest of things. Although I’m glad it’s resolved, i feel mentally exhausted lol. I am unofficially the point of contact for our apps. Any discussion on new implementation of anything, has to go through me. I sh*t my pants cause half the time I honestly dont know what or how to implement what they are looking for. Imposter syndrome is real. Have been in the role for sometime now, but its all starting to hit me, and i feel like everyone knows i dont know squat lol.
Implementing new infrastructure requires a lot of trail and error and i may skip things or miss things, much to the annoyance of the team i support. I’ll most likely take a day or two in the next few days or wait till the holiday.
https://redd.it/1kis5mg
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Using kube-downscaler to reduce Kubernetes costs—my take
If you're running dev/staging clusters or workloads with predictable low-traffic hours, kube-downscaler is a simple win.
It lets you define schedules (via annotations) to scale Deployments down—without interfering with HPA.
I shared my setup, where it fits well, and a few caveats here:
https://blog.abhimanyu-saharan.com/posts/reduce-kubernetes-costs-with-kube-downscaler
Curious—anyone using this in production? Or paired it with Keda?
https://redd.it/1kis091
@r_devops
If you're running dev/staging clusters or workloads with predictable low-traffic hours, kube-downscaler is a simple win.
It lets you define schedules (via annotations) to scale Deployments down—without interfering with HPA.
I shared my setup, where it fits well, and a few caveats here:
https://blog.abhimanyu-saharan.com/posts/reduce-kubernetes-costs-with-kube-downscaler
Curious—anyone using this in production? Or paired it with Keda?
https://redd.it/1kis091
@r_devops
Preparation for Broadcom
Hello, I have an onsite interview with Broadcom in 7 days, final step in the interview process. It's from 9 am to 4 pm with a lunch session. This is for a SWEII Dev Ops position.
I have been unemployed since February and I really need this job.
One good thing is that after the first 1 hour conversation, they told me they'd get back to me the next business day, but got back to me the same night.
When i asked more about the onsite interview, they said, "The technical interview will cover a range of computer science topics—from fundamental programming concepts to development processes and best practices. You may also be presented with on-the-spot scenarios to gauge how you approach problems with limited context and time. The interview is intended to be interactive, so you're encouraged to ask questions for clarification rather than make assumptions."
Which is super vague in my opinion? Obviously I will explain my thinking out loud.
Could anyone offer any tips? Or interviewed with Broadcom before?
I have never done Dev Ops before, I explained that I feel I am a good fit for this role because I have always found my self trying to automate processes or make a process more efficient at my most recent job. It was at a defense company, but a lot of their processes were manual for some reason.
https://redd.it/1kiz4nv
@r_devops
Hello, I have an onsite interview with Broadcom in 7 days, final step in the interview process. It's from 9 am to 4 pm with a lunch session. This is for a SWEII Dev Ops position.
I have been unemployed since February and I really need this job.
One good thing is that after the first 1 hour conversation, they told me they'd get back to me the next business day, but got back to me the same night.
When i asked more about the onsite interview, they said, "The technical interview will cover a range of computer science topics—from fundamental programming concepts to development processes and best practices. You may also be presented with on-the-spot scenarios to gauge how you approach problems with limited context and time. The interview is intended to be interactive, so you're encouraged to ask questions for clarification rather than make assumptions."
Which is super vague in my opinion? Obviously I will explain my thinking out loud.
Could anyone offer any tips? Or interviewed with Broadcom before?
I have never done Dev Ops before, I explained that I feel I am a good fit for this role because I have always found my self trying to automate processes or make a process more efficient at my most recent job. It was at a defense company, but a lot of their processes were manual for some reason.
https://redd.it/1kiz4nv
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Should I pursue AWS and Kubernetes certificates? + please critique my learning plan
Are AWS and K8s certs worth it from the job hunt perspective?
\- Are AWS and K8s certs a pre-requisite to getting a DevOps job?
Are AWS and K8s certs worth it from a learning perspective?
I see many posts that either support certifications or diss certifications, and I am confused.
\---
Also, please critique my personal plan to learn more about DevOps:
Context:
\- 2.2 years experience SWE, \~8 months of professional experience with terraform, github actions, and docker.
\- I enjoy infrastructure stuff and want to break into DevOps (teams focused on infra)
\- have a lot of free time
I plan to obtain the following certifications:
AWS: Solutions Architect associate, Developer Associate, Sysadmin Associate, DevOps Professional
K8s: KCNA, CKA, and CKAD
As I study for each certification, I will implement each thing I learn into my homelab. That way, I get the conceptual knowledge, and also apply said knowledge in a hands-on fashion. This will solidify my understanding of what I learned, and also build me an amazing resume project over time. I imagine the learning gains from this will be immense, which I look forward to.
The main reason I want to get certifications is to obtain more knowledge and skills. Certifications are a structured way to do so, and also can help me a get a job (I've heard).
Why I think my plan is a good idea:
\- Certifications expose me to things I don't know. (You don't know what you don't know)
\- I obtain new knowledge, apply it practically via my homelab, deepening my understanding and building my resume.
\- I also get certifications, which can help me get a job (i've heard)
https://redd.it/1kj1stu
@r_devops
Are AWS and K8s certs worth it from the job hunt perspective?
\- Are AWS and K8s certs a pre-requisite to getting a DevOps job?
Are AWS and K8s certs worth it from a learning perspective?
I see many posts that either support certifications or diss certifications, and I am confused.
\---
Also, please critique my personal plan to learn more about DevOps:
Context:
\- 2.2 years experience SWE, \~8 months of professional experience with terraform, github actions, and docker.
\- I enjoy infrastructure stuff and want to break into DevOps (teams focused on infra)
\- have a lot of free time
I plan to obtain the following certifications:
AWS: Solutions Architect associate, Developer Associate, Sysadmin Associate, DevOps Professional
K8s: KCNA, CKA, and CKAD
As I study for each certification, I will implement each thing I learn into my homelab. That way, I get the conceptual knowledge, and also apply said knowledge in a hands-on fashion. This will solidify my understanding of what I learned, and also build me an amazing resume project over time. I imagine the learning gains from this will be immense, which I look forward to.
The main reason I want to get certifications is to obtain more knowledge and skills. Certifications are a structured way to do so, and also can help me a get a job (I've heard).
Why I think my plan is a good idea:
\- Certifications expose me to things I don't know. (You don't know what you don't know)
\- I obtain new knowledge, apply it practically via my homelab, deepening my understanding and building my resume.
\- I also get certifications, which can help me get a job (i've heard)
https://redd.it/1kj1stu
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
What infrastructure monitoring topic would you like to see covered by an Observability Architect?
Hey everyone,
I’m a DevOps/Observability architect at an enterprise-scale SAAS startup, and I’m planning a deep-dive blog post on infrastructure monitoring. Before I lock down the topic, I want to hear from you:
>
Here are a few ideas I’m kicking around, feel free to up-vote the ones you’d find most valuable or suggest something completely different:
1. Designing SLO-Driven Monitoring Pipelines
2. High-Cardinality Metrics at Scale
3. Alert Fatigue & Noise Reduction
4. Observability for Containerized/Kubernetes Environments
5. Optimized Data Retention
6. Central vs. Cluster-Specific Monitoring
7. Grafana Dashboards & Performance
8. Alerting Mechanisms & Routing
9. Noise Reduction & Metric Hygiene
What do you think? Which of these resonates the most, or is there another niche edge case you’d love to see tackled by someone who lives and breathes observability every day? Drop your thoughts below I appreciate your input!
https://redd.it/1kj4329
@r_devops
Hey everyone,
I’m a DevOps/Observability architect at an enterprise-scale SAAS startup, and I’m planning a deep-dive blog post on infrastructure monitoring. Before I lock down the topic, I want to hear from you:
>
Here are a few ideas I’m kicking around, feel free to up-vote the ones you’d find most valuable or suggest something completely different:
1. Designing SLO-Driven Monitoring Pipelines
2. High-Cardinality Metrics at Scale
3. Alert Fatigue & Noise Reduction
4. Observability for Containerized/Kubernetes Environments
5. Optimized Data Retention
6. Central vs. Cluster-Specific Monitoring
7. Grafana Dashboards & Performance
8. Alerting Mechanisms & Routing
9. Noise Reduction & Metric Hygiene
What do you think? Which of these resonates the most, or is there another niche edge case you’d love to see tackled by someone who lives and breathes observability every day? Drop your thoughts below I appreciate your input!
https://redd.it/1kj4329
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Looks like I need to switch my field from devops - where should I go ?
I’m currently working as a DevOps Engineer, but I feel like I’m not a "complete" DevOps engineer because I struggle with programming. I’m fairly comfortable with cloud computing (AWS), Kubernetes, Terraform, Docker, monitoring tools, CI/CD pipelines, automation, infrastructure management, troubleshooting, etc.
In my day-to-day role, I:
Manage infrastructure and Kubernetes clusters
Set up and maintain CI/CD pipelines
Dockerize and release applications
Handle automation tasks and general troubleshooting
However, around 20% of my tasks require scripting or programming, and this is where I fall short. I don't know Python, and my Bash scripting skills are not strong. I usually rely on AI tools to help me write scripts when needed. Without them, I struggle to do it independently.
I’ve tried learning Python and improving my Bash, but it’s been tough and progress has been slow. So I’m wondering:
Given that I’m strong with DevOps tools but weak in programming and scripting, what direction should I consider moving into?
Of course, I plan to keep upskilling, but I’d appreciate guidance on a career path that aligns better with my strengths.
https://redd.it/1kj4rxz
@r_devops
I’m currently working as a DevOps Engineer, but I feel like I’m not a "complete" DevOps engineer because I struggle with programming. I’m fairly comfortable with cloud computing (AWS), Kubernetes, Terraform, Docker, monitoring tools, CI/CD pipelines, automation, infrastructure management, troubleshooting, etc.
In my day-to-day role, I:
Manage infrastructure and Kubernetes clusters
Set up and maintain CI/CD pipelines
Dockerize and release applications
Handle automation tasks and general troubleshooting
However, around 20% of my tasks require scripting or programming, and this is where I fall short. I don't know Python, and my Bash scripting skills are not strong. I usually rely on AI tools to help me write scripts when needed. Without them, I struggle to do it independently.
I’ve tried learning Python and improving my Bash, but it’s been tough and progress has been slow. So I’m wondering:
Given that I’m strong with DevOps tools but weak in programming and scripting, what direction should I consider moving into?
Of course, I plan to keep upskilling, but I’d appreciate guidance on a career path that aligns better with my strengths.
https://redd.it/1kj4rxz
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Argo CD Setup with Terraform on EKS Clusters
I have an EKS cluster that I use for labs, which is deployed and destroyed using Terraform. I want to configure Argo CD on this cluster, but I would like the setup to be automated using Terraform. This way, I won't have to manually configure Argo CD every time I recreate the cluster. Can anyone point me in the right direction? Thanks!
https://redd.it/1kj52hu
@r_devops
I have an EKS cluster that I use for labs, which is deployed and destroyed using Terraform. I want to configure Argo CD on this cluster, but I would like the setup to be automated using Terraform. This way, I won't have to manually configure Argo CD every time I recreate the cluster. Can anyone point me in the right direction? Thanks!
https://redd.it/1kj52hu
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Devops without CS degree
Is it possible ? At base i wanna follow mechanical engineering but i have a smiliarly big passion for linux and programming aswell(although its pretty challanging) . Will i be able to switch or choose careers without a CS degree? (With a decent github repo of good ideas in python , automation and networking)
https://redd.it/1kj6p3u
@r_devops
Is it possible ? At base i wanna follow mechanical engineering but i have a smiliarly big passion for linux and programming aswell(although its pretty challanging) . Will i be able to switch or choose careers without a CS degree? (With a decent github repo of good ideas in python , automation and networking)
https://redd.it/1kj6p3u
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Did platform engineering also kill all small devops teams in your corpo BUs?
So I was in such small devops team in one of BUs. Platform department abstracted more and more stuff behind their IDP clickops.
After some time all the work we did (even of I still think was done better than many platform solutions) was abstracted. Infrastructure ? use UI to generate it. Need cicd? Use template. Template does not fit you exactly? Well too bad. GL.
Almost every part of regular devops engineer work was automated with a layer of ClickOps on top.
I strongly believe platform engineering is a direct competitor to devops (aka „devops at scale”).
Was this the same for your corpo ? (Ps. We are talking here about big corpos ~ few thousend ppl min)
https://redd.it/1kja11q
@r_devops
So I was in such small devops team in one of BUs. Platform department abstracted more and more stuff behind their IDP clickops.
After some time all the work we did (even of I still think was done better than many platform solutions) was abstracted. Infrastructure ? use UI to generate it. Need cicd? Use template. Template does not fit you exactly? Well too bad. GL.
Almost every part of regular devops engineer work was automated with a layer of ClickOps on top.
I strongly believe platform engineering is a direct competitor to devops (aka „devops at scale”).
Was this the same for your corpo ? (Ps. We are talking here about big corpos ~ few thousend ppl min)
https://redd.it/1kja11q
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
I have been a SDET for the last 6 years, how do I move to devops ?
Got laid off recently and looking for new areas I can transition to, I am pretty good in python and have decent understanding of ci/cd principles. At one of my jobs I created test and deployment pipeline in Jenkins as well. How devops jobs that I see demand a lot. So I had following questions.
What skill sets do I have to learn to get my foot in the door ?
I can probably get the free OCI associate certificate within a week, would that help ?
How devops is different than SRE jobs ?
https://redd.it/1kj9yma
@r_devops
Got laid off recently and looking for new areas I can transition to, I am pretty good in python and have decent understanding of ci/cd principles. At one of my jobs I created test and deployment pipeline in Jenkins as well. How devops jobs that I see demand a lot. So I had following questions.
What skill sets do I have to learn to get my foot in the door ?
I can probably get the free OCI associate certificate within a week, would that help ?
How devops is different than SRE jobs ?
https://redd.it/1kj9yma
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
ChallENGES with MOBILE
[https://www.reddit.com/r/interviewhammer/comments/1kjazgr/challenge\_can\_any\_interview\_platform\_detect\_our/](https://www.reddit.com/r/interviewhammer/comments/1kjazgr/challenge_can_any_interview_platform_detect_our/)
We built [interviewHammer](https://interviewhammer.com/) AI tool that helps with coding and regular interviews, and we’re challenging anyone (recruiters, platforms like HackerRank, LeetCode, Coderpad, etc..) to detect it.
* Here’s the deal: No subscription needed, try it with the free trial.
* Works on both Windows and Mac.
* If you manage to prove any site/tool can detect it, we’ll give you a **free 2-month subscription to interviewHammer the latest ChatGPT model**.
.................
***We’re confident that our tool is completely undetectable.***
*For example, in a coding interview, most other tools rely on the laptop to take screenshots, which can be flagged.*
*But our tool uses your mobile phone to capture screenshots of the questions, and the answers are displayed directly on your phone.*
*This means there’s* ***no way*** *for any website or application to detect it.*
.................
To win:
1. Record a full video showing how detection happens.
2. Include the **exact steps to reproduce the scenario**.
3. If we’re able to reproduce it ourselves, we’ll confirm it publicly, shout you out by name, and reward you.
We’re confident. This is your chance to prove us wrong. 👀
https://redd.it/1kjbv6j
@r_devops
[https://www.reddit.com/r/interviewhammer/comments/1kjazgr/challenge\_can\_any\_interview\_platform\_detect\_our/](https://www.reddit.com/r/interviewhammer/comments/1kjazgr/challenge_can_any_interview_platform_detect_our/)
We built [interviewHammer](https://interviewhammer.com/) AI tool that helps with coding and regular interviews, and we’re challenging anyone (recruiters, platforms like HackerRank, LeetCode, Coderpad, etc..) to detect it.
* Here’s the deal: No subscription needed, try it with the free trial.
* Works on both Windows and Mac.
* If you manage to prove any site/tool can detect it, we’ll give you a **free 2-month subscription to interviewHammer the latest ChatGPT model**.
.................
***We’re confident that our tool is completely undetectable.***
*For example, in a coding interview, most other tools rely on the laptop to take screenshots, which can be flagged.*
*But our tool uses your mobile phone to capture screenshots of the questions, and the answers are displayed directly on your phone.*
*This means there’s* ***no way*** *for any website or application to detect it.*
.................
To win:
1. Record a full video showing how detection happens.
2. Include the **exact steps to reproduce the scenario**.
3. If we’re able to reproduce it ourselves, we’ll confirm it publicly, shout you out by name, and reward you.
We’re confident. This is your chance to prove us wrong. 👀
https://redd.it/1kjbv6j
@r_devops
Reddit
From the interviewhammer community on Reddit: 📢 Challenge: Can ANY interview platform detect our interviewHammer assistant tool?
Explore this post and more from the interviewhammer community
The biggest DevOps lesson I’ve learned? It’s not about the tools—it’s about ownership
When I first got into DevOps, I obsessed over tools: Docker, Jenkins, Terraform, you name it. I thought knowing the tech would make me a great engineer.
But over time, I’ve realized the real shift is in how you think. DevOps isn’t just automation—it’s taking ownership from code to production. If something breaks in prod? You don’t say “that’s the dev team’s fault.” You own it, debug it, and fix the pipeline or infra that caused it.
Tools come and go. What sticks is this mindset of responsibility and constant improvement.
Anyone else feel like their biggest DevOps growth came from a shift in how they think—not what they use?
https://redd.it/1kjgtcp
@r_devops
When I first got into DevOps, I obsessed over tools: Docker, Jenkins, Terraform, you name it. I thought knowing the tech would make me a great engineer.
But over time, I’ve realized the real shift is in how you think. DevOps isn’t just automation—it’s taking ownership from code to production. If something breaks in prod? You don’t say “that’s the dev team’s fault.” You own it, debug it, and fix the pipeline or infra that caused it.
Tools come and go. What sticks is this mindset of responsibility and constant improvement.
Anyone else feel like their biggest DevOps growth came from a shift in how they think—not what they use?
https://redd.it/1kjgtcp
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
What are some good resources for learning about devops for mobile apps?
Looking to learn about Mobile DevOps. Share your experiences also.
https://redd.it/1kjlhib
@r_devops
Looking to learn about Mobile DevOps. Share your experiences also.
https://redd.it/1kjlhib
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Best secure VCS to use in big companies
Hello everyone, my company is aiming to use a version control system (VCS) in our development team, up till now our IT team task were simple but overtime the team grew and our codes became more complex.
Thus we want a VCS application that is efficient but also secure, we need to make sure our codes don’t get leaked out.
I have suggested Git and GitHub since it’s the only one I know, but to be honest idk if they are secure enough or if we can manage it locally in our servers instead of GitHub servers
So what are your suggestions? Maybe something that big companies use? do you have other suggestions that are more secure and managed locally in our servers if possible, if not then something secure enough so I can suggest it to the team.
Thanks 🫂
https://redd.it/1kjwbgr
@r_devops
Hello everyone, my company is aiming to use a version control system (VCS) in our development team, up till now our IT team task were simple but overtime the team grew and our codes became more complex.
Thus we want a VCS application that is efficient but also secure, we need to make sure our codes don’t get leaked out.
I have suggested Git and GitHub since it’s the only one I know, but to be honest idk if they are secure enough or if we can manage it locally in our servers instead of GitHub servers
So what are your suggestions? Maybe something that big companies use? do you have other suggestions that are more secure and managed locally in our servers if possible, if not then something secure enough so I can suggest it to the team.
Thanks 🫂
https://redd.it/1kjwbgr
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Looking for feedback on GitHub Actions runner alternatives
Hey all,
We currently use **x64 Ubuntu machines** via **GitHub-hosted runners** for our workflows and are evaluating alternatives for cost and performance improvements.
Has anyone here used any of the following runner platforms?
* **Blacksmith**
* **Ubicloud**
* **BuildJet**
* **WarpBuild**
* **runs-on**
* **Namespace**
I’m particularly interested in:
* **Startup time / cold start latency**
* **Job execution performance**
* **Pricing**
* **Integration complexity with GitHub Actions**
* **Any gotchas or unexpected limitations**
Would love to hear from anyone who's adopted one of these, or has done benchmarking against GitHub-hosted runners. Any insights or experiences would help us decide if it's worth migrating or sticking with what we have.
Thanks in advance!
https://redd.it/1kjypf8
@r_devops
Hey all,
We currently use **x64 Ubuntu machines** via **GitHub-hosted runners** for our workflows and are evaluating alternatives for cost and performance improvements.
Has anyone here used any of the following runner platforms?
* **Blacksmith**
* **Ubicloud**
* **BuildJet**
* **WarpBuild**
* **runs-on**
* **Namespace**
I’m particularly interested in:
* **Startup time / cold start latency**
* **Job execution performance**
* **Pricing**
* **Integration complexity with GitHub Actions**
* **Any gotchas or unexpected limitations**
Would love to hear from anyone who's adopted one of these, or has done benchmarking against GitHub-hosted runners. Any insights or experiences would help us decide if it's worth migrating or sticking with what we have.
Thanks in advance!
https://redd.it/1kjypf8
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Startup Founders
When does SAAS startup or any startup think about IT infrastructure as per your experience?
https://redd.it/1kk4rv4
@r_devops
When does SAAS startup or any startup think about IT infrastructure as per your experience?
https://redd.it/1kk4rv4
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
I’m done applying. I’ll fix your cloud/SRE problem in 48 hours and for free.
I’m a Site Reliability Engineer with 3 years of experience stabilizing cloud chaos , scaling infrastructure, optimizing observability, and putting out production fires nobody else could trace.
But after months of getting ghosted by hiring pipelines, I’m flipping the script.
Here’s the deal:
Give me one real, gnarly infra or SRE issue I’ll solve it in 48 hours. Free. No strings.
Dealing with stuff like:
ML workloads starving your GPU nodes and breaking autoscaling?
CI runners hogging ephemeral disks and silently failing deploys?
OpenTelemetry or Datadog showing 0% CPU... right before your pod dies?
Terraform state files locking up during high-frequency changes?
Real-time APIs randomly timing out under load but only during inference spikes?
S3 buckets quietly serving stale model files after a blue/green deployment?
IAM policies growing into unmanageable beasts breaking least privilege by accident?
Docker build cache exploding and pushing deploy times past 15 minutes?
EKS upgrades failing because of legacy node taints?
GitHub Actions burning free minutes due to missing cache keys?
Broken rollback logic that works in staging but fails in production?
Load balancers routing traffic unevenly across AZs during scale events?
Secrets leaking from ENV vars in ephemeral test environments?
Lambda cold starts doubling after a version bump and nobody knows why?
These are the problems I love solving and the kind of fires I’ve put out before.
Reply here or DM me your toughest infra/SRE pain. I’ll pick a few, solve them fast, and share anonymized fixes publicly.
You get a real solution. I get to prove what I can do no fluff, just execution.
Let’s build.
https://redd.it/1kk685o
@r_devops
I’m a Site Reliability Engineer with 3 years of experience stabilizing cloud chaos , scaling infrastructure, optimizing observability, and putting out production fires nobody else could trace.
But after months of getting ghosted by hiring pipelines, I’m flipping the script.
Here’s the deal:
Give me one real, gnarly infra or SRE issue I’ll solve it in 48 hours. Free. No strings.
Dealing with stuff like:
ML workloads starving your GPU nodes and breaking autoscaling?
CI runners hogging ephemeral disks and silently failing deploys?
OpenTelemetry or Datadog showing 0% CPU... right before your pod dies?
Terraform state files locking up during high-frequency changes?
Real-time APIs randomly timing out under load but only during inference spikes?
S3 buckets quietly serving stale model files after a blue/green deployment?
IAM policies growing into unmanageable beasts breaking least privilege by accident?
Docker build cache exploding and pushing deploy times past 15 minutes?
EKS upgrades failing because of legacy node taints?
GitHub Actions burning free minutes due to missing cache keys?
Broken rollback logic that works in staging but fails in production?
Load balancers routing traffic unevenly across AZs during scale events?
Secrets leaking from ENV vars in ephemeral test environments?
Lambda cold starts doubling after a version bump and nobody knows why?
These are the problems I love solving and the kind of fires I’ve put out before.
Reply here or DM me your toughest infra/SRE pain. I’ll pick a few, solve them fast, and share anonymized fixes publicly.
You get a real solution. I get to prove what I can do no fluff, just execution.
Let’s build.
https://redd.it/1kk685o
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Containers with azure functions
hello lately I have started a new project that have few apps hosted on azure functions, but not as a container. I want to start deploying the apps as containers in azure functions.
the base image is pretty big, the base azure function for node is around 2GB. I used dive to get inside, and I have found there are some unused runtimes installed and some azure function bundles with older version that I can delete.
with cleaning and using slim version, I can get the base image to 1 GB.
I was wondering if you have any tips and tricks for containerized azure function to keep the image small.
cheers
https://redd.it/1kk4zb8
@r_devops
hello lately I have started a new project that have few apps hosted on azure functions, but not as a container. I want to start deploying the apps as containers in azure functions.
the base image is pretty big, the base azure function for node is around 2GB. I used dive to get inside, and I have found there are some unused runtimes installed and some azure function bundles with older version that I can delete.
with cleaning and using slim version, I can get the base image to 1 GB.
I was wondering if you have any tips and tricks for containerized azure function to keep the image small.
cheers
https://redd.it/1kk4zb8
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
15 Years of DevOps, yet manual schema migrations still a thing
Hey All,
My name is Rotem, co-founder of atlasgo.io
One of the most surprising things I learned since starting the company 4 years ago is that manual database schema changes are still a thing. Way more common that I had thought.
We commonly see this is in customer calls - the team has CI/CD pipelines for app delivery, maybe even IaC for cloud stuff - but the database - still devs/DBAs connect directly to prod to apply changes.
This came as a surprise to me since tools for automating schema changes have existed since at least 2006.
Our DevRel Engineer u/noarogo published a piece about it today:
https://atlasgo.io/blog/2025/05/11/auto-vs-manual
What's your experience? Do you still see this practice?
If you see it, what's your explanation for this gap?
https://redd.it/1kk8x91
@r_devops
Hey All,
My name is Rotem, co-founder of atlasgo.io
One of the most surprising things I learned since starting the company 4 years ago is that manual database schema changes are still a thing. Way more common that I had thought.
We commonly see this is in customer calls - the team has CI/CD pipelines for app delivery, maybe even IaC for cloud stuff - but the database - still devs/DBAs connect directly to prod to apply changes.
This came as a surprise to me since tools for automating schema changes have existed since at least 2006.
Our DevRel Engineer u/noarogo published a piece about it today:
https://atlasgo.io/blog/2025/05/11/auto-vs-manual
What's your experience? Do you still see this practice?
If you see it, what's your explanation for this gap?
https://redd.it/1kk8x91
@r_devops
atlasgo.io
Atlas is a language-agnostic tool for managing and migrating database schemas using modern DevOps principles. It enables developers to automate schema changes through both declarative (schema-as-code) and versioned migration workflows, supporting inputs like…
Getting env file to digitalocean droplet
Hello I currently have a next.js app and I'm currently deploying to digitalocean droplets using github actions, but I'm kind of confused on how to get my .env file to the droplet. Would I manually just add it to the cloned repo on the droplet? Or scp my env to the droplet. Or some other way? I'm a bit new to this.
https://redd.it/1kk7sw8
@r_devops
Hello I currently have a next.js app and I'm currently deploying to digitalocean droplets using github actions, but I'm kind of confused on how to get my .env file to the droplet. Would I manually just add it to the cloned repo on the droplet? Or scp my env to the droplet. Or some other way? I'm a bit new to this.
https://redd.it/1kk7sw8
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
What tool are you using for easy provisioning?
Hi, I am experimenting with self managed kubernetes cluster. Kubernetes is cool and all but the underlying servers where the pods run on still need to be provisioned and managed. I understand that terraform can create/manage the infra resources such as network, storage, vm etc. But for provisioning other tools such as Ansible is used. I am looking for an easy to use with web ui preferably to provision my servers.
https://redd.it/1kkaqdq
@r_devops
Hi, I am experimenting with self managed kubernetes cluster. Kubernetes is cool and all but the underlying servers where the pods run on still need to be provisioned and managed. I understand that terraform can create/manage the infra resources such as network, storage, vm etc. But for provisioning other tools such as Ansible is used. I am looking for an easy to use with web ui preferably to provision my servers.
https://redd.it/1kkaqdq
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community