What happed to the DevOps Paradox podcast?
The DevOps Paradox podcast is my favorite and they haven't done a show since February.
Does anyone know why??
https://redd.it/1k6ujiv
@r_devops
The DevOps Paradox podcast is my favorite and they haven't done a show since February.
Does anyone know why??
https://redd.it/1k6ujiv
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Exploring Serverless Stack Architecture – How Do You Manage Environments & Security?
Hey folks,
I’m experimenting with a serverless stack on AWS using S3 + CloudFront for static hosting, API Gateway + Lambda for backend, DynamoDB for data, and Cognito for auth.
It’s been great for learning, and I’m thinking ahead about how to scale and manage this more professionally.
Curious to hear from others:
* How do you structure environments (dev/staging/prod)? Separate accounts, or manage via IaC/tagging?
* Best practices for securing this kind of stack — IAM roles, access boundaries, etc.?
* Any underrated tools or AWS services that help you keep things maintainable and cost-effective?
Appreciate any insight — always looking to learn from real-world setups. Happy to share my setup later once it’s more polished.
https://redd.it/1k6sux8
@r_devops
Hey folks,
I’m experimenting with a serverless stack on AWS using S3 + CloudFront for static hosting, API Gateway + Lambda for backend, DynamoDB for data, and Cognito for auth.
It’s been great for learning, and I’m thinking ahead about how to scale and manage this more professionally.
Curious to hear from others:
* How do you structure environments (dev/staging/prod)? Separate accounts, or manage via IaC/tagging?
* Best practices for securing this kind of stack — IAM roles, access boundaries, etc.?
* Any underrated tools or AWS services that help you keep things maintainable and cost-effective?
Appreciate any insight — always looking to learn from real-world setups. Happy to share my setup later once it’s more polished.
https://redd.it/1k6sux8
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Best Practices for Horizontally Scaling a Dockerized Backend on a VM
I need advice on scaling a Dockerized backend application hosted on a Google Compute Engine (GCE) VM.
# Current Setup:
* Backend runs in Docker containers on a single GCE VM.
* Nginx is installed on the **same VM** to route requests to the backend.
* Monitoring via Prometheus/Grafana shows backend CPU usage spiking to **200%**, indicating severe resource contention.
# Proposed Solution and Questions:
1. **Horizontal Scaling Within the Same VM**:
* Is adding more backend containers to the same VM a viable approach? Since the VM’s CPU is already saturated, won’t this exacerbate resource contention?
* If traffic grows further, would scaling require adding more VMs regardless?
2. **Nginx Placement**:
* Should Nginx be decoupled from the backend VM to avoid resource competition (e.g., moving it to a dedicated VM or managed load balancer)?
3. **Alternative Strategies**:
* How would you architect this system for scalability?
https://redd.it/1k6x7tp
@r_devops
I need advice on scaling a Dockerized backend application hosted on a Google Compute Engine (GCE) VM.
# Current Setup:
* Backend runs in Docker containers on a single GCE VM.
* Nginx is installed on the **same VM** to route requests to the backend.
* Monitoring via Prometheus/Grafana shows backend CPU usage spiking to **200%**, indicating severe resource contention.
# Proposed Solution and Questions:
1. **Horizontal Scaling Within the Same VM**:
* Is adding more backend containers to the same VM a viable approach? Since the VM’s CPU is already saturated, won’t this exacerbate resource contention?
* If traffic grows further, would scaling require adding more VMs regardless?
2. **Nginx Placement**:
* Should Nginx be decoupled from the backend VM to avoid resource competition (e.g., moving it to a dedicated VM or managed load balancer)?
3. **Alternative Strategies**:
* How would you architect this system for scalability?
https://redd.it/1k6x7tp
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Procore Technologies
I have cleared my rounds at Procore Technologies, if any of you guys are working in the company or have worked previously please let me know the work culture.
https://redd.it/1k6x1r8
@r_devops
I have cleared my rounds at Procore Technologies, if any of you guys are working in the company or have worked previously please let me know the work culture.
https://redd.it/1k6x1r8
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Manager said “that doesn’t make any sense!”
…to which I reply: “well neither does me driving into the office every day to do a job I can literally do from anywhere with an Internet connection but here I am”
https://redd.it/1k70np7
@r_devops
…to which I reply: “well neither does me driving into the office every day to do a job I can literally do from anywhere with an Internet connection but here I am”
https://redd.it/1k70np7
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Have only worked in Jenkins, Git, Docker and Linux as DevOps Engineer– What all Skills Should I Learn as DevOps to Get Hired? Can't find jobs in Naukri for this
I’ve worked in DevOps using these: Jenkins, Git, and Linux, but in Job Portals like Linkedin, Naukri I am not seeing job openings that match just these skills.
What should I focus on learning next to actually get hired?
https://redd.it/1k70qjb
@r_devops
I’ve worked in DevOps using these: Jenkins, Git, and Linux, but in Job Portals like Linkedin, Naukri I am not seeing job openings that match just these skills.
What should I focus on learning next to actually get hired?
https://redd.it/1k70qjb
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Simplecontainer.io
In the past few months, I've been developing an orchestration platform to improve the experience of managing Docker deployments on VMs. It operates atop the container engine and takes over orchestration. It supports GitOps and plain old apply. The engine is open sourced.
Apart from the terminal CLI, I've also created a sleek UI dashboard to further ease the management. Dashboard is available as an app https://app.simplecontainer.io and can be used as it is. It is also possible to deploy the dashboard on-premises.
The dashboard can be a central platform to manage operations for multiple projects. Contexts are a way to authenticate against the simplecontainer node and can be shared with other users via organizations. The manager could choose which context is shared with which organization.
On the security side, the dashboard acts as a proxy, and no information about access is persisted on the app. Also, everywhere mTLS and TLS.
Demos on how to use the platform + dashboard can be found at:
- https://app.simplecontainer.io/demos/gitops
- https://app.simplecontainer.io/demos/declarative
Photos of container and gitops dashboards are attached. Currently it is alpha and sign ups will be opened soon. Interested in what you guys think and if someone wants to try it out you can hit me up in DM for more info.
https://redd.it/1k72nb3
@r_devops
In the past few months, I've been developing an orchestration platform to improve the experience of managing Docker deployments on VMs. It operates atop the container engine and takes over orchestration. It supports GitOps and plain old apply. The engine is open sourced.
Apart from the terminal CLI, I've also created a sleek UI dashboard to further ease the management. Dashboard is available as an app https://app.simplecontainer.io and can be used as it is. It is also possible to deploy the dashboard on-premises.
The dashboard can be a central platform to manage operations for multiple projects. Contexts are a way to authenticate against the simplecontainer node and can be shared with other users via organizations. The manager could choose which context is shared with which organization.
On the security side, the dashboard acts as a proxy, and no information about access is persisted on the app. Also, everywhere mTLS and TLS.
Demos on how to use the platform + dashboard can be found at:
- https://app.simplecontainer.io/demos/gitops
- https://app.simplecontainer.io/demos/declarative
Photos of container and gitops dashboards are attached. Currently it is alpha and sign ups will be opened soon. Interested in what you guys think and if someone wants to try it out you can hit me up in DM for more info.
https://redd.it/1k72nb3
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Help Tool for managing helm charts
Hey everyone, current flow is keel,helm,github actions on gke.
We have a chart per app (unsustainable I know) and values file per environment. I am working on cutting down the chart number to be per application type.
Meanwhile I wanted to see if anyone came across an open source or paid tool that allows for helm chart management like a catalog. Where we could for example make env var changes to a selected number of charts and redeploy them all.
If this doesn’t exist i will probably have to write it in ruyaml myself,which I don’t want to
https://redd.it/1k6wnpm
@r_devops
Hey everyone, current flow is keel,helm,github actions on gke.
We have a chart per app (unsustainable I know) and values file per environment. I am working on cutting down the chart number to be per application type.
Meanwhile I wanted to see if anyone came across an open source or paid tool that allows for helm chart management like a catalog. Where we could for example make env var changes to a selected number of charts and redeploy them all.
If this doesn’t exist i will probably have to write it in ruyaml myself,which I don’t want to
https://redd.it/1k6wnpm
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
AI Agents real life usage
I am looking for real life examples of people using AI Agents in their daily DevOps tasks. I know that RooCode for example is useful to generate IaC code or scripts but I am looking for examples that go beyond the "code generation" tasks.
Any experience you guys would like to share?
https://redd.it/1k79u9a
@r_devops
I am looking for real life examples of people using AI Agents in their daily DevOps tasks. I know that RooCode for example is useful to generate IaC code or scripts but I am looking for examples that go beyond the "code generation" tasks.
Any experience you guys would like to share?
https://redd.it/1k79u9a
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Tailpipe - The Log Interrogation Game Changer
SQL has been the data access standard for decades, it levels the playing field, easily integrates with other systems and accelerates delivery. So why not leverage it for things other than the database, like querying APIs and Cloud services? Tailpipe follows along the same lines, this time by enabling SQL to query log files.
https://www.i-programmer.info/news/90-tools/17992-tailpipe-the-log-interrogation-game-changer.html
https://redd.it/1k7cbpm
@r_devops
SQL has been the data access standard for decades, it levels the playing field, easily integrates with other systems and accelerates delivery. So why not leverage it for things other than the database, like querying APIs and Cloud services? Tailpipe follows along the same lines, this time by enabling SQL to query log files.
https://www.i-programmer.info/news/90-tools/17992-tailpipe-the-log-interrogation-game-changer.html
https://redd.it/1k7cbpm
@r_devops
I Programmer
Tailpipe - The Log Interrogation Game Changer
By using the expressiveness of the SQL language, TailPipe makes querying log files as easy as doing "select * from logs;".
Career Advice: Is it beneficial for a Software Engineer to study CCNA, MCSA, and MCSE?
I'm a software engineer considering studying CCNA, MCSA, and MCSE. Would these certifications give me any advantages? My goal is to work in system-related roles in the future
https://redd.it/1k7e2dx
@r_devops
I'm a software engineer considering studying CCNA, MCSA, and MCSE. Would these certifications give me any advantages? My goal is to work in system-related roles in the future
https://redd.it/1k7e2dx
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Where to take UI DevOps courses online here? Does anyone know where can take these courses?
Hi there, I'm looking for learn UI DevOps but just I see DevOps courses, so I was wondering if anyone knows any courses where I can find?
I appreciated your response!
https://redd.it/1k7eqzn
@r_devops
Hi there, I'm looking for learn UI DevOps but just I see DevOps courses, so I was wondering if anyone knows any courses where I can find?
I appreciated your response!
https://redd.it/1k7eqzn
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Journey from Windows admin to k8s
From training with PowerShell to deploying Kubernetes clusters — here’s how I made the leap and how you can too.
The Starting Point: A Windows-Centric Foundation
In 2021, I began my journey as an IT Specialist in System Integration. My daily tools were PowerShell, Azure, Microsoft Server, and Terraform. I spent 2–3 years mastering these technologies during my training, followed by a year as a Junior DevOps Engineer at a company with around 1,000 employees, including a 200-person IT department. My role involved managing infrastructure, automating processes, and working with cloud technologies like Azure.
The Turning Point: Embracing a New Tech Stack
In January 2025, I made a significant career move. I transitioned from a familiar Windows-based environment to a new role that required me to work with macOS, Linux, Kubernetes (K8s), Docker, AWS, OTC Cloud, and the Atlassian Suite. This shift was both challenging and exhilarating.
The Learning Curve: Diving into New Technologies
Initially, I focused on Docker, Bash, and Kubernetes, as these tools were central to the new infrastructure. Gradually, I built on that foundation and delved deeper into the material.
A major milestone was taking on the role of project lead for a migration project for the Atlassian Suite. Our task was to transition the entire team and workflows to tools like Jira and Confluence. This experience allowed me to delve deep into software development and project management processes, highlighting the importance of choosing the right tools to improve team collaboration and communication.
Building Infrastructure: Hands-On Experience
I set up my own K3s cluster on a Proxmox host using Ansible and integrated ArgoCD to automate continuous delivery (CD). This process demonstrated the power of Kubernetes in managing containerized applications and the importance of a well-functioning CI/CD pipeline.
Additionally, I created five Terraform modules, including a network module, for the OTC Cloud. This opportunity allowed me to dive deeper into cloud infrastructure, ensuring everything was designed and built correctly. Terraform helped automate the infrastructure while adhering to best practices.
Optimizing Pipelines: Integrating AWS and Cloudflare
I worked on optimizing existing pipelines running in Bamboo, focusing on integrating AWS and Cloudflare. Adapting Bamboo to work seamlessly with our cloud infrastructure was an interesting challenge. It wasn’t just about automating build and deployment processes; it was about optimizing and ensuring the smooth flow of these processes to enhance team efficiency.
Embracing Change: Continuous Learning and Growth
Since joining this new role, I’ve learned a great deal and grown both professionally and personally. I’m taking on more responsibility and continuously growing in different areas. Optimizing pipelines, working with new technologies, and leading projects motivate me every day. I appreciate the challenge and look forward to learning even more in the coming months.
Lessons Learned and Tips for Aspiring DevOps Engineers
Start with the Basics: Familiarize yourself with core technologies like Docker, Bash, and Kubernetes.
Hands-On Practice: Set up your own environments and experiment with tools.
Take on Projects: Lead initiatives to gain practical experience.
Optimize Existing Systems: Work on improving current processes and pipelines.
Embrace Continuous Learning: Stay updated with new technologies and best practices.
Stay Connected
I’ll be regularly posting about my homelab and experiences with new technologies. Stay tuned — there’s much more to explore!
Inspired by real-world experiences and industry best practices, this blog aims to provide actionable insights for those looking to transition into DevOps roles. Check also my dev blog for more write ups and homelabbing content:
https://salad1n.dev/
https://redd.it/1k7fc50
@r_devops
From training with PowerShell to deploying Kubernetes clusters — here’s how I made the leap and how you can too.
The Starting Point: A Windows-Centric Foundation
In 2021, I began my journey as an IT Specialist in System Integration. My daily tools were PowerShell, Azure, Microsoft Server, and Terraform. I spent 2–3 years mastering these technologies during my training, followed by a year as a Junior DevOps Engineer at a company with around 1,000 employees, including a 200-person IT department. My role involved managing infrastructure, automating processes, and working with cloud technologies like Azure.
The Turning Point: Embracing a New Tech Stack
In January 2025, I made a significant career move. I transitioned from a familiar Windows-based environment to a new role that required me to work with macOS, Linux, Kubernetes (K8s), Docker, AWS, OTC Cloud, and the Atlassian Suite. This shift was both challenging and exhilarating.
The Learning Curve: Diving into New Technologies
Initially, I focused on Docker, Bash, and Kubernetes, as these tools were central to the new infrastructure. Gradually, I built on that foundation and delved deeper into the material.
A major milestone was taking on the role of project lead for a migration project for the Atlassian Suite. Our task was to transition the entire team and workflows to tools like Jira and Confluence. This experience allowed me to delve deep into software development and project management processes, highlighting the importance of choosing the right tools to improve team collaboration and communication.
Building Infrastructure: Hands-On Experience
I set up my own K3s cluster on a Proxmox host using Ansible and integrated ArgoCD to automate continuous delivery (CD). This process demonstrated the power of Kubernetes in managing containerized applications and the importance of a well-functioning CI/CD pipeline.
Additionally, I created five Terraform modules, including a network module, for the OTC Cloud. This opportunity allowed me to dive deeper into cloud infrastructure, ensuring everything was designed and built correctly. Terraform helped automate the infrastructure while adhering to best practices.
Optimizing Pipelines: Integrating AWS and Cloudflare
I worked on optimizing existing pipelines running in Bamboo, focusing on integrating AWS and Cloudflare. Adapting Bamboo to work seamlessly with our cloud infrastructure was an interesting challenge. It wasn’t just about automating build and deployment processes; it was about optimizing and ensuring the smooth flow of these processes to enhance team efficiency.
Embracing Change: Continuous Learning and Growth
Since joining this new role, I’ve learned a great deal and grown both professionally and personally. I’m taking on more responsibility and continuously growing in different areas. Optimizing pipelines, working with new technologies, and leading projects motivate me every day. I appreciate the challenge and look forward to learning even more in the coming months.
Lessons Learned and Tips for Aspiring DevOps Engineers
Start with the Basics: Familiarize yourself with core technologies like Docker, Bash, and Kubernetes.
Hands-On Practice: Set up your own environments and experiment with tools.
Take on Projects: Lead initiatives to gain practical experience.
Optimize Existing Systems: Work on improving current processes and pipelines.
Embrace Continuous Learning: Stay updated with new technologies and best practices.
Stay Connected
I’ll be regularly posting about my homelab and experiences with new technologies. Stay tuned — there’s much more to explore!
Inspired by real-world experiences and industry best practices, this blog aims to provide actionable insights for those looking to transition into DevOps roles. Check also my dev blog for more write ups and homelabbing content:
https://salad1n.dev/
https://redd.it/1k7fc50
@r_devops
salad1n
Home
Minimal Jekyll theme for storytellers
Making Sense of Cloud Spend
Hey y'all.. Wrote an article on sharing some throughts on Cloud Spend
https://medium.com/@mfundo/diagnosing-the-cloud-cost-mess-fe8e38c62bd3
https://redd.it/1k7eabl
@r_devops
Hey y'all.. Wrote an article on sharing some throughts on Cloud Spend
https://medium.com/@mfundo/diagnosing-the-cloud-cost-mess-fe8e38c62bd3
https://redd.it/1k7eabl
@r_devops
Medium
Making Sense of Cloud Spend
Thoughts on why cloud spend drifts, and what to do about it.
Devops workflow tips for a frontend application developer who needs to take on more ops responsibilities.
What is an efficient workflow/work environment setup to tackle an ops task that involves a Github 'Action', and a Bitrise build 'Workflow'.
I've written the GitHub Action as a bash script, and the Bitrise Workflow is a collection of pluggable Bitrise 'Steps' and some custom scripts in the repository that are triggered from the Bitrise Workflow.
The GitHub Action responds to the creation of a new tag with a name that matches, and the Bitrise Workflow runs build tasks that call our backend REST API for dynamic configuration specifics.
I find working on the ops stuff outside the monorepo slow and inefficient.
* Re-running scripts on remote machines/services is slower (I run the service using their local client to debug, but it's difficult to replicate the VM environment accurately in my local machine)
* They often break because I miss mistakes in the bash scripts (don't have editor/language based tools to help me here)
* The cloud based builds need time to execute because the VMs need to setup everything every time (I've cached some stuff but not all)
**Can I please get some tips on how to work more efficiently when working on processes that are distributed across systems?**
For context, I'm usually a frontend app developer and I've set up our monorepo to make our lives as easy as possible:
* Typed language (TS) and linter so we can see our errors in the editor as we work
* automated unit test runner with a 'watcher' that runs on 'save' to make sure our application logic doesn't get broken
* integrated testing pipeline that runs upon creation of pull requests
* hot module reloading so that we can visually see the results of our latests changes
* separation of presentational components and application logic with strict architectural guidelines to keep things modular
* monorepo tooling with task-runner to enable the above
**What are some devops techniques to achieve the same type of workflow efficiencies when configuring processes that run across distributed systems?**
I suspect that I need to look into:
* Modularizing logic into independent scripts
* Containers?
Anything else?
https://redd.it/1k7he32
@r_devops
What is an efficient workflow/work environment setup to tackle an ops task that involves a Github 'Action', and a Bitrise build 'Workflow'.
I've written the GitHub Action as a bash script, and the Bitrise Workflow is a collection of pluggable Bitrise 'Steps' and some custom scripts in the repository that are triggered from the Bitrise Workflow.
The GitHub Action responds to the creation of a new tag with a name that matches, and the Bitrise Workflow runs build tasks that call our backend REST API for dynamic configuration specifics.
I find working on the ops stuff outside the monorepo slow and inefficient.
* Re-running scripts on remote machines/services is slower (I run the service using their local client to debug, but it's difficult to replicate the VM environment accurately in my local machine)
* They often break because I miss mistakes in the bash scripts (don't have editor/language based tools to help me here)
* The cloud based builds need time to execute because the VMs need to setup everything every time (I've cached some stuff but not all)
**Can I please get some tips on how to work more efficiently when working on processes that are distributed across systems?**
For context, I'm usually a frontend app developer and I've set up our monorepo to make our lives as easy as possible:
* Typed language (TS) and linter so we can see our errors in the editor as we work
* automated unit test runner with a 'watcher' that runs on 'save' to make sure our application logic doesn't get broken
* integrated testing pipeline that runs upon creation of pull requests
* hot module reloading so that we can visually see the results of our latests changes
* separation of presentational components and application logic with strict architectural guidelines to keep things modular
* monorepo tooling with task-runner to enable the above
**What are some devops techniques to achieve the same type of workflow efficiencies when configuring processes that run across distributed systems?**
I suspect that I need to look into:
* Modularizing logic into independent scripts
* Containers?
Anything else?
https://redd.it/1k7he32
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
ServerlessDays Belfast 2025 – “Serverless is Serving” (Thursday 15th May)
Hey folks 👋
We’re excited to announce that ServerlessDays Belfast is back for 2025! Mark your calendars for Thursday 15th May, and get ready for a full day of talks, learning, and networking—all centered around building confidently and excellently with serverless technologies.
📍 Venue: The stunning Drawing Offices at Titanic Hotel Belfast
🎯 Theme: Serverless is Serving – building with confidence and excellence
🎟 Tickets: £60 (includes breakfast, lunch, and snacks!)
Group discounts available!
This year’s focus is all about how serverless empowers developers, teams, and communities by removing the ops overhead and letting us focus on delivering real value. Whether you're a seasoned cloud engineer or just curious about getting started with serverless, this event is for you.
Expect talks from local and international speakers, including Patrick Debois, the father/grandfather of DevOps! Expect real-world stories, innovative builds, and practical techniques that show how far we’ve come since the early days of serverless. It’s not just about infra anymore—it’s about service.
🙌 A massive shoutout to our sponsors for making this possible: AWS, EverQuote, and G-P
👥 Proudly organised by volunteers from AWS, G-P, Kainos, Liberty IT, Workrise, Rapid7, EverQuote, and The Serverless Edge.
Come for the talks, stay for the community.
💻 More info & tickets: https://serverlessdaysbelfast.com/
Got questions? Drop them below.
Hope to see you there!
https://redd.it/1k7h8ug
@r_devops
Hey folks 👋
We’re excited to announce that ServerlessDays Belfast is back for 2025! Mark your calendars for Thursday 15th May, and get ready for a full day of talks, learning, and networking—all centered around building confidently and excellently with serverless technologies.
📍 Venue: The stunning Drawing Offices at Titanic Hotel Belfast
🎯 Theme: Serverless is Serving – building with confidence and excellence
🎟 Tickets: £60 (includes breakfast, lunch, and snacks!)
Group discounts available!
This year’s focus is all about how serverless empowers developers, teams, and communities by removing the ops overhead and letting us focus on delivering real value. Whether you're a seasoned cloud engineer or just curious about getting started with serverless, this event is for you.
Expect talks from local and international speakers, including Patrick Debois, the father/grandfather of DevOps! Expect real-world stories, innovative builds, and practical techniques that show how far we’ve come since the early days of serverless. It’s not just about infra anymore—it’s about service.
🙌 A massive shoutout to our sponsors for making this possible: AWS, EverQuote, and G-P
👥 Proudly organised by volunteers from AWS, G-P, Kainos, Liberty IT, Workrise, Rapid7, EverQuote, and The Serverless Edge.
Come for the talks, stay for the community.
💻 More info & tickets: https://serverlessdaysbelfast.com/
Got questions? Drop them below.
Hope to see you there!
https://redd.it/1k7h8ug
@r_devops
ServerlessDays Belfast
One Day. One Track. One Community
Setting up DevOps pipelines is my worst nightmare
Sorry for the rant, but I need to let off some steam. I’ve been building and running cloud stacks for some years now, and it still amazes me how terrible the whole process is—no matter the provider.
You’ve got your application, you start fresh with a new template and a new cloud account (clients finally wants to migrate to the cloud). You set up your CI/CD pipeline, and the goal is to have it provision your resources in the end. You write your first draft, push it, wait for builds/tests/linting/etc... and then it hits the final step: deployment. And italways fails.
Something's broken. You missed a dependency. The runner or the deployment principal doesn’t have the right set of permissions. No one can tell you exactly what permissions your final principal needs. So you enter this endless loop of trial and error. You could skip some of that by just granting full admin rights—but who wants to do that?
Resources get created, the deployment fails but fails to clean up properly. You need to manually delete things. But wait—some resources depend on others, so you can’t delete X before Y is gone. Meanwhile, your stack is a half-broken mess, and you're deep in a cloud console trying to figure out which dangling part is blocking the cleanup.
Hours gone. Again.
You feel like you’re so close every time—just one last permission tweak, one last missing variable... but wait, are those variables even passed correctly from the CI template to the container to the deployment script?
Error messages? Super cryptic. “Something failed while deploying your stack.” Thanks. “mysql password requirements not met.” Wait—there are password requirements? Where’s that documented? Oh, it’s not in the main docs. It’s in one of the five different documentation sets—SDKs, CLI tools, Terraform providers, custom template languages... each with just enough difference to make you scream.
And the worst part? I love cloud-native development. I’m a big fan of serverless, and I genuinely believe in infrastructure-as-code. Once it’s up and running, it’s amazing. But getting there? It still feels outdated, clunky, and overly complex. It’s the opposite of intuitive.
I’m used to fast (almost instant) feedback loops when developing applications on my local machine. AI tools give me huge productivity boost. But CI/CD? It’s still “make a change, wait minutes (or hours), get an error, repeat.” It kills motivation.
And don’t even get me started on the environmental cost of spinning up and tearing down all these failed resources, countless hours of pipeline runs that fail on the last step - deploy...
Anyway, rant over. Just had to vent because this cycle has been getting to me. Same problems across AWS, Azure, GCP. Anyone else feeling this pain? Got any strategies to make it suck less?
https://redd.it/1k7j1bv
@r_devops
Sorry for the rant, but I need to let off some steam. I’ve been building and running cloud stacks for some years now, and it still amazes me how terrible the whole process is—no matter the provider.
You’ve got your application, you start fresh with a new template and a new cloud account (clients finally wants to migrate to the cloud). You set up your CI/CD pipeline, and the goal is to have it provision your resources in the end. You write your first draft, push it, wait for builds/tests/linting/etc... and then it hits the final step: deployment. And italways fails.
Something's broken. You missed a dependency. The runner or the deployment principal doesn’t have the right set of permissions. No one can tell you exactly what permissions your final principal needs. So you enter this endless loop of trial and error. You could skip some of that by just granting full admin rights—but who wants to do that?
Resources get created, the deployment fails but fails to clean up properly. You need to manually delete things. But wait—some resources depend on others, so you can’t delete X before Y is gone. Meanwhile, your stack is a half-broken mess, and you're deep in a cloud console trying to figure out which dangling part is blocking the cleanup.
Hours gone. Again.
You feel like you’re so close every time—just one last permission tweak, one last missing variable... but wait, are those variables even passed correctly from the CI template to the container to the deployment script?
Error messages? Super cryptic. “Something failed while deploying your stack.” Thanks. “mysql password requirements not met.” Wait—there are password requirements? Where’s that documented? Oh, it’s not in the main docs. It’s in one of the five different documentation sets—SDKs, CLI tools, Terraform providers, custom template languages... each with just enough difference to make you scream.
And the worst part? I love cloud-native development. I’m a big fan of serverless, and I genuinely believe in infrastructure-as-code. Once it’s up and running, it’s amazing. But getting there? It still feels outdated, clunky, and overly complex. It’s the opposite of intuitive.
I’m used to fast (almost instant) feedback loops when developing applications on my local machine. AI tools give me huge productivity boost. But CI/CD? It’s still “make a change, wait minutes (or hours), get an error, repeat.” It kills motivation.
And don’t even get me started on the environmental cost of spinning up and tearing down all these failed resources, countless hours of pipeline runs that fail on the last step - deploy...
Anyway, rant over. Just had to vent because this cycle has been getting to me. Same problems across AWS, Azure, GCP. Anyone else feeling this pain? Got any strategies to make it suck less?
https://redd.it/1k7j1bv
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Off shore DevOps engineers - possible?
I started working in tech support for an American company, but I work from Europe remotely.
The possibilities of getting into DevOps from support are real, there are several people who started in support and then moved to DevOps / SRE / QA.
During interviews, the hiring manager told me that in 1.5 to 2 years it is expected for me to become either a senior supporter or move into engineering.
What wasn’t clear to me at that time, is that only customer care teams (including mine) are spread in different time zones. All engineers are in USA or similar time zones (south/central America), including DevOps team.
I am trying to understand if progression into engineering is actually possible or I have been lied to.
In support we have a global “standup” that is at 17 for the EU and 8/9 for the American supporters (they are all in east coast).
But, the devOps team is made of 3/4 folks, al in USA. They seem to have a short stand up at my 18:30 and then other meetings later.
So the question for you: say you are in a 4 people DevOps team. Would you consider a person working mostly asynchronously in your team? I wanted to trust my hiring manager at first, but my hopes went low when I realized that all the engineers are in USA time. I find it hard to believe that DevOps engineers would be up to change their times for me, and that things could work given also the need to sync and meet with developers, SREs etc in real time.
The support team uses a “follow the sun” strategy while devOps (and all engineering teams) doesn’t seem to use, want or need different hours coverage. I would also be a junior devops if I was to move into their team, so possibly in need of mentoring.
I also am very new and already asking my manager questions about leaving the team doesn’t feel right.
So.. am I cooked and have I been lied to by the hiring manager so that they could fill the position asap?
https://redd.it/1k7ioaa
@r_devops
I started working in tech support for an American company, but I work from Europe remotely.
The possibilities of getting into DevOps from support are real, there are several people who started in support and then moved to DevOps / SRE / QA.
During interviews, the hiring manager told me that in 1.5 to 2 years it is expected for me to become either a senior supporter or move into engineering.
What wasn’t clear to me at that time, is that only customer care teams (including mine) are spread in different time zones. All engineers are in USA or similar time zones (south/central America), including DevOps team.
I am trying to understand if progression into engineering is actually possible or I have been lied to.
In support we have a global “standup” that is at 17 for the EU and 8/9 for the American supporters (they are all in east coast).
But, the devOps team is made of 3/4 folks, al in USA. They seem to have a short stand up at my 18:30 and then other meetings later.
So the question for you: say you are in a 4 people DevOps team. Would you consider a person working mostly asynchronously in your team? I wanted to trust my hiring manager at first, but my hopes went low when I realized that all the engineers are in USA time. I find it hard to believe that DevOps engineers would be up to change their times for me, and that things could work given also the need to sync and meet with developers, SREs etc in real time.
The support team uses a “follow the sun” strategy while devOps (and all engineering teams) doesn’t seem to use, want or need different hours coverage. I would also be a junior devops if I was to move into their team, so possibly in need of mentoring.
I also am very new and already asking my manager questions about leaving the team doesn’t feel right.
So.. am I cooked and have I been lied to by the hiring manager so that they could fill the position asap?
https://redd.it/1k7ioaa
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Anyone looking for a part-time devops/consultant with previous startup experience?
Hi,
If there’s anyone US or UK based that’s looking for a part time devops, I would be open for discussion.
I was a part of several startups, one of which skyrocketed and got acquired, and the other made it to some big fancy investments. We can talk more if anyone is interested in having me on their team.
I’m open to both engineering and consulting.
Best regards!
https://redd.it/1k7km7v
@r_devops
Hi,
If there’s anyone US or UK based that’s looking for a part time devops, I would be open for discussion.
I was a part of several startups, one of which skyrocketed and got acquired, and the other made it to some big fancy investments. We can talk more if anyone is interested in having me on their team.
I’m open to both engineering and consulting.
Best regards!
https://redd.it/1k7km7v
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Looking for DevOps feedback
Hey all, I'm a developer @ Korbit AI and I was hoping to get some feedback from QA / Dev Ops engineers as to how we can make our reviews even more useful for this specific type of focus.
Currently we focus on these 8 categories: Functionality, Security, Performance, Error Handling, Readability, Logging, Design and Documentation.
My question is, as a dev ops engineer / qa, what are specific types of things our reviews can really focus on to help save time in this particular subject. We're planning on releasing a new feature called Korbit Policies, where you are able to tell Korbit specific things to flag ( example is like refactoring from one class to another and enforcing usage ).
Let me know and thank you in advanced.
https://redd.it/1k7r4n1
@r_devops
Hey all, I'm a developer @ Korbit AI and I was hoping to get some feedback from QA / Dev Ops engineers as to how we can make our reviews even more useful for this specific type of focus.
Currently we focus on these 8 categories: Functionality, Security, Performance, Error Handling, Readability, Logging, Design and Documentation.
My question is, as a dev ops engineer / qa, what are specific types of things our reviews can really focus on to help save time in this particular subject. We're planning on releasing a new feature called Korbit Policies, where you are able to tell Korbit specific things to flag ( example is like refactoring from one class to another and enforcing usage ).
Let me know and thank you in advanced.
https://redd.it/1k7r4n1
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
API Sprawl - issue for you or na?
Do y'alls bosses see API sprawl as a real problem? Or is just your problem? We need more discoverability for our APIs for sure, too many people doing too many things off in the corner. But I also need to make sure my boss sees it as a legit issue so that I can do something about it.
https://redd.it/1k7w38d
@r_devops
Do y'alls bosses see API sprawl as a real problem? Or is just your problem? We need more discoverability for our APIs for sure, too many people doing too many things off in the corner. But I also need to make sure my boss sees it as a legit issue so that I can do something about it.
https://redd.it/1k7w38d
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community