Reddit DevOps
266 subscribers
30.9K links
Reddit DevOps. #devops
Thanks @reddit2telegram and @r_channels
Download Telegram
I ruined a POC

Been a DevOps from 4.5 years. Started from Linux administrator and now I'm managing cloud, db and container orchestration. So my manager asked me to do a POC on traefik which is a reverse proxy just like nginix. I did well, explored the features but was unable to implement fail2ban plugin in it. When I was presenting the same to my manager, i forgot basic docker compose syntax and now I think my role is in jeopardy. Anyone else faced this? Motivate me please, I'm scared.

https://redd.it/1kw0o9g
@r_devops
What’s the best SSO solution for a +50 mid-sized company in 2025?

Curious to hear what the DevOps community is seeing work best today.

For companies with \~50–200 employees, minimal internal IT, and tools like GitHub, Gmail, Vault, AWS, and Graylog — what are your go-to SSO solutions?

Looking for feedback on:

Ease of integration (SAML/OIDC)
Multi-IDP support
Support for SCIM provisioning
Transparent, scalable pricing (no bloated enterprise overhead)
Good developer experience

Here’s a list I often see in conversations:

Azure AD (Entra ID)
[Keycloak](https://www.keycloak.org/)
Authentik
[WorkOS](https://workos.com/)
SSOJet

Would love to hear your experience with any of these or other favorites — especially across multi-tenant or external user auth use cases.

https://redd.it/1kw0uvh
@r_devops
Docker images works fine on local but not on gcp.

Hi everyone,

I’m running a Docker image with an old Ruby version on Debian. It works locally with Docker Compose, but fails with “Service Unavailable” on GCP Cloud Run. The issue seems to be incompatibility with the latest Ubuntu version used in the infra.

I can’t upgrade Ruby due to legacy constraints—we’re rewriting it in another language. Any suggestions for getting this to run on Cloud Run as-is?

Thanks!



https://redd.it/1kw0fpi
@r_devops
Multi-stage release pipeline, how to require one approval from each of two separate groups?

Hi all I am trying to implement a release pipeline using Azure DevOps and using yaml.

I have a requirement where two groups need to manually approve a release. At least one person per group must approve. So I deploy to an environment like `staging` or `prod`, but before deployment I want a manual approval gate where at least one person from `group a` and at least one person from `group b` need to manually approve.

I want to avoid using the Classic Release UI as I want the whole process to be code-defined in yaml.

I have tried looking at yaml definition but I did not get very far, to be honest if I could version control groups here that would be a really nice feature. Using ManualValidation@0 in yaml sounded interesting but given that anyone can approve and no concept of groups as far as I can tell so this is out of the question.

I have tried looking into `environments` with approval checks but Azure DevOps only supports assigning a single group to an environment’s approval gate. That doesn't seem to allow me to enforce the "one per group" logic.

I came across the idea of using two environments per stage eg `staging-group-a` and `staging-group-b`. I was also thinking to have two representatives for the workflow and let them defer approval if necessary. Both options sound clunky and I think I prefer the latter one the most.

Is there a simple way to solve this problem? It feels more complicated than it has to be.

https://redd.it/1kw5khg
@r_devops
AWS Native macOS App

I'm a huge infrastructure dev and love working in AWS. But I absolutely hate the UI, and I think it turns a lot of people off by making it seem to complicated.

I'm curious what folks think about a UI on top of AWS. I've been working on a project in the background and curious if others feel similarly or this is just me. Not sure the best way to share pics

I love native apps, so building it as a macOS app to start.

Edit: posted a Imgur link in the comments

https://redd.it/1kw7sk4
@r_devops
Can someone please show me a better way to find related resources in Kubernetes?

I know this problem is solved, I just don't want to go on google and try a few specific tools and I want to find a good tool that:


Allows me to link my deployments to github repositories and show me what services are connected to other services or resources (eg databases)

I want to know the tables of the database and the data models and contracts so I can focus on my features/testing rather than going through loads of microservice repositories

https://redd.it/1kw7s9o
@r_devops
Enterprise application requirements management?

Hi all,

My team manage over 100 applications and requirements management hasn't been a strong suit in the past.

What business-facing processes and systems would be considered best practice to manage current and in-development functional and non-functional requirements/stories in an Enterprise?

We can maintain product backlogs in a SDLC process, but for large initiatives/projects, we have PMs that often create new Azure DevOps or Jira projects and end up with a de-centralised list of requirements to link test cases to.

I want transparency and collaboration with the various product owners in our organisation to help maintain a central list of requirements that we can establish test cases against and refer to it when needed for root-cause analysis and change management.



https://redd.it/1kwa545
@r_devops
Thinking of transition into devops from QA .

Hi all, I'm currently working in QA with 5 years of experience and considering a transition into DevOps. Is DevOps a good long-term career option? Will I be treated as a fresher after switching? Also, is it possible to get DevOps roles at entry level with self-learning and certifications?"

https://redd.it/1kweawt
@r_devops
Confused and struggling on a project for learning

So I am studying about the DevOps and azure. And I want to make a project on 3 tier application deployment. And I wanted to use App gateway, app service, database. But I can't get my head around it. I learned these services, now it's time to connect them.

But I'm confused on application code, how they will deploy on each app service, what are best practices.

Somebody guide me in details so I can have confidence and create this project for better learning!

https://redd.it/1kwffpv
@r_devops
Building a SaaS for Generating CI/CD Pipelines for Legacy Enterprise Apps — Worth It?

Hi all,



I’m considering building a web-based SaaS that helps developers automatically generate CI/CD pipelines — specifically targeting legacy enterprise applications, like those built with J2EE.

The idea is to take a minimal project context (e.g., pom.xml/build.xml, framework type, deployment target), and generate a tailored GitHub Actions workflow (or other CI systems) that includes steps like building, testing, Dockerizing, and deploying the app.

While modern frameworks like Spring Boot and Quarkus get a lot of tooling love, J2EE and older enterprise stacks often get left behind. I’m wondering:

Is this a problem worth solving?
Would teams maintaining older Java systems actually pay for a tool like this?
How much CI/CD is still being written manually for legacy apps in 2025?
Should I broaden beyond J2EE to support more ecosystems from the start?

Happy to hear your thoughts, feedback, or if you’ve built something similar. Appreciate any input before I go too deep into MVP land.



Thanks!

https://redd.it/1kwf1de
@r_devops
Brief daily traffic spikes when downstream teams resist scaling

I have a pretty messy infrastructure. Every day at a specific time, we experience a traffic spike, and our service doesn't behave properly. More precisely, our downstream services aren't scaled well enough to handle that load. They're also reluctant to scale out, since doing so would mean being heavily over-scaled during the rest of the day. They are saying it's overkill to scale out just for a 1–2 minute spike in out service.

I see two possible solutions:

1. Push for scheduled scaling of the downstream services and ask them to scale out temporarily during our spike time to handle it. But the is a lot of bureaucracy in the company and provisioning new instances might require days of approval.
2. Add caching on our service level and cache responses from the downstream services, so we can use the cache as a fallback if those services are unavailable. But this feels like a hack to me as it introduces another failure point and just shifts the scaling issue from the downstream to the cache. Eventually, this will also hit a wall.

What do you think? Should I push for the first option or is the second good enough? Maybe there's a better way I’m not seeing? Queue is not an option as latency is very important for us

https://redd.it/1kwh0mg
@r_devops
Seeking Advice: How To Scale AI Models Without Huge Upfront Investment?

Hey folks,
Our startup is exploring AI-powered features but building and managing GPU clusters is way beyond our current budget and expertise. Are there good cloud services that provide ready-to-use AI models via API?Anyone here used similar “model APIs” to speed up AI deployment and avoid heavy infrastructure? Insights appreciated!q

https://redd.it/1kwha0l
@r_devops
Cloud/integrations asset inventory

Hello,

I have been using CloudQuery as a cloud asset inventory for more than a year now.
I use postgres as a destinations and I gave several systems reading from it several purposes, all of them part of our product.

I was asked to find a replacement, but haven’t found anything even remotely close in terms of quality and work done.
Steampipe is now for adhoc stuff, definitely not something I would integrate in my product, also it forces me to create a schema for the data.

Any ideas?


https://redd.it/1kwilbf
@r_devops
How do you keep learning when you’re burned out?

Lately I’ve been hitting a wall.

I want to keep learning new AWS stuff, CI/CD tools, maybe even try out some Kubernetes labs but I just don’t have the energy after work. every blog post feels overwhelming. Even watching a 10 min video feels like too much.

I used to be excited to dig into this stuff at night. Now I’m just tired.

Anyone else go through this?
How do you stay sharp without burning out?
Would love to hear how others recharge and keep growing.

https://redd.it/1kwjkcb
@r_devops
Work life as a Platform Engineer at PlayStation?

Hi DevOps Legends!

I’m interested in hearing about what a typical day looks like as a Platform Engineer at a large gaming company, especially those using Kubernetes on AWS at scale. I may have the opportunity to join a big player like PlayStation soon and wanted to get a sense of the good and the bad that comes with supporting infrastructure at that scale.

Currently, I’m coming from a small but high-functioning team at a startup, so I’m curious about the differences when working in a large organization.

Any tips, big do’s and don’ts, or insights on how you handle the challenges would be greatly appreciated!

TIA!

https://redd.it/1kwkdn2
@r_devops
Local testing of CI/CD Pipelines

Heya guys! First time poster, long time lurker. I've been a DevOps Engineer for roughly a year now, been doing DevOps "stuff" since my second year of apprenticeship, my main points are mostly CI/CD, automating, scripting, working with containers, etc ... but enough about that.

I've been wondering, is there a tool or an IDE extension to test your pipeline code locally or in some sort of environment? I'm working on Azure DevOps (I switched from GitLab when changing company) and this might be a me-problem but always committing your changes and then running your pipeline manually just to wait minutes for it to fail is dreading me sometimes. Built-in linters are nice but unfortunately it doesn't really check if my logic is working.

Thanks in advance!

https://redd.it/1kwkyr1
@r_devops
What must a DevOps engineer know?

I am a developer whose only experience with DevOps is:

1. Using GitHub Actions and its workflows for CI/CD
2. Maybe read a little about Jenkins
3. Know how to write automation scripts (e.g. shell, Python, Perl)

But certainly, still not enough to be a DevOps engineer.

So I am wondering what else must I know or be good at in order to qualify for a DevOps engineer job?

https://redd.it/1kwm89m
@r_devops
Stuck with Puppet at work - should I double down or focus on Ansible and modern IaC?

Hey guys,

I’m a DevOps engineer currently working in a company where everything is built with Puppet (configs, infra automation, the whole stack). I learned Ansible during my apprenticeship and liked it way more (felt cleaner and more readable), but in this new job, Puppet is the standard.
Puppet feels kinda outdated to me (syntax-heavy, more boilerplate, less momentum?), but maybe I’m missing something.

Now I’m wondering:
\- Is Puppet worth investing more time in, or is it a dying horse at this point?
\- Should I use my free time to sharpen my Ansible, or even move on to Terraform, Pulumi, etc.?

Thanks!

https://redd.it/1kwnrb4
@r_devops
Career Advice DevOps Internship Completed, Now Confused Between Certifications, Full-Time Job, or Higher Studies — Need Guidance

Hey everyone,

I could really use some advice right now.

I recently graduated and completed a 7-month internship in a DevOps role at a startup (6 months officially, 1 month extended). The experience was great — I learned a lot about cloud, CI/CD, monitoring, containerization, etc.

Now, here’s the situation:

My manager is suggesting that I complete three certifications —

CKA (Certified Kubernetes Administrator)
AZ-104 (Microsoft Azure Associate)
AWS Certified Solutions Architect – Associate

He mentioned that getting these would help me secure a full-time role.

Now I’m at a crossroads. I’m confused between:

Should I stay, do the certs, and hopefully get a full-time job?
Or should I look for jobs at other startups or companies that might offer better pay/growth?
Or should I consider going for higher education (MS) instead?

I’m not sure how valuable these certifications are in the current job market. Also, I’m unsure whether staying at a startup is the right long-term move.

Would love to hear from people who’ve been in a similar situation or are working in DevOps/Cloud roles.

TL;DR: Completed 7-month DevOps internship. Manager expects CKA + Azure + AWS certs for full-time job. Should I go for it, explore other job options, or pursue higher studies? Confused on what’s the best path.

Thanks in advance!

https://redd.it/1kwj6th
@r_devops
Are we just expected to be full stack engineers now?

Before we start, warning: this is a little bit of a rant.

Just got passed up for an interview because I was told the company thought I was "too junior." Which is complete bullshit. They said because I don't have any real world experience with their stack, they thought I wouldn't be a good fit.

The thing is, I have a LOAD of experience with the DevOps stack they want to use. They want to deploy a bunch of GraphQL servers as microservices and use Apollo federated graphql to manage all of them. Great. I have a lot of Kubernetes experience. I literally just passed the CKS last week to boot. They want to do it on Azure. Great again. I hate azure but I've been working on that the longest and know a bunch of the workarounds and annoying caveats to a lot of things. They even want to use Azure DevOps over GH actions which, boy, I've spent way too many sleepless nights attempting and succeeding in bending AZDO to my will to accept I'm anything less than a SME by this point. Add on the fact that, even though I've only been doing this for a few years, I've been working as a consultant making multiple different deployments for multiple different clients.

Their logic was since they had multiple guys with 10+ years of tenure on their team they needed someone with more experience with the actual tooling with Redis and GraphQL. And my mind just goes blank. You've got to be absolutely fucking kidding me.

It really doesn't take a JavaScript genius to figure out how to build a deploy a node app in kube. But you're gonna be really sore real quick if you think you can back fill a DevOps engineer position with a JavaScript guy.

https://redd.it/1kwrxts
@r_devops
Hi, how best to learn kubernetes and aws from both theoretical and practical stand points..

I have learned Linux as of now and I want to learn kubernetes and aws for certification to apply for and to get jobs.

Please help.

https://redd.it/1kwtcyp
@r_devops