Reddit DevOps
268 subscribers
2 photos
31K links
Reddit DevOps. #devops
Thanks @reddit2telegram and @r_channels
Download Telegram
What are your build and deploy pipelines like for database migration? Do you use a tool like SSDT, flyway or Liquibase?

My company is deciding on their CICD tools. Right now, they're choosing between flyway and SSDT. We're SQL Server only.

My experience is either hand coded deployment scripts or something only slightly fancier. A coworker wrote 2 powershell scripts: a build script and a deploy script. The build script just zips up all the files in a folder. That folder contains subfolder procs, function, views, custom scripts and such. All the scripts for table and index changes go in the custom scripts folder. The deploy script unzips them, runs the custom scripts in order and writes their names in a table in the database. It skips the scripts if the name already exists. Then it runs all the proc, function, view and such scripts. Works fine except it wrecks all the execution plans.

https://redd.it/1j3mdfd
@r_devops
Looking for DevOps tips in a mid-sized environment

Hey! My team (around 50 developers) is trying to adopt more mature DevOps practices. We currently use GitHub with a basic Jenkins setup, and we do manual deployments via SSH. Some pain points:

Slow, unpredictable releases.
Minimal automation (Docker is used for a few services only).
Dev and Ops aren’t fully aligned, causing workflow snags.
Testing vs. production setups don’t always match.

We’d love to hear how you structure DevOps in a similar-sized org, which CI/CD + infra-as-code tools you rely on, and any security best practices to bake in early. Thanks for any advice!

https://redd.it/1j3p74z
@r_devops
Longtime DevOps Engineer looking for career advice

Hey everyone, longtime lurker here looking for some career advice.

I was someone who was always knew I wanted my career to be something involving technology since that is what I've always been passionate about. Originally wanted to go for a computer science degree but because my love/hate relationship with math, decided against that and went the route of MIS. After graduation, ended up getting hired at a big body shop. Spent a year studying Oracle and other technologies but ultimately did not wind up landing any interviews with clients so after a year I quit.

Spent the next few years building my skills with web development with Python and getting familiar with AWS cloud while working on a few projects for a family member who works in the industry. One of those projects was building out a lab management tool for a NoSQL database company where this family member had previously worked at a high level.

Finally got a break and landed a DevOps role with an oil and gas software company. It was mostly CI/CD b*tch work along mixed with Windows support. I also touched Configuration Management tools such as Ansible and Chef for the first time in this role. I loved my team and felt like I grew a lot through this experience but it was definitely a dead end and several folks were leaving for other companies or back to their previous employers because of these factors. Ended up quitting because it got so depressing without having anything else lined up.

This turned out to be a good decision. I found myself at a fortune 500 tech company shortly after and again up-leveled substantially getting to touch all the cool tech I'd wanted to since that last role, mainly Kubernetes. Was the primary person responsible for developing and maintaining a custom provisioning API for internal services. I had a lot of fun working on this project and even got the chance to mentor junior engineers which was very rewarding. But after my manager and other people on my team ended up leaving, I finally quit because it was clear the future wasn't bright here.

Did a solo project after this before winding up with another company as a devops engineer. Making the most money I had ever made in my career and my first "Senior" title but I was handicapped substantially and the politics were absolutely brutal. The company was experiencing significant challenges and finally quit before the next round of layoffs that would have effected me.

\----------------------------------------------------------------------------------------------------------------------

So now we get to the actual discussion part. I have been dealing with some pretty serious mental illness concerns lately and am not in a place right now where I feel comfortable interviewing but when I am able to again, I'm wondering if it would be better to try and just hard pivot into dev or if that transition would be too brutal and I'm better off continuing to build my skills in this domain? I kind of just hate the tech scene in general right now but just looking for solid practical advice on how to proceed in my career.


*EDIT:* The reason I ask about transitioning to dev is pretty self explanatory to a lot of folks here. The entire idea of DevOps as a role is kind of being eroded over time. Knowledge of DevOps related topics is just kind of mandatory for anyone in SWE these days and "Platform Engineer" in the true sense isn't done properly except by a handful of high level teams who have their stuff together. But I've never been that great with DSA questions, actually terrified of them, and I recognize I'd be competing with a lot of young and hungry new grads with much better grasp of these concepts than me so that discourages me from going that route. I've thought about maybe going for SRE roles as opposed to DevOps? Idk.

https://redd.it/1j3hc0u
@r_devops
Ctrlplane – Open-Source Deployment Orchestration Tool

Hey everyone!

With the community’s help and feedback, we built Ctrlplane, an open-source deployment orchestration tool designed to manage complex deployments across diverse infrastructures. It provides a centralized platform for coordinating deployments, ensuring consistency, and improving scalability.

We built this because we manage 300+ Kubernetes clusters cross mutiple cloud providers and needed a way to roll out changes and upgrade systems efficiently.

If you're familiar with Octopus Deploy, think of Ctrlplane as a free and open-source alternative!

Ctrlplane isn’t meant to replace your existing CI/CD tooling—instead, it acts as an orchestration layer on top of it to streamline and automate deployments.

# Key Features:

Integration with existing CI/CD tools
Advanced orchestration policies
Automated and synchronized deployment activities
Version management
Environment standardization
Resource and CI analysis

# Helpful Links:

GitHub: https://github.com/ctrlplanedev/ctrlplane
Docs: https://docs.ctrlplane.dev
Website: https://ctrlplane.dev
Discord: https://ctrlplane.dev/discord

We are still building out a lot of features and are offically using it in our production workloads. Give it a try (or star)—hope you find it useful! 🚀

https://redd.it/1j3vif2
@r_devops
[HIRING] [INDIA] [REMOTE] [Sr.DevOps Engineer] [2 -3 years of experience]



Experience: 2 to 4 years of experience

**Requirements**

* Extensive Linux experience, comfortable between Debian and Redhat.

* Experience architecting, deploying/developing software, or internet scale production-grade cloud solutions in virtualized environments, such as Google Cloud Platform or other public clouds.
* Experience refactoring monolithic applications to microservices, APIs, and/or serverless models.
* Good Understanding of OSS and managed SQL and NoSQL Databases.
* Coding knowledge in one or more scripting languages - Python, NodeJS, bash etc and 1 programming language preferably Go.
* Experience in containerisation technology - Kubernetes, Docker
* Experience in the following or similar technologies-  GKE, API Management tools like API Gateway, Service Mesh technologies like Istio,  Serverless technologies like Cloud Run, Cloud functions, Lambda etc.
* Build pipeline (CI) tools experience; both design and implementation preferably using Google Cloud build but open to other tools like Circle CI, Gitlab and Jenkins
* Experience in any of  the Continuous Delivery tools (CD)  preferably Google Cloud Deploy but open to other tools like ArgoCD, Spinnaker.
* Automation  experience using  any of the IaC tools  preferably Terraform with Google Provider.
* Expertise in Monitoring & Logging tools preferably Google Cloud Monitoring & Logging but open to other tools like Prometheus/Grafana, Datadog, NewRelic
* Consult with clients in  automation and migration strategy and execution
* Must have experience working with version control tools such as Bitbucket, Github/Gitlab
* Must have good communication skills
* Strongly goal oriented individual with a continuous drive to learn and grow
* Emanates ownership, accountability and integrity



**Certifications**

* Google Cloud Professional Architect
* Certified Kubernetes Administrator (CKA)



**Responsibilities**

* Support seniors on at least 2 to 3 customer projects, able to handle customer communication with the coordination of products owners and project managers.
* Support seniors on creating well-informed, in-depth cloud strategy and  manage its adaptation process.
* Initiative to create solutions, always find improvements and offer assistance when needed without being asked.
* Takes ownership of projects, processes, domain and people and holds themselves accountable to achieve successful results.
* Understands their area of work and shares their knowledge frequently with their teammates.
* Given an introduction to the context in which a task fits, design and complete a medium to large sized task independently.
* Perform the tasks review of their colleagues and ensure it conforms to the task requirements and best practices.
* Troubleshoot incidents, identify root cause, fix and document problems, and implement preventive measures and solve issues before they affect business productivity.
* Ensure application performance, uptime, and scale, maintaining high standards of code quality and thoughtful design.
* Managing cloud environments in accordance with company security guidelines.
* Define and document best practices and strategies regarding application deployment and infrastructure maintenance.

https://redd.it/1j3wo1j
@r_devops
Dilemma - advice welcomed

hi guys,

Just want to see if someone can come up with great advice. Here's the thing - I came from operation support/ biz analyst/ system analyst background, and had the opportunity to dab into cloud/devOps stuff in my current role (because I asked for it). Mainly doing CI/CD, IaC stuff. I fall in love with what I am doing.

Recently I have started applying DevOps jobs. Most of the applications ended with the rejection email. Only had two interviews so far but never progressed to the next round. The interviewers in both interviews threw some programming questions to me, which of course I failed to provide any satisfactory answers.


The only programming that I have done in my current role is some small scripts to automate stuff. However, giving that my company is not a managed service provider/consultancy firm, there is only so little to automate.

Is programming experience pre-requisite for DevOps role? Am I screwed? Should I just give up my dream and start applying for Sys Analyst/BA roles instead?

Thanks all in advance!

https://redd.it/1j3vjmn
@r_devops
Is anyone here works in EPAM system , need referral for a Devops Engineer role

Is there anyone here who is in EPAM systems ? I need a referral for a role of devops engineer there is a vacancy for a 2 years of experience , i have the required experience, if anyone here can refer me it would mean a lot to me as i have been unemployed from 17th october 2024 , i have not been receiving any interview calls let alone offer letters , meanwhile i have also done my CKA certification as well

https://redd.it/1j3y3u4
@r_devops
Running a Go Lambda with the provided.al2023 runtime

Hi all, I am struggling to get my Golang lambda function running with the new provided.al2023 runtime.
I am using the SAM CLI and the Hello World Template (the basics). I have updated the template.yaml to use the provided.al2023 runtime (I'm not sure why AWS toolkit doesn't do this by default now since the go1.x runtime is now deprecated). See below:

template.yaml

AWSTemplateFormatVersion: "2010-09-09"
Transform: AWS::Serverless-2016-10-31
Description: >
  test-go-lambda

  Sample SAM Template for test-go-lambda

# More info about Globals: https://github.com/awslabs/serverless-application-model/blob/master/docs/globals.rst
Globals:
  Function:
    Timeout: 25

Resources:
  HelloWorldFunction:
    Type: AWS::Serverless::Function # More info about Function Resource: https://github.com/awslabs/serverless-application-model/blob/master/versions/2016-10-31.md#awsserverlessfunction
    Metadata:
      BuildMethod: go1.x
    Properties:
      CodeUri: hello-world/
      Handler: bootstrap
      Runtime: provided.al2023
      Architectures:
        - x8664
      Events:
        CatchAll:
          Type: Api # More info about API Event Source:
https://github.com/awslabs/serverless-application-model/blob/master/versions/2016-10-31.md#api
          Properties:
            Path: /hello
            Method: GET
      Environment: # More info about Env Vars:
https://github.com/awslabs/serverless-application-model/blob/master/versions/2016-10-31.md#environment-object
        Variables:
          PARAM1: VALUE

Outputs:
  # ServerlessRestApi is an implicit API created out of Events key under Serverless::Function
  # Find out more about other implicit resources you can reference within SAM
  #
https://github.com/awslabs/serverless-application-model/blob/master/docs/internals/generatedresources.rst#api
  HelloWorldAPI:
    Description: "API Gateway endpoint URL for Prod environment for First Function"
    Value: !Sub "https://${ServerlessRestApi}.execute-api.${AWS::Region}.amazonaws.com/Prod/hello/"
  HelloWorldFunction:
    Description: "First Lambda Function ARN"
    Value: !GetAtt HelloWorldFunction.Arn
  HelloWorldFunctionIamRole:
    Description: "Implicit IAM Role created for Hello World function"
    Value: !GetAtt HelloWorldFunctionRole.Arn

Now when i run sam build & then sam local start-api my request just hangs and then times out! Why is this?

Please note I am on a Windows system

https://redd.it/1j3z6go
@r_devops
Podman vs Colima in 2025

I'm a beginner in the realm of containerization, and I've been doing plenty of reading into the various pros and cons of the offerings available, but it's difficult to find any recent discussion on the matter particularly when it comes to Apple Silicon Macbooks

 

There are plenty of posts from a few years ago when Docker Desktop became a paid product and everybody started moving to Colima, but since then it seems discussion has died down.

What's the 2025 state of Podman on M-chip macOS? Is the virtiofs thing figured out yet?

Has podman quadlets reached competitive parity with Docker Compose?

Ive read that Rancher Desktop had connectivity issues. Is this still true?

Is there any substantive difference in implementation between CRI-O and containerd?

I know that podman doesn't have the popularity and therefore amount of discussion, and documentation available for docker, but is Podman substantively more difficult to learn as a beginner?

Which would you rather work with?

 

#Unimportant Contextual Information Follows
Why Im asking: All of the "intro to containerization" youtube videos are essentially "intro to docker" videos. All of the intro to docker videos start by telling me to click - install a DMG GUI interface. I don't like having a GUI forced on me, and so I began searching through the alternatives. There are a lot of comparison posts, but they're all 2 years old or more.

https://redd.it/1j3zwxw
@r_devops
Managing Dev EC2 instances

Hello!
We're a fairly big team with around 300-500 active dev EC2 instances. Currently all of the requests to dev EC2 instances mostly go through me, whether it be to create a new instance or modify anything relate to an already existing instance. We don't have a streamlined process for this so i do this any way I can (mostly through the console) just to get it done asap. I would like to know if there is a good 3rd party software out there (preferably free) which is good at handling dev EC2 instances. Or if somebody could recommend a good way to build your own dev ec2 manager of sorts. I had an idea to make a script which just takes in parameters like instance size, user email, ami etc... and just make a wrapper around it and create a slack bot or use streamlit and deploy that somewhere for people to access. Any help is much appreciated! Thank you!

https://redd.it/1j40t3j
@r_devops
CTO wants me to start DevOps initiatives, where do I even start?

The place is a fintech firm but their tech is behind in terms of best practices since the main business isn't software. They finally have run way to start handling their technical debt.

Some remarks from other developers:
- Terrible developer onboarding, long time to get apps running locally
- Outdated design docs without a process to update them upon feature changes - word docs, confluence
- Releases used to be on weekends but now happen every 2-3 days
- Developers want the process to be more streamlined but haven't talked to them to understand what that means

Their tech stack:
- Datadog for tracing
- AWS
- Ruby on rails
- Python

Any low-hanging fruit and domains to deep dive to get started? Any good questions to conversations on? My DevOps role seems pretty open so as long I'm making the developer's life easier then it'll make my life easier. Trying to catch up on all the automation tools available. I'm a past developer hired to be on the support side and basically starting up their support ops team. They have a sister dev support ops team but they're based in india.

https://redd.it/1j42mwi
@r_devops
How Does Your Team Handle Prometheus Alerts? Manual vs. Automated

Does your team write Prometheus/Grafana alert rules manually, or do you use an automated tool? If automated, which tool do you use, and does it work well?

Some things I’m curious about:

1. How do you manage and update alert rules at scale?
1. Do you struggle with alert fatigue or false positives?
1. How do you test and validate alerts before deploying?
1. What are your biggest pain points with Prometheus/Grafana alerting?

Would love to hear what works (or doesn’t) for your team!

https://redd.it/1j41plm
@r_devops
First time DevOps Engineer - any advice?

Hey all. I've worked before as IT Support, QA and Sysadmin and finally landed a job as Junior DevOps, which I felt really lucky and extremely lucky to get.

It's a company with around 1000 members, with dedicated DevOps team.

I have some knowledge with networking, feeling more and more comfortable with AWS, played a bit with Terraform. but haven't grasp much yet with Ansible and k8s - aaand I know I have shitload of stuff to learn, but I am prepared for it. I also know that things can get stressful.

Do you have any advice or tips for first weeks and months? Some pitfalls, traps?

https://redd.it/1j440eb
@r_devops
dnsdist over TLS timing out with my setup on GKE

I used BIND9 to create a DNS server in Kubernetes that forwards traffic to Cloudflare DNS and handles few endpoints, and attached it to a Load Balancer on UDP port 53 and assigned a public IP to it, it works fine with the dig command and am able to hook it to my network.

But then I introduced dnsdist to have DNS over TLS and to properly use a hostname for the DNS server instead so had the BIND9 Load Balancer converted to a ClusterIP and configured dnsdist to forward to it and listen on port 853 and 53 both, for 853 I enabled TLS and used certbot to generate the certificate and key using the Cloudflare plugin where I have my domain and I intend to create the A record for it as follows dns.example.com of course not proxied (DNS only).

The certificate and key are valid and are mounted correctly to the container, I double-checked with openssl and everything is fine there, I allowed dnsdist ACL access from 0.0.0.0 and made firewall rules for my VPC to allow ingress connections on ports 53 and 853.

Now, when I run:
dig @ dns.example.com google.com it works perfectly fine!

However with:

dig @ dns.example.com google.com +tcp I get a timeout?

Can someone elaborate on what could the problem be?

https://redd.it/1j42rw9
@r_devops
How long did it take you to become a dev ops engineer and how did you get there?

How long did it take in your career to make it to dev ops. I'm 29 going on 30. I've been in IT help desk for 3 years now I am a consultant basically a tier 1.5 at my job not a tier 2 and not a full tier 1. We have half tiers at my job. I am going to WGU for a software engineering degree and I'm 53 percent done. I want to get into dev ops but it seems nearly impossible my career does not advance at all. I feel stuck and can't figure out why I want to be a dev ops engineer.

https://redd.it/1j47yag
@r_devops
How do I deal with an annoying, entitled coworker who won’t leave me alone making my job hell?

Hello Everyone,

I’m a DevOps engineer, and I have this coworker who’s driving me insane. He’s lazy, clueless, entitled, and somehow managed to land this job by lying about his skills. He didn’t even study CS—he just jumped into tech because he couldn’t find a job in his original field.

# Here’s why he’s unbearable:

Acts like he knows everything but actually knows nothing – He argues about tech he clearly doesn’t understand, contributes almost nothing when we get tasks, and instead of actually working, he just asks me to share my screen so he can "learn" while I do everything.
Cheap and hypocritical – Constantly shits on my certifications, saying stuff like "certs are useless," "why do you even study for those?" or "how many certs do you need?"—yet he spends his time desperately trying to get free vouchers instead of actually learning.
Nosy and gossipy – He constantly talks about other people’s business, telling me what’s going on between seniors, managers, and other teams. It’s unprofessional and just plain annoying.
Brings up religion, politics, and other cringe topics – He thinks he's funny, but he's not. I have zero issues with anyone else in my team or even other teams, but this guy just won’t shut up about irrelevant stuff.
Procrastinates and leeches off me – He acts like he’s doing me a favor by doing his own job – He pretends that his assigned tasks are somehow helping me, even though they’re his responsibility, not mine. Then, he uses that as an excuse to ask for my help whenever he’s busy wasting time on irrelevant crap. I don’t get paid to do his job for him. To make it worse, he constantly lies about what he’s done, taking credit for things he barely contributed to or didn’t even work on.
Doesn’t take a hint – I stay polite and professional, yet he constantly complains that I’m "too strict" with him, even though I treat him like any other coworker.

I genuinely enjoy my job, have great relationships with my seniors and other departments, and help out wherever I can. But this one guy is making my work life miserable.

I don’t want to babysit him, and I don’t want to cause drama, but he’s seriously pushing me to my limits. The job market isn’t great, so I can’t just leave right now.

# How do I get him to back off without making my work life hell?

Has anyone dealt with a coworker like this before? Would appreciate any advice on how to handle this situation.

https://redd.it/1j49b5v
@r_devops
Failing in devops role

I have over 10 years of experience. Started a application consultant then moved to cloud infra migration projects. Due to the demand of upskilling and sudden shift i learnt k8s, terraform, devops by myself. Got deployed to a devops project. But here everything looks like a mess or i feel like am unable to keep up.

Random tasks got assigned, for eg, need to do a modifications for a cloud service which am not familiar,when asked the team about how the workflow of the services, nobody knows it. The guy who implemented it had left the team is the reply i got. Another one is related to some issue in the CI which i don't know hot to debug it. The team am working is not corporative. They will assure you that we will help each other but the next day they will question us only like why it got delayed.

I feel like i don't have the skill, i am thinking of moving to a cloud architect role or customer sucess role as I had good background in cloud transition projects.

https://redd.it/1j4ausq
@r_devops
Automating VM creation

Hi,

I want to use Vagrant with KVM/QEMU to automatically create VMs with different hardawre emulated and open a browser on them when they boot. I would like to not have to go through the system installation every single time a new VM is set up. What would be the correct way to do it? I have come across Veewee and it looks like it can take any iso file for a template, would it work if i were to set up let's say debian 12 manually first and convert it to an iso file? I am also worried about internet connection since I don't think a new IP would be assigned from KVM's DHCP when i bring up a second VM.

https://redd.it/1j48i5b
@r_devops
Deploy to VPS without SSH

I am using Terraform and Docker Swarm to deploy my application stack on a VPS.

I'm looking for a way of doing that from my pipeline, but I don't have an SSH key as the server might not exist yet or the SSH (private) key changed since the last deployment.

Is there a way of using Terraform ONLY to execute code remotely without having it create a whole new server or is there another way in which I can deploy my application stack from GitHub Actions without knowing the servers SSH credentials?

The trigger for a deployment to the test environment is a merge to master, by the way. I was thinking something that triggers a `docker stack deploy` from my GitHub action, but I cannot find anything that triggers this remotely.

PS: I'm not a fan of pull-based approaches like Watchtower polling for the newest image. I prefer deploying from my actions when I merge :)

https://redd.it/1j4e1l3
@r_devops
What are some of the most advanced things you have learned in the last 5 years?

What are some of the most advanced things you have learned in the last 5 years? I am interested to learn what I might be able to learn on my own in the coming years. Feel free to share.

https://redd.it/1j4gjb8
@r_devops