Reddit DevOps
270 subscribers
5 photos
31K links
Reddit DevOps. #devops
Thanks @reddit2telegram and @r_channels
Download Telegram
Senior devs, here's a FREE workshop on release management!

# What it is not:

This isn’t another 101 session—You'll get advanced insights tailored to engineers operating at scale. Whether you’re managing large-scale production systems or refining your team’s delivery processes, this workshop will deliver actionable takeaways you can implement immediately.

# What it is:

We’re hosting a free workshop for experienced engineers and engineering leaders managing complex systems and an AMA session focused on scaling release management processes.

You will learn directly from leaders who’ve optimized software delivery in some of the most demanding

# Meet the Experts:

\- Ankit Jain: CEO and co-founder of Aviator, a developer productivity startup. Ankit is a former Google engineer with extensive experience leading engineering teams and building efficient release pipelines.

\- Vilas Veeraraghavan: Former Engineering Leader at Netflix, Walmart, Bill . com, and TruckStop. With deep expertise in scaling CI/CD, chaos engineering, cloud-native systems, and DevEx tooling, Vilas has delivered solutions in industries ranging from streaming to logistics.

\## What to Expect:

🔍 Analyze Key Challenges
Get clarity on common pitfalls in release cycles, including:
Streamlining deployments and rollbacks.
Managing production risks and distributed systems at scale.
Identifying bottlenecks that slow delivery in high-performing teams.
🔧 Learn Scalable Best Practices:
Discover actionable strategies for:
Automating release workflows tailored to complex infrastructures.
Improving deployment visibility for better incident management.
Managing service-specific release processes in diverse team setups.
💡 Interactive Problem-Solving Session:
Engage directly with our speakers and an open AMA to tackle your toughest challenges.

Here's the RSVP link with more info

See you there! 👨‍💻👩‍💻

https://redd.it/1i16qqx
@r_devops
Options for in-house container (potential VM) platform

Most of our production workloads are in the cloud but we have a legacy setup spanning back nearly 20 years in-house that we are trying to modernize.

I'm looking to shift most development/staging to containers. I have a decent amount of experience with containers/docker etc. but not with orchestration, kubernetes etc. Nomad seems like a decent option but I'm weary about getting into best with HashiCorp too.

I'm looking at options for a smaller environment without having to get super deep into the complexities of kubernetes. I've seen nomad mentioned as well as mini kube, k3s etc. I don't know what to start with.

Also VM platform is oVirt/RHEV which is basically dead in the water and if we continue with VMs I need to replace it with something else (proxmox perhaps). Something that can do both VMs/containers like OpenShift could be an option, but I could potentially get off VMs all together and go 100% container, or build container platform on top of VM cluster.

Again, since most of this setup will be for development/staging purposes it doesn't have to be super redundant but I do have the infrastructure available to do basically whatever needed.

Should I bite the bullet and go straight to k8s or look at other alternatives?

https://redd.it/1i17cqk
@r_devops
Do you guys enjoy writing terraform?

For those building in the cloud, working in smaller orgs do you actually enjoy writing terraform? I find that I would enjoy my job much more if I could just focus on building out features instead of splitting my focus on development, cloud training & infra buildout.

Is there anything you guys use for self-service? I recently wanted to do a poc on AWS ECS but then had to deal of the headache of figuring out the right internal module version to use & then running it before I was able to start working on my poc

https://redd.it/1i194v0
@r_devops
anyone here setup bitnami kafka + cert manager + istio ingress kubernetes gateway API?

I am trying to figure out how to actually connect to it using the url. I have it running in cluster now. chatgpt is sending me down a rabbit hole...requesting help from my fellow humans. If anyone can share the setup.

https://redd.it/1i1ccha
@r_devops
Devops career growth

I recently got moved from being a fed developer into leading a small small team of contractors to build agnostic pipelines for a large organization. I am concerned that I may have just been given busy work because I’m female… guess I am looking for some reassurance that there is still potential for a lot of growth as a DevOps engineer. Opinions?

https://redd.it/1i1e70z
@r_devops
Logstash alternatives

Logstash has been my go to tool for ETLs for most of my professional career. It's either already been in place as the ETL process or the destination has been an Elasticsearch cluster making it the easiest choice to implement. I've never actually looked at any alternatives, anyone have any recommendations?

https://redd.it/1i1bo9e
@r_devops
Does GCP consider image downloads as outgoing traffic?

I wanted to clarify with more knowledgeable people, I have a website and there are a lot of images on it that are loaded into the frontend part, there are no requests on the backend.



Does GCP consider uploading images to the frontend as outgoing traffic from the server? I know it's a stupid question, but I just don't understand it anymore.



Every month, I receive a bill of $ 120 for outgoing traffic from my server in Europe, which traffic goes to America, in the amount of about 700-800 GB.



At the same time, requests do not go anywhere from my server, namely the project that lies on it, I did not write such methods there and I do not need them.

https://redd.it/1i1g8cm
@r_devops
What do people expect from DevOps/SRE at 150k+ base salary positions?

I am wondering what technical areas should one currently focus on to land high-paying job? I mostly talk about US salaries because I haven't seen such high ones in Europe or elsewhere. Is it simply something like Kubernetes and containerization overall, common IaC tooling, Clouds, Ansible, logging i.e just basic DevOps stuff, but with deeper understanding? Is it something more specific or foundational like NALSD, DSA, OS? Or maybe it's just matching a job that looks for a person with a deep knowledge in one certain topic?

Please share your experience or observations!

https://redd.it/1i1hcjz
@r_devops
Your blue-green deployment approach

Is anyone here using awscdk to do blue-green deployment via ci/cd self-service? If so, how are you doing it? I was thinking about the state or cloudformation about the resources that it already deployed. How will it do blue-green if that is the case. Also, are you happy you used awscdk to do build your automated ci/cd pipeline?

Or maybe I should be open for other ideas aside from awscdk, terraform, opentofu. How did you build your automated ci/cd pipeline? How are your developers using it to deploy their resources?

https://redd.it/1i1i3ja
@r_devops
Need help about DePIN powered server uptime manager

For a while, we’ve been developing a DePIN-powered uptime monitoring tool designed to potentially handle data from millions of devices. Our current infrastructure monitoring and uptime management service, (Checkmate) is evolving to include DePIN integration. This will allow users to burn tokens to access data from the UpRock DePIN network.

This is currently how it works under the hood:

\- Connect your wallet

\- Select the server you want to monitor

\- Choose a geographic focus—whether specific cities, countries, or entire continents—for Checkmate to send ping messages

While managing large volumes of data isn’t an issue at this stage, visualization remains a challenge. We’ve implemented MapLibre to display the data, giving users the flexibility to send one-off ping requests to the DePIN network or schedule continuous checks (e.g., every minute).

Given the novelty of this concept (similar to RIPE Atlas), visualizations will play a critical role for admins. Here's what we can currently offer on the dashboard:

\- Node distribution on a map: Visualize the number of nodes per country.

\- Selective probing: Choose probes directly on the map.

\- Probe details: View all probes selected for a specific server.

\- One-off ping tests: Perform immediate connectivity checks.

I need some feedback on how to move ahead. Since we are just a few weeks away from the general release, it would be great if I could get some thoughts. We’re considering whether this is the right balance of features or if adjustments are needed.

My immediate questions would be:

\- If you had access to a global DePIN network for server monitoring, what would you prioritize seeing on the dashboard?

\- Would you be interested in seeing historical logs? Like access logs going back to a specific time.

\- would you want to customize packet size? (set the size of the packets being sent).

Probably there are others upcoming but I would like to start with a small UI set initially.

https://redd.it/1i1jzck
@r_devops
Need help about DePIN powered server uptime manager

For a while, we’ve been developing a DePIN-powered uptime monitoring tool designed to potentially handle data from millions of devices. Our current infrastructure monitoring and uptime management service, (Checkmate) is evolving to include DePIN integration. This will allow users to burn tokens to access data from the UpRock DePIN network.

This is currently how it works under the hood:

\- Connect your wallet

\- Select the server you want to monitor

\- Choose a geographic focus—whether specific cities, countries, or entire continents—for Checkmate to send ping messages

While managing large volumes of data isn’t an issue at this stage, visualization remains a challenge. We’ve implemented MapLibre to display the data, giving users the flexibility to send one-off ping requests to the DePIN network or schedule continuous checks (e.g., every minute).

Given the novelty of this concept (similar to RIPE Atlas), visualizations will play a critical role for admins. Here's what we can currently offer on the dashboard:

\- Node distribution on a map: Visualize the number of nodes per country.

\- Selective probing: Choose probes directly on the map.

\- Probe details: View all probes selected for a specific server.

\- One-off ping tests: Perform immediate connectivity checks.

I need some feedback on how to move ahead. Since we are just a few weeks away from the general release, it would be great if I could get some thoughts. We’re considering whether this is the right balance of features or if adjustments are needed.

My immediate questions would be:

\- If you had access to a global DePIN network for server monitoring, what would you prioritize seeing on the dashboard?

\- Would you be interested in seeing historical logs? Like access logs going back to a specific time.

\- would you want to customize packet size? (set the size of the packets being sent).

Probably there are others upcoming but I would like to start with a small UI set initially.

https://redd.it/1i1jxsj
@r_devops
Salary depression

I’m a lead/staff SRE/Devops practitioner that is currently on the market. Is it just me, or are companies in the US trying to drive salaries down really hard? I’ve seen on-call lead engineers advertised as “max 120k” and I talked to someone today who hadn’t advertised a salary but their max was 140k for a lead SRE with 10+ years experience in a senior role.

Are people actually taking these salaries?

https://redd.it/1i1mzs4
@r_devops
Full-Time DevOps also doing contracting gigs?

Hi all,

I’m currently a full-time DevOps engineer. I enjoy what I do at my current employer, have great management, and don’t want to leave. However, I would like to earn more by potentially finding DevOps related contract jobs to do part-time. If any of you out there are doing this, are there any apps or resources you could point me to? Thanks in advance.

https://redd.it/1i1mkqm
@r_devops
Does Palantir's Apollo offer any real value?

Does Palantir's Apollo offer any real value? It looks and smells like a scam, but it's hard to tell. What do you think about it?

https://redd.it/1i1ofh9
@r_devops
Introducing Whispr: A DevOps tool to fetch secure vault secrets Just-In-Time for Apps

Hi DevOps community, let me introduce an exciting tool we created at Cybrota.

Whispr (Pronounced whisper) is an open-source tool to fetch vault secrets (AWS, Azure or GCP) and inject them straight into your app environment either via environment or as STDIN args. This is very handy in keeping your `.env` file free from plain-text secrets and fetch them on-demand for your local/CI app development. It avoids attacks like stolen-credentials by storing nothing.

All it takes is:

`pip install whispr`

How it works ?

1. Place an empty `.env` file in your project, and let Whispr fetch corresponding secrets from a connected vault and inject values into your program environment. All you need is to run

```sh
$ whispr run 'your_command_with_args'
```

2. Whispr uses your existing vault's authentication (IAM) to securely fetch secrets. So no new auth mechanisms are required.

3. In addition Whispr comes with handy utilities to peek your secret quickly (Vault-agnostic), or even generate a crypto-safe random sequence for rotating secrets.

Here is the GitHub project: https://github.com/cybrota/whispr

4. If you want to inject secrets into app's environment programmatically (without `run`), whispr package provides elegant API.

Tool is currently attracting 2K downloads per month, with various enterprise teams already using it to set up safe and authorized pre-commit hooks to standardizing local app development.

The project itself uses security best practices like code scanning, No shell-use while launching app, and PyPi verified attestation to release packages etc.

I would love to hear your feedback about possible improvements, criticism, and suggestions! I hope it will show up in your workflows soon!

https://redd.it/1i1qffo
@r_devops
Secure Apple Devops Interview

Hey everyone, I recently got myself an interview for a DevOps Engineering position. I’ve mostly done Cloud Ops/ Dev Ops work in AWS (4 years) with some Network admin /Support (2.5 years) work back in my earlier career days.


This role seem to focus more on KVM, Xen, Containers, Enterprise Linux, Ansible (with Python and bash obviously), telemetry tools such as Prometheus, Alertmanager. Looking for some help on a preparation plan if someone has gone through a similar interview process already. If you could give any advice or help tips that would be great!

https://redd.it/1i1okno
@r_devops
My CAPA Experience

Disclaimer: This story was written by one of our employees



I recently earned my CAPA certification and wanted to share my experience.



For preparation, I took the DevOps and Workflow Management with Argo course (LFS256). While the course taught me a lot about the Argo project and how it works, I feel like it didn’t cover everything on the exam. Out of 60 questions, at least 10 caught me off guard because they covered topics I had never encountered before.



If I were to take the exam again, I’d definitely read through the entire documentation for each Argo project and focus on the details. The course links some parts of the docs, but in hindsight, that wasn’t enough.



Comparing this to my experience with the CKA exam (which I passed about 18 months ago), the prep for the CKA felt tougher, even though I had great study resources. That said, I walked away from the CKA feeling confident I’d passed, while with CAPA, I was genuinely unsure and thought I might need a retake.



I’m not sure if my struggle with CAPA was because I hate multiple-choice exams, put less effort into prep, didn’t have the right materials, or some questions surprised me —but for me, CAPA felt harder.



Has anyone done the CAPA exam? Can you compare it to some other CNCF certification exams?

https://redd.it/1i1vuyp
@r_devops
Feedback for OneUptime: Open Source Monitoring and Observability Platform

We're building an open source observability platform - OneUptime (https://oneuptime.com). Think of it as your open-source alternative to Datadog, NewRelic, PagerDuty, and Incident.io—100% FOSS and Apache Licensed.

Already using OneUptime? Huge thanks! We’d love to hear your feedback.

Not on board yet? We’re curious why and eager to know how we can better serve your needs. What features would you like to see implemented? We listen to this community very closely and will ship updates for you all.

Looking forward to hearing your thoughts and feedback!

https://redd.it/1i1xa5y
@r_devops
Any Alternative to TEAMS for AWS Identity Center

https://aws-samples.github.io/iam-identity-center-team/
Do we have any alternative solution like TEAMS which can perform Elevated Access?
Specifically for Master Account.

https://redd.it/1i1y8w1
@r_devops
A Small Tool I Built for Faster Feedback: cfex

Hi everyone,
As a developer, I noticed that startups and small teams often face delays when sharing applications for feedback or demos due to the hassle of setting up staging environments. To solve this, I built cfex, a small CLI tool that lets you go live instantly.

With just one command:

cfex api.yourdomain.com:8080

Your app is live at https://api.yourdomain.com, with HTTPS and HTTP/3 enabled by default. It’s perfect for quick iterations, testing, or showing progress to stakeholders.

The tool is similar to ngrok but built on top of cloudflared, leveraging Cloudflare's robust infrastructure.

The code is open source: https://github.com/muthuishere/cfex-cli
More details: https://muthuishere.medium.com/one-command-to-go-live-with-cfex-135d74d81b45

I’d love to hear your feedback or ideas for improving it. If you think it could help your team or project, feel free to give it a try!



https://redd.it/1i1zqmn
@r_devops
Biotech pros, dive into our Apache NiFi demo for big-scale data automation.

We created a demo video in how Apache NiFi can be used. The video doesn't explicitly show data or workflows specifically pertaining to biotech, but it does show NiFi functionality.

Reason for this post, is I'm looking to see if other biotech business are running into data ingestion limitations and need solutions at scale for ingestion.

Sharing below is our case studies, and the video link to the demo. I would love to get feedback as to the effectivness this solution is for biotech businesses.

Case Studies: https://dasnuve.com/case-studies

NiFi Workflows Demo: https://videoshare.dasnuve.com/video/nifi-workflows-demo

https://redd.it/1i206yc
@r_devops