Reddit DevOps
270 subscribers
5 photos
31K links
Reddit DevOps. #devops
Thanks @reddit2telegram and @r_channels
Download Telegram
Our open source project got featured on DevOps Toolkit!

DevOps Toolkit just did a video covering our open source project, mirrord. mirrord lets apps connect into a live K8s environment during development and “mirrors” traffic to a local process from a pod, so you can debug/iterate as if your service was live in the cluster!

Here's the link if you’re curious: https://www.youtube.com/watch?v=NLa0K5mybzo

https://redd.it/1k58ypz
@r_devops
Dockflare Update: Major New Features (External Tunnels, Multi-Domain!), UI Fixes & New Wiki!

Hey r/devops !

Exciting news - I've just pushed a significant update for **Dockflare**, my tool for automatically managing Cloudflare Tunnels and DNS records for your Docker containers based on labels. This release brings some highly requested features, critical bug fixes, UI improvements, and expanded documentation.

Thanks to everyone who has provided feedback!

Here's a rundown of what's new:

# Major Highlights

* **External Cloudflared Support:** You can now use Dockflare to manage tunnel configurations and DNS even if you prefer to run your cloudflared agent container externally (or directly)! Dockflare will detect and work with it based on tunnel ID.
* **Multi-Domain Configuration:** Manage DNS records for multiple domains pointing to the same container using indexed labels (e.g., cloudflare.domain.0, cloudflare.domain.1).
* **Dark/Light Theme Fixed:** Squashed bugs related to the UI theme switching and persistence. It now works reliably and respects your preferences.
* **New Project Wiki:** Launched a [GitHub Wiki](https://www.google.com/url?sa=E&q=https%3A%2F%2Fgithub.com%2FChrispyBacon-dev%2FDockFlare%2Fwiki) for more detailed documentation, setup guides, troubleshooting, and examples beyond the README.
* **Reverse Proxy / Tunnel Compatibility:** Fixed issues with log streaming and UI access when running Dockflare behind reverse proxies or through a Cloudflare Tunnel itself.

# Detailed Changes

# New Features & Flexibility

* **External Cloudflared Support:** Added comprehensive support for using externally managed cloudflared instances (details in README/Wiki).
* **Multi-Domain Configuration:** Use indexed labels (cloudflare.domain.0, cloudflare.domain.1, etc.) to manage multiple hostnames/domains for a single container.
* **TLS Verification Control:** Added a per-container toggle (cloudflare.tunnel.no\_tls\_verify=true) to disable backend TLS certificate verification if needed (e.g., for self-signed certs on the target service).
* **Cross-Network Container Discovery:** Added the ability (DOCKER\_SCAN\_ALL\_NETWORKS=true) to scan containers across all Docker networks, not just networks Dockflare is attached to.
* **Custom Network Configuration:** The network name Dockflare expects the cloudflared container to join is now configurable (CLOUDFLARED\_NETWORK\_NAME).
* **Performance Optimizations:** Enhanced the reconciliation process (batch processing) for better performance, especially with many rules.

# Critical Bug Fixes

* **Container Detection:** Improved logic to reliably find cloudflared containers even if their names get truncated by Docker/Compose.
* **Timezone Handling:** Fixed timezone-aware datetime handling for scheduled rule deletions.
* **API Communication:** Enhanced error handling during tunnel initialization and Cloudflare API interactions.
* **Reverse Proxy/Tunnel Compatibility:** Added proper Content Security Policy (CSP) headers and fixed log streaming to work correctly when accessed via a proxy or tunnel.
* **Theme:** Fixed inconsistencies in dark/light theme application and toggling.
* **Agent Control:** Prevented the "Start Agent" button from being enabled prematurely.
* **API Status:** Corrected the logic for the API Status indicator for more accuracy.
* **Protocol Consistency:** Ensured internal UI forms/links use the correct HTTP/HTTPS protocol.

# UI/UX Improvements

* **Branding:** Updated the header with the official Dockflare application logo and banner.
* **Wildcard Badge:** Added a visual "wildcard" badge next to wildcard hostnames in the rules table.
* **External Mode UI:** The Tunnel Token row is now correctly hidden when using an external agent.
* **Status Reporting:** Improved error display and status messages for various operations.
* **Real-time Updates:** The UI now shows real-time status updates during the reconciliation process.
* **Code Quality:** Refactored frontend JavaScript for better readability and maintainability.

# Documentation

* **New Wiki:** Launched the [GitHub
Wiki](https://www.google.com/url?sa=E&q=https%3A%2F%2Fgithub.com%2FChrispyBacon-dev%2FDockFlare%2Fwiki) as the primary source for detailed documentation.
* **Expanded README:** Updated the README with details on new options.
* **Enhanced Examples:** Improved .env and Docker Compose examples.
* **Troubleshooting Section:** Added common issues and resolutions to the Wiki/README.

This update significantly increases Dockflare's flexibility for different deployment scenarios and improves the overall stability and user experience.

Check out the project on GitHub: [https://github.com/ChrispyBacon-dev/DockFlare/](https://www.google.com/url?sa=E&q=https%3A%2F%2Fgithub.com%2FChrispyBacon-dev%2FDockFlare%2F)
Dive into the details on the new Wiki: [https://github.com/ChrispyBacon-dev/DockFlare/wiki](https://www.google.com/url?sa=E&q=https%3A%2F%2Fgithub.com%2FChrispyBacon-dev%2FDockFlare%2Fwiki)

As always, feedback, bug reports, and contributions are welcome! Let me know what you think!

https://redd.it/1k58r0l
@r_devops
Continous java profiling to improve open source observability

It's been a common request to add java profiling within the Coroot community - an observability project I'm a part of that looks at turning telemetry into root cause insights (with open source, so easy network monitoring isn't only accessible to companies with budgets for giant vendors.) The feature has been updated now and hopefully it can help some members of this sub too.

Nikolay Sivko's written a blog that walks through how you can use it without any code changes to detect high CPU usage and GC pauses in a Java service. You can check out our Github if you'd like to give it a try, and we'd love any feedback to help improve OSS resources for everyone!

https://redd.it/1k5cplp
@r_devops
Switching to Devops

Hello everyone,

I hope you all had a great Easter and managed to get some good rest.

I would really appreciate some mindset advice. I have been working for 5.5 years as a Cisco TAC engineer, mainly focused on Software Defined Access (SDA). Recently, Cisco shut down the entire TAC in Belgium, and now I am at a turning point.

I am trying to decide whether I should continue deepening my knowledge in networking or shift towards DevOps. My aim is to stay useful in the job market and focus on a technology that is not vendor locked and is likely to stay relevant in the long term.

For those of you who have transitioned into DevOps recently — how has it been? Do you enjoy it? Would you make the same choice again?

Thank you for any insights you can share!

https://redd.it/1k5ecnq
@r_devops
Updated: End-to-end DevOps hands-on project

TL;DR
As the Continues Improvement and Feedback Loopsis are ones of the DevOps principles ... so based on the users feedback I've updated the end-to-end DevOps hands-on project part of the FREE pragmatic Dynamic DevOps Roadmap.

https://devopsroadmap.io/projects/hivebox/

---

Background

Now starting the project is easier than ever even for people with basic DevOps knowledge.

Who see the project for the first time ... this free/open-source roadmap focuses on the principles instead of just tools and it uses an iterative approach the same as in the real-work.

Enjoy ♾️

https://redd.it/1k5h7n7
@r_devops
We built a tool to deploy from Cursor or Claude with one prompt

👋 Hey DevOps folks

We built an MCP server that lets you deploy your app to the cloud just by typing deploy inside your IDE chat (like Cursor or Claude).

Right now, it deploys to our Playground and we’re working on AWS, GCP, and DigitalOcean support next.

Here’s a quick demo video showing how it works:

🎥 https://www.linkedin.com/feed/update/urn:li:activity:7320490826004852737/

Docs if you want to explore or test it.

Any feedback would be appreciated! 💙

https://redd.it/1k5kgex
@r_devops
Using a public computer in internet cafe

I know it's a very unideal situation, but I move around a lot and sometimes don't have my laptop. So, to use a public computer securely to work, how would you do it?

For logging into accounts, passkeys stored in 1password seem to be a safe way, no key logger can get your passwords. But the passkey has to be supplied from your phone. How do you do this? I'm testing this now and the computer gives me the option to supply a passkey from a USB but that's the only way. That's not secure because spyware could download all the contents of the USB, so could steal the passkey. I need to login to GitHub and Google things like this.

What if I create a public GitHub account, generate a new SSH key each time and just develop locally on that, then when I'm at my real computer, I fork the repos. The issue is secrets like API keys but I can rotate them I suppose

https://redd.it/1k5m6gp
@r_devops
Is anyone else sick of slow PR reviews, merge surprises, and lost onboarding context?

I’m seeing a pattern on a few teams:

PRs sit for days or get rushed rubber stamped

Merges go through, but break things downstream

New devs feel lost in legacy code or get stuck in review limbo

Curious how your team handles:

1. Assigning the right reviewer (not just random or round-robin)

2. Catching risky PRs before merge

3. Onboarding devs into complex parts of the codebase

just trying to understand what works for folks dealing with this day-to-day.

Would love to hear how you’ve tackled this (or if you haven’t). Any strategies or tools that actually helped?




https://redd.it/1k5ri0l
@r_devops
I highly recommend watching this video!

I highly recommend watching this video for anyone who is pursuing Cybersecurity at a total beginner level like myself. I’m watching these and it’s really helped me understand concepts that were so over my head at first. Really appreciate it!


https://youtu.be/Ond\_DIGXyoI

https://redd.it/1k5szu2
@r_devops
How future proof is DevOps?

I am sure a lot of people ask this question, but I haven’t found a backed reason as to why it’s good to learn it.
I’m a student who is interested in pursuing a career in DevOps, I barely have any experience yet except for mainly FE and BE basics with some DB knowledge.
In general how much is the demand for DevOps engineers and are the salaries good for Europe?

https://redd.it/1k5w8t3
@r_devops
Top devsecops interview questions

I just completed a devsecops course, ECDE to be precise, and I started getting multiple call when I update my resume. I have crack 3 interview and this is what I found they are mostly asking for.

* Can you discuss your experience with implementing and managing CI/CD pipelines?
* What are some common challenges you have encountered when integrating DevOps practices within an organization, and how did you overcome them?
* Describe your experience with containerization technologies such as Docker and orchestration tools like Kubernetes.
* Have you worked with any configuration management tools such as Ansible, Chef, or Puppet? Can you explain how you have used them in your previous projects?
* Can you discuss your experience with infrastructure-as-code (IaC) tools like Terraform or CloudFormation?
* How do you ensure high availability and scalability in a cloud-based infrastructure? What strategies or tools have you used?
* How do you ensure secure coding practices within a DevOps environment? Can you provide examples of security measures you have implemented?
* Have you worked with vulnerability scanning tools or security testing frameworks in a DevSecOps context? Can you discuss your experience and how they contribute to overall software security?
* Describe a time when you identified and resolved a critical security incident within a DevSecOps environment. What steps did you take, and what was the outcome?

https://redd.it/1k5ww79
@r_devops
Deeply curated database of 750+ well-funded, Remote-friendly startups + jobs

No, this isn't another scraped spreadsheet or pay-to-play directory. It's an open, manually curated database of well-funded startups building interesting things. Hard to find through all the LinkedIn/Twitter noise. And yes, I know startups aren't for everyone, but these are hopefully the better ones. Let me know what you think and hopefully it's helpful to find some interesting opportunities this year: hhttps://startups.gallery/

https://redd.it/1k5z3lv
@r_devops
Pull my head out of my arse on ai agents

I've been using github copilot for awhile. It's ok. My company is pushing AI pretty hard (like everyone else) and we all have a cursor licenses. Again, it's ok. I like the model as something to rubber ducky with and the agent mode to browse through files in an application to answer questions is neat. However, it seems like the industry is pushing more and more towards agentic implementations. Internally, I'm struggling with the idea. I'm in my mid 30s and have been at this for awhile. So this isn't "get off my lawn", but "how can i make something that I won't hate myself for in 6 months".

1) I was watching a video this morning /w bedrock and someone creating a customer service agent to process returns. The ideas are simple enough: model, couple lambdas, and some simple instructions. However, what's to keep the model from hallucinating at any point either to the lambda payload or the customer? We don't really have much control over the outputs. Sure, I could force feed them back in, but again I'm sending more and more requests to a black box. My underlying concern is when I or anyone else pay for a service, we expect that service and want it to be consistent. It seems dangerous to me that we're moving *stuff* out of known happy paths and into a magic box.

2) I've been reading some interesting details on model posioning. At the moment, it's typically by nation states who want to push certain view points and not underlying logic manipulation. However, the concern is still there. I can have code that doesn't change or I can ship requests off to a 3rd party model that could vastly change over time because the data being trained on has changed.

3) Just...why? While there may or may not be a cost savings from human labor (i have no idea i haven't done the math myself), it costs so much more to run a model perpetually than it would to have a web form that links back to the same lambdas.


I have a couple more, but am i wrong in thinking that while the models are neat, it doesn't seem like a great idea?

Regardless, announcements like shopify where they won't hire folks unless they prove it can't be done with AI are rampant and I have to adjust to die, but I don't want to go into that future with my eyes half closed from marketing gimmicks.

https://redd.it/1k6093x
@r_devops
1
Cloud vs Self-Hosted Logging

I'm working on a personal project (SaaS, not launched yet) and need to set up logging.

I'm considering two options:

1. Self-hosting a logging stack like ELK or EFK
2. Free/low-cost cloud-based logging service. I've seen that New Relic has a free tier with a 100GB per month ingest limit, which seems promising. I'm open to other alternatives as well (didn't do much research here).

What would you recommend and why?

https://redd.it/1k61r56
@r_devops
Built a Custom Kubernetes Operator to Deploy a Simple Resume Web Server Using CRDs

Hey folks,

This is my small attempt at learning how to build a custom Kubernetes operator using Kubebuilder. In this project, I created a custom resource called Resume, where you can define experiences, projects, and more. The operator watches this resource and automatically builds a resume website based on the provided data.
https://github.com/JOSHUAJEBARAJ/resume-operator/tree/main

https://redd.it/1k62tgz
@r_devops
There is a possibility that my org may implement DevOps practices…

Hey all!

I made a post here the other day asking about Terraform and CaC tools.

I was given great advice and useful information.

I wanted to reach out and actually provide an update regarding a possible opportunity and possible changes.

The org I work for is a global enterprise. We are a Windows/ Azure org. Our infrastructure is on-premise and in the cloud. I believe we recently moved away from physical servers and now host them using Azure VMs. Not sure if they use Linux or Windows servers though. I’m not that informed.

A year ago, I reached out to the cloud operations lead for the Americas (CAN, USA, LATAM). He told me to study Azure and I may be able to join the team someday. Well, I studied but they ended up hiring someone a bit more experienced. I cannot say I blame them. They were building up that team and needed more experienced people. Instead of holding a grudge, I reached out to the new hire and learned a lot of from him. He actually falls under my region of support so it’s normal that we communicate. Anyways, I eventually asked him about infrastructure as code and how much we used and what tools we used. Currently, the team doesn’t practice DevOps methodology so he didn’t speak much about. Instead, he referred me to the cloud operations lead. I reached out to the lead this morning and randomly just asked him if they were going to hire people once the hiring freeze was over. To my surprise, they are going to hire some people for junior opportunities. This time though, his advice on what to learn was a bit different than before. He advised that I study IaC (Azure native tools such as Bicep, and ARM) and CI/CD pipelines. It seems that my company may start practicing DevOps. Or at least, that is my takeaway.

I’m not sure how much time I have but I was able to get a voucher from MS. AZ-204 is one of the exams I can take for free using this voucher. I’m going to study this and then study AZ-104.

Wish me luck all! This may be my way in! I’m hopeful and excited!

https://redd.it/1k649ri
@r_devops
Devops/SRE AI agents

Has anyone successfully integrated any AI agents or models in their workflows or processes? I am thinking anything from deployment augmentation with AI to incidents management.

-JS

https://redd.it/1k69d11
@r_devops
Bad interview asking for reference from 10 years ago

I just wrapped up an interview, it started out well until the interviewer asked if I could provide references for two of the companies that I worked for in the past. One of those companies was from over 10 years ago, so I politely asked him if he meant another company with a similar name. He said no, he meant the company from 10 years ago. At this point I have a confused look on my face and before I could even tell him that I could provide a reference from that company (even though I thought it was strange given the time and that it wasn't a DevOps role), he goes 'Yeah the company's on your resume isn't it? You work there didn't you?'. At this point I'm all sorts of confused and flustered. I tell him yes I did work at the company and before I can say anything else he says 'you don't keep in touch with people'. I tried to explain that I haven't really kept in touch with anybody from my time there and that I've been out of the local market for a while (don't know why I mentioned that and I regret it now), but I could provide my manager's information. He then goes on to ask me what's wrong with the local market and as I'm answering his question abd talking about how bad the local market is, I'm thinking why am I even talking about this right now? We end up moving on to technical questions, things like ' how does DNS work?', ' how does a CDN work?', ' how does terraform work?', etc. but at this point I'm so flustered and confused about our 10-year-old reference argument that I struggled to answer these basic questions. I honestly don't even understand how a reference from 10 plus years ago and a different role would even be helpful. People change a lot in 10 years and most people don't clearly remember 10 years ago.

Has anyone else been asked for reference 10 plus years ago?

https://redd.it/1k6ai4w
@r_devops
Managing Deployments of gitrepos to servers

I am slowly getting into to devops, however the plethora of tools which all seem to market themselves as the solution for everything it's pretty hard to figure out which is the right way to go. I hope this subreddits experience can guide me in the right direction.

I am managing a variety of services for multiple clients. Each client has one or more vps instances containing multiple services, all running as a docker compose project. Each service has its own git repo, some are client specific (websites) and some are general and reusable (reverse-proxies, paperless, etc.).

I'm now trying to figure out what the best way to approach deployments and updates would be.

My ideal scenario would be a tool which would allow me to:
- Configure which repo (and version) should deploy to which server.
- Execute a workflow/push the repo using ssh-access from a secrets' manager.
- Monitor whether it is successful or not.

My only requirement is to self-host it.

Would gitea or jenkins be the best way to approach this? Thanks for any insights.

https://redd.it/1k6c58t
@r_devops
Is devops relatively hard field to get into as new grad?

How did you get your first DevOps job?

https://redd.it/1k6bwvh
@r_devops