Do you write test for your code?
I write python scripts to automate stuff usually it never exceeds 1-2k LOC. Also I never bother to write test because I don't see value in testing utility scripts. Once I saw a guy who wrote tests for Helm chart and in my mind this is total waste of time.
Just write a script run it if it fails fix it untill it works. Am I crazy?? What is your way of working?
https://redd.it/1lm5d8r
@r_devops
I write python scripts to automate stuff usually it never exceeds 1-2k LOC. Also I never bother to write test because I don't see value in testing utility scripts. Once I saw a guy who wrote tests for Helm chart and in my mind this is total waste of time.
Just write a script run it if it fails fix it untill it works. Am I crazy?? What is your way of working?
https://redd.it/1lm5d8r
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
What software and coding languages are the most important to learn?
I've been learning python and docker and also in the past learned JavaScript though it's been a while since I used JavaScript. I also am very well versed in Linux terminal commands (I have both a windows and Linux laptop) and have used a virtual machine on Linux in the past.
I want to do the DevOps career path but I want to know what software and coding languages are important to know and learn to be able to do the DevOps career path.
https://redd.it/1lm9eh4
@r_devops
I've been learning python and docker and also in the past learned JavaScript though it's been a while since I used JavaScript. I also am very well versed in Linux terminal commands (I have both a windows and Linux laptop) and have used a virtual machine on Linux in the past.
I want to do the DevOps career path but I want to know what software and coding languages are important to know and learn to be able to do the DevOps career path.
https://redd.it/1lm9eh4
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Changing processes
I work in a pretty decent software department. Good talent, good practices, modern technologies, decent management.
But one thing we can't nail is how to change processes. We have some way we've been doing things, we identify something that needs to be improved but we are failing at transitioning to the new way.
Some people, including staff engineers, believe in these tricke-down initiatives where they pitch a solution, maybe write some article or RFC and they expect everyone to buy in because how awesome this solution is. In their heads it's done. Sounds like circlejerk to me. Some people buy in and most people don't. The old way still works, they are too busy to care and at the end of the day we have 2 ways of doing something instead of 1.
I'm cynical enough to believe that there will only be full adoption if it comes from management and it is mandatory. Management is reluctant to do this because they don't want to create bureaucracy and too many rules. I see the point but it doesn't solve the problem.
I'm not even sure if my autocratic point of view is even the right way. Or are fully adoptions just not happening in medium/large organizations? It just starts to hurt productivity if you need to ask around "so how are we doing this thing now?" too much.
Example: we have 10 different ways we are building and pushing images in different teams/services. We want to unify it using reusable workflows so there's only one way. This is not fully adopted so now we have 11 ways.
Not looking to rant. I'm curious if someone found a smart way to deal with this.
https://redd.it/1lmcflz
@r_devops
I work in a pretty decent software department. Good talent, good practices, modern technologies, decent management.
But one thing we can't nail is how to change processes. We have some way we've been doing things, we identify something that needs to be improved but we are failing at transitioning to the new way.
Some people, including staff engineers, believe in these tricke-down initiatives where they pitch a solution, maybe write some article or RFC and they expect everyone to buy in because how awesome this solution is. In their heads it's done. Sounds like circlejerk to me. Some people buy in and most people don't. The old way still works, they are too busy to care and at the end of the day we have 2 ways of doing something instead of 1.
I'm cynical enough to believe that there will only be full adoption if it comes from management and it is mandatory. Management is reluctant to do this because they don't want to create bureaucracy and too many rules. I see the point but it doesn't solve the problem.
I'm not even sure if my autocratic point of view is even the right way. Or are fully adoptions just not happening in medium/large organizations? It just starts to hurt productivity if you need to ask around "so how are we doing this thing now?" too much.
Example: we have 10 different ways we are building and pushing images in different teams/services. We want to unify it using reusable workflows so there's only one way. This is not fully adopted so now we have 11 ways.
Not looking to rant. I'm curious if someone found a smart way to deal with this.
https://redd.it/1lmcflz
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Advice Needed for DevOps Job
I have been fucking up constantly in my job, mainly due to my lack of time-keeping honestly. A bit of a background, I work for a major MNC Company, and we have many teams and department in this company. Our MNC Company is using Azure PAAS for everything. The company is so big, that just for RBAC alone, we have our own department. Then for Network Firewall, we outsource to a 3rd party company and for Cloud Infra Provisioning, we also have our own department. What i'm trying to say is, when we provision a new resource like Azure Kubernetes, we would need Service Principals and network firewall, and all of this requires a 3-week process. \
Now, I have 4 projects. I haven't been doing a good job at time-keeping and haven't been raising the tickets properly. This RBAC department is notoriously so evil, that they reject any ticket they receive as soon as they see even the most minute mistake, such as KeyVault name needs to be 24 characters long, keyVault name already exists. The funny thing is that, we are required to put 01 at our keyVault, so I was like thinking, what's stopping you from adding as 02? And due to this another 3 days delay, cause I have to go through the approval process again. \
My mistakes were so bad, my boss has already created a group chat with me, my line manager and my project manager, highlighting the mistakes I made, why I keep creating these tickets that are getting rejected, why I am assigning the wrong server owners for it blah blah. \
I have been very sleepless recently, cause I don't feel like I am in control over how long these tickets will take. It's a different feeling if I have the implementation capabilities, but I don't and that's the issue. \
TLDR: A lot of tickets that I raised keep getting rejected over the most minor reasons, Im not good at soft skills to ask why im getting blocked and what not, and I'm delaying our project timeline. Not just one, a few at least. \
I keep feeling like I'm the most irresponsible DevOps in the team. I have completed 2 out of the 4 projects, but at the sacrifice for the other 2 projects. 1 Project have been successfully deployed to PROD (a miracle honestly), the other project had mTLS error due to expiring cert, which was so bad, our services was down for 12 hours, I had to write an RCA report.
https://redd.it/1lmgxot
@r_devops
I have been fucking up constantly in my job, mainly due to my lack of time-keeping honestly. A bit of a background, I work for a major MNC Company, and we have many teams and department in this company. Our MNC Company is using Azure PAAS for everything. The company is so big, that just for RBAC alone, we have our own department. Then for Network Firewall, we outsource to a 3rd party company and for Cloud Infra Provisioning, we also have our own department. What i'm trying to say is, when we provision a new resource like Azure Kubernetes, we would need Service Principals and network firewall, and all of this requires a 3-week process. \
Now, I have 4 projects. I haven't been doing a good job at time-keeping and haven't been raising the tickets properly. This RBAC department is notoriously so evil, that they reject any ticket they receive as soon as they see even the most minute mistake, such as KeyVault name needs to be 24 characters long, keyVault name already exists. The funny thing is that, we are required to put 01 at our keyVault, so I was like thinking, what's stopping you from adding as 02? And due to this another 3 days delay, cause I have to go through the approval process again. \
My mistakes were so bad, my boss has already created a group chat with me, my line manager and my project manager, highlighting the mistakes I made, why I keep creating these tickets that are getting rejected, why I am assigning the wrong server owners for it blah blah. \
I have been very sleepless recently, cause I don't feel like I am in control over how long these tickets will take. It's a different feeling if I have the implementation capabilities, but I don't and that's the issue. \
TLDR: A lot of tickets that I raised keep getting rejected over the most minor reasons, Im not good at soft skills to ask why im getting blocked and what not, and I'm delaying our project timeline. Not just one, a few at least. \
I keep feeling like I'm the most irresponsible DevOps in the team. I have completed 2 out of the 4 projects, but at the sacrifice for the other 2 projects. 1 Project have been successfully deployed to PROD (a miracle honestly), the other project had mTLS error due to expiring cert, which was so bad, our services was down for 12 hours, I had to write an RCA report.
https://redd.it/1lmgxot
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Security of deniable encrypted links
So I am exploring the concept of deniable encryption, where any password is correct, like the XOR algorithm. But there is an entropy problem, where the incorrect password will almost always output non-common characters, I attempted to solve this at it's core by diving into the maths and some research papers but got nowhere, as it seemed to be almost impossible.
What I wanted was an algorithm that would give you perfect plausible deniability, so if you shared a link X with a password you could use a different password and get Y, saying you never intended to share X. I came up with a workaround, it's kind of cool and works. Just adding decoys which are mutable XOR ciphers joined, it allows you to set what other data is included, but it is not the perfect solution I was going for. Demo, Deniable Encrypted Link
I think it would be safe to share data encrypted with this method, I've done some basic brute force tests and did not find any shortcuts, I have a rough estimate of a billion years on a server farm for a 12digit password, and it is cool that every password is technically right.
https://redd.it/1lmj618
@r_devops
So I am exploring the concept of deniable encryption, where any password is correct, like the XOR algorithm. But there is an entropy problem, where the incorrect password will almost always output non-common characters, I attempted to solve this at it's core by diving into the maths and some research papers but got nowhere, as it seemed to be almost impossible.
What I wanted was an algorithm that would give you perfect plausible deniability, so if you shared a link X with a password you could use a different password and get Y, saying you never intended to share X. I came up with a workaround, it's kind of cool and works. Just adding decoys which are mutable XOR ciphers joined, it allows you to set what other data is included, but it is not the perfect solution I was going for. Demo, Deniable Encrypted Link
I think it would be safe to share data encrypted with this method, I've done some basic brute force tests and did not find any shortcuts, I have a rough estimate of a billion years on a server farm for a 12digit password, and it is cool that every password is technically right.
https://redd.it/1lmj618
@r_devops
QR Catalyst
Anonymous Link Sharing | QR Catalyst
Share links anonymously with password protection and encode/decode text with compression. Fully client-side.
A small utility to add a security check before running remote installer scripts in pipelines.
Hey everyone,
We've all been there. You need to install a tool in a Dockerfile or a CI/CD pipeline, and the official method is:
# Super convenient, but always feels a bit sketchy...
curl -sSL https://some-tool.com/install.sh | bash
This works, but it's a blind trust fall. What if the script changes without you knowing? A typo could be added, or worse, something malicious. The usual alternative is to manually download, inspect, and run the script, which is safer but breaks the convenience of automation. So, I built a small, single-file bash utility called `vet` to solve this. The idea is to keep the convenience but add a transparent security layer right on the command line.
**What vet does:** It wraps the execution of a remote script in a safe, interactive workflow:
* **Shows you a diff:** If you've run the script before, `vet` caches it. The next time you run it, it will automatically show you a diff of exactly what has changed. No more silent updates.
* **Integrates with ShellCheck:** If you have `shellcheck` installed, `vet` will run it against the downloaded script first and warn you about any potential issues before you even review it.
* **Requires explicit confirmation:** After the diff and linting, it still prompts you for a final \[y/N\] before executing anything.
Here’s how the workflow changes:
**Before (The risky way):**
curl -sSL https://nvm.sh/install | bash
**After (The vet way):**
# In your terminal, this will prompt you.
vet https://nvm.sh/install
# In a pipeline, after you've audited the script.
vet --force https://nvm.sh/install
**GitHub Repo:** [https://github.com/vet-run/vet](https://github.com/vet-run/vet)
Would love to hear your thoughts, feedback, or critiques. Is this something you'd find useful in your own pipelines?
Thanks
https://redd.it/1lmmje2
@r_devops
Hey everyone,
We've all been there. You need to install a tool in a Dockerfile or a CI/CD pipeline, and the official method is:
# Super convenient, but always feels a bit sketchy...
curl -sSL https://some-tool.com/install.sh | bash
This works, but it's a blind trust fall. What if the script changes without you knowing? A typo could be added, or worse, something malicious. The usual alternative is to manually download, inspect, and run the script, which is safer but breaks the convenience of automation. So, I built a small, single-file bash utility called `vet` to solve this. The idea is to keep the convenience but add a transparent security layer right on the command line.
**What vet does:** It wraps the execution of a remote script in a safe, interactive workflow:
* **Shows you a diff:** If you've run the script before, `vet` caches it. The next time you run it, it will automatically show you a diff of exactly what has changed. No more silent updates.
* **Integrates with ShellCheck:** If you have `shellcheck` installed, `vet` will run it against the downloaded script first and warn you about any potential issues before you even review it.
* **Requires explicit confirmation:** After the diff and linting, it still prompts you for a final \[y/N\] before executing anything.
Here’s how the workflow changes:
**Before (The risky way):**
curl -sSL https://nvm.sh/install | bash
**After (The vet way):**
# In your terminal, this will prompt you.
vet https://nvm.sh/install
# In a pipeline, after you've audited the script.
vet --force https://nvm.sh/install
**GitHub Repo:** [https://github.com/vet-run/vet](https://github.com/vet-run/vet)
Would love to hear your thoughts, feedback, or critiques. Is this something you'd find useful in your own pipelines?
Thanks
https://redd.it/1lmmje2
@r_devops
Exploring the Future of Developer Tools: Memory-Driven Automation and Local AI Kernels
Hi everyone, I’ve been working on a concept aimed at transforming how developers interact with their workflows and tools. The idea revolves around creating a memory and automation layer that lives locally alongside AI kernels think of it as a personal assistant that remembers your context, tools, and preferences, rather than trying to know everything. What makes this different: Always-on, local-first operation for privacy and low latency Complete sovereignty over your data and workflows Deep, actionable integration with developer tools (editors, version control, CI/CD) to automate repetitive tasks, surface relevant context, and provide traceability across multi-feature projects Designed for real project continuity: persistent memory, version awareness, and workflow automation not just chat history I’m still in the early stages and haven’t shipped anything yet, but I’m excited about the potential here. I’d love to hear your thoughts on the challenges or opportunities you see in this space. What would you want from a developer-centric AI assistant that truly understands your workflow and project history? I’m sharing this to get feedback and connect with others passionate about AI and developer tooling. Looking forward to your insights!
https://redd.it/1lmp94l
@r_devops
Hi everyone, I’ve been working on a concept aimed at transforming how developers interact with their workflows and tools. The idea revolves around creating a memory and automation layer that lives locally alongside AI kernels think of it as a personal assistant that remembers your context, tools, and preferences, rather than trying to know everything. What makes this different: Always-on, local-first operation for privacy and low latency Complete sovereignty over your data and workflows Deep, actionable integration with developer tools (editors, version control, CI/CD) to automate repetitive tasks, surface relevant context, and provide traceability across multi-feature projects Designed for real project continuity: persistent memory, version awareness, and workflow automation not just chat history I’m still in the early stages and haven’t shipped anything yet, but I’m excited about the potential here. I’d love to hear your thoughts on the challenges or opportunities you see in this space. What would you want from a developer-centric AI assistant that truly understands your workflow and project history? I’m sharing this to get feedback and connect with others passionate about AI and developer tooling. Looking forward to your insights!
https://redd.it/1lmp94l
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Just started my Devops journey
Hi,
I have overall 3 years of experience as system Admin and recently cleared my RHCSA exam.
I want to switch my career to Devops profile and for this I learnt Linux and now I am learning Git and Git hub. I have learnt fundamental of Git and Git hub like init, push, pull, clone, fork, Authentication type like ssh and PAT,etc.
Now I need study partner, who is also learning Devops and also happy to connect with someone who is ready to help whenever I stuck anywhere.
Anyone who is open to connect, just dm me.
Thanks for your help and support.
https://redd.it/1lmtgo3
@r_devops
Hi,
I have overall 3 years of experience as system Admin and recently cleared my RHCSA exam.
I want to switch my career to Devops profile and for this I learnt Linux and now I am learning Git and Git hub. I have learnt fundamental of Git and Git hub like init, push, pull, clone, fork, Authentication type like ssh and PAT,etc.
Now I need study partner, who is also learning Devops and also happy to connect with someone who is ready to help whenever I stuck anywhere.
Anyone who is open to connect, just dm me.
Thanks for your help and support.
https://redd.it/1lmtgo3
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
SRE Interview Coming up, no Experience
I have an interview for a Site Reliability Engineer role, but i have no experience in it! I only trained as an SDET, so i was surprised when a company reached out for this SRE position, i honestly have no background in it at all
What kind of questions should i expect?
They also mentioned there will be a technical interview and that i need to share my screen with them! What kind of coding tasks or other topics might they ask about?
Please help this person land the job!😅
https://redd.it/1lmtnm3
@r_devops
I have an interview for a Site Reliability Engineer role, but i have no experience in it! I only trained as an SDET, so i was surprised when a company reached out for this SRE position, i honestly have no background in it at all
What kind of questions should i expect?
They also mentioned there will be a technical interview and that i need to share my screen with them! What kind of coding tasks or other topics might they ask about?
Please help this person land the job!😅
https://redd.it/1lmtnm3
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Adding personal account to work laptop?
Hey! I’m currently an intern and I have a really great work laptop. I need some extra material to use during my projects - mainly some notes from my uni courses that are on my student account. I was wondering if it would be wrong for me to add my personal university account and download the notes from my drive? I don’t really care too much if they have access to it and I can delete it. If anything the notes are legally protected by the professor so only if you have taken the courses you can have the notes and if you haven’t it would be legal trouble
https://redd.it/1lmwfrq
@r_devops
Hey! I’m currently an intern and I have a really great work laptop. I need some extra material to use during my projects - mainly some notes from my uni courses that are on my student account. I was wondering if it would be wrong for me to add my personal university account and download the notes from my drive? I don’t really care too much if they have access to it and I can delete it. If anything the notes are legally protected by the professor so only if you have taken the courses you can have the notes and if you haven’t it would be legal trouble
https://redd.it/1lmwfrq
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Learning to Build an AI Agent for DevOps – What Would Actually Make It Useful?
Yo! I’m in the process of learning how to build AI agents, and I’m trying to figure out how to make one genuinely useful for my team at work (DevOps/SRE focus). The idea is to create a bot that helps troubleshoot issues, remembers past incidents, and maybe even catches patterns we’d normally miss—kind of like a second brain that never forgets weird root causes.
Right now mine call
Parse incident docs and chunk them into embeddings for semantic search - not very hard
Let you chat with it to troubleshoot or recall past issues (as long as the app is running)
Run locally as a CLI, but could grow into a Slack bot or web UI later
What I’m trying to learn is:
If you had something like this, what would actually make it valuable for you and your team?
Would you want it to:
Surface similar past incidents automatically?
Suggest fixes or known playbooks?
Explain confusing Terraform or k8s configs?
Help triage alerts and logs?
Say “this looks like that one outage in April”?
Also: are any of you already using tools like this? Whether it's scripts, platforms, or vendor stuff—I’d love to know what’s out there and whether it’s worth the cost.
I’m not trying to pitch anything—just hoping to learn from others building or using AI in this space. Appreciate any thoughts, feedback, or links.
https://redd.it/1lmyg9w
@r_devops
Yo! I’m in the process of learning how to build AI agents, and I’m trying to figure out how to make one genuinely useful for my team at work (DevOps/SRE focus). The idea is to create a bot that helps troubleshoot issues, remembers past incidents, and maybe even catches patterns we’d normally miss—kind of like a second brain that never forgets weird root causes.
Right now mine call
Parse incident docs and chunk them into embeddings for semantic search - not very hard
Let you chat with it to troubleshoot or recall past issues (as long as the app is running)
Run locally as a CLI, but could grow into a Slack bot or web UI later
What I’m trying to learn is:
If you had something like this, what would actually make it valuable for you and your team?
Would you want it to:
Surface similar past incidents automatically?
Suggest fixes or known playbooks?
Explain confusing Terraform or k8s configs?
Help triage alerts and logs?
Say “this looks like that one outage in April”?
Also: are any of you already using tools like this? Whether it's scripts, platforms, or vendor stuff—I’d love to know what’s out there and whether it’s worth the cost.
I’m not trying to pitch anything—just hoping to learn from others building or using AI in this space. Appreciate any thoughts, feedback, or links.
https://redd.it/1lmyg9w
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Why do so few AI projects have real observability?
So many teams are shipping AI agents, co-pilots, chatbots — but barely track what’s happening under the hood.
If an AI assistant gives a bad answer, where did it fail? If an SMB loses a sale because the bot didn’t hand off to a human, where’s the trace?
Observability should be standard for AI stacks:
• Traces for every agent step (MCP calls, vector search, plugin actions)
• Logs structured with context you can query
• Metrics to show ROI (good answers vs. hallucinations, conversions driven)
• Real-time dashboards business owners actually understand
SMBs want trust, devs need debuggability, and enterprises need audit trails — yet most teams treat AI like a black box.
Curious:
→ If you run an AI product, what do you trace today?
→ What’s missing in your LLM or agent logs?
→ What would real end-to-end OTEL look like for your use case?
Working on it now — here’s a longer breakdown if you want it: https://go.fabswill.com/otelmcpandmore
https://redd.it/1ln24vo
@r_devops
So many teams are shipping AI agents, co-pilots, chatbots — but barely track what’s happening under the hood.
If an AI assistant gives a bad answer, where did it fail? If an SMB loses a sale because the bot didn’t hand off to a human, where’s the trace?
Observability should be standard for AI stacks:
• Traces for every agent step (MCP calls, vector search, plugin actions)
• Logs structured with context you can query
• Metrics to show ROI (good answers vs. hallucinations, conversions driven)
• Real-time dashboards business owners actually understand
SMBs want trust, devs need debuggability, and enterprises need audit trails — yet most teams treat AI like a black box.
Curious:
→ If you run an AI product, what do you trace today?
→ What’s missing in your LLM or agent logs?
→ What would real end-to-end OTEL look like for your use case?
Working on it now — here’s a longer breakdown if you want it: https://go.fabswill.com/otelmcpandmore
https://redd.it/1ln24vo
@r_devops
YouTube
End-to-End Observability with OpenTelemetry + MCP & Semantic Search | Next.js, .NET, Qdrant, Docker
🔍 How should you think about Observability in modern AI-powered apps?
In this deep-dive session, we tackle End-to-End Observability using OpenTelemetry — plus we build a Model Context Protocol (MCP) server with Semantic Search powered by Qdrant, Next.js…
In this deep-dive session, we tackle End-to-End Observability using OpenTelemetry — plus we build a Model Context Protocol (MCP) server with Semantic Search powered by Qdrant, Next.js…
Java vs python
What should I learn , Java or python, for DevOps.
I am really confused between these two languages.
Please help.
https://redd.it/1ln6gde
@r_devops
What should I learn , Java or python, for DevOps.
I am really confused between these two languages.
Please help.
https://redd.it/1ln6gde
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
ISO 27001 Audit with a Self-Hosted Dashboard – Here’s the Behind-the-Scenes
Last week, I posted "How we left AWS, kept ISO 27001, and cut cloud costs by 90% (with Hetzner/OVH + Ansible stack)" and now I am back with a follow-up:
This self-hosted SaaS Passed Its ISO 27001 Audit: Here’s The Dashboard That Did It.
I built an internal dashboard to track every control, asset, risk, and audit trail, without paying for some overpriced compliance platform.
I wrote up the whole story (and included screenshots + methodology) here:
This self-hosted SaaS passed its ISO 27001 audit – here’s the dashboard that did it
If you’re bootstrapping, running open-source, or just hate “compliance theater”, this might be useful. Would love feedback, especially from others who’ve been through similar audits.
Note: \~80% of what I built is shared publicly across HN, Reddit comments, and the full breakdown on Medium (including screenshots + methodology). It’s an open build-in-public process that might help others skip overpriced compliance platforms.
I’m bootstrapping this and sharing the journey openly. There is an option to buy playbooks but it is not need to get value from my content. If that’s not the right vibe for this sub, I’ll take the feedback. No hard feelings.
https://redd.it/1ln990i
@r_devops
Last week, I posted "How we left AWS, kept ISO 27001, and cut cloud costs by 90% (with Hetzner/OVH + Ansible stack)" and now I am back with a follow-up:
This self-hosted SaaS Passed Its ISO 27001 Audit: Here’s The Dashboard That Did It.
I built an internal dashboard to track every control, asset, risk, and audit trail, without paying for some overpriced compliance platform.
I wrote up the whole story (and included screenshots + methodology) here:
This self-hosted SaaS passed its ISO 27001 audit – here’s the dashboard that did it
If you’re bootstrapping, running open-source, or just hate “compliance theater”, this might be useful. Would love feedback, especially from others who’ve been through similar audits.
Note: \~80% of what I built is shared publicly across HN, Reddit comments, and the full breakdown on Medium (including screenshots + methodology). It’s an open build-in-public process that might help others skip overpriced compliance platforms.
I’m bootstrapping this and sharing the journey openly. There is an option to buy playbooks but it is not need to get value from my content. If that’s not the right vibe for this sub, I’ll take the feedback. No hard feelings.
https://redd.it/1ln990i
@r_devops
Medium
This self-hosted SaaS Passed Its ISO 27001 Audit: Here’s The Dashboard That Did It.
How we turned the annual audit from a six-week nightmare into a non-event, and built a compliance machine.
How do you deal with devs?
Basically I was hired in small company (about 50 it employees) as a devops engineer. I’m third devops in the company and our task is basically cleaning up all our apps and implementing best practices (IaC, CI/CD). We have a great ops team (i.e. sys admins) that support our vision but our devs are not so fond of it.
We have a lot manual deployments (git pull/ docker compose up), no ci/cd, no orchestration and just now are implementing vlans.
When we are suggesting improvements, like setting up nexus proxy repo to start preparing for disconnecting from docker hub or npm, we are often ignored and devs continue pulling packages directly from anywhere they want. When we are suggesting setting immutable docker tags (not latest of course) they oppose because “it’s too hard to track which version to assign if there’s >1 dev working in 1 project”.
How do you deal with such situations? I’m not sure we can support from C-suite since we are not an traditional IT company, more like a medtech with heavy focus on med and just improving tech side because it started working too bad (we had like 3-4 incidents per week about a year ago when leadership decided we need to invest in better infrastructure, observability, etc )
https://redd.it/1lna13j
@r_devops
Basically I was hired in small company (about 50 it employees) as a devops engineer. I’m third devops in the company and our task is basically cleaning up all our apps and implementing best practices (IaC, CI/CD). We have a great ops team (i.e. sys admins) that support our vision but our devs are not so fond of it.
We have a lot manual deployments (git pull/ docker compose up), no ci/cd, no orchestration and just now are implementing vlans.
When we are suggesting improvements, like setting up nexus proxy repo to start preparing for disconnecting from docker hub or npm, we are often ignored and devs continue pulling packages directly from anywhere they want. When we are suggesting setting immutable docker tags (not latest of course) they oppose because “it’s too hard to track which version to assign if there’s >1 dev working in 1 project”.
How do you deal with such situations? I’m not sure we can support from C-suite since we are not an traditional IT company, more like a medtech with heavy focus on med and just improving tech side because it started working too bad (we had like 3-4 incidents per week about a year ago when leadership decided we need to invest in better infrastructure, observability, etc )
https://redd.it/1lna13j
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
How do you view the future?
I have seen opinions here and there about how DevOps as an idea will disappear soon with services trying to replace it and automate it and what not. While I am not a DevOps engineer, I felt intrigued to ask and understand as I always thoughts that DevOps was more of a company’s Frankenstein and not something for all.
And away from the AI drama, how do you view the future of DevOps? Will it transform? Is there a common channel for another role, like cloud engineer or SRE?
https://redd.it/1lnb78c
@r_devops
I have seen opinions here and there about how DevOps as an idea will disappear soon with services trying to replace it and automate it and what not. While I am not a DevOps engineer, I felt intrigued to ask and understand as I always thoughts that DevOps was more of a company’s Frankenstein and not something for all.
And away from the AI drama, how do you view the future of DevOps? Will it transform? Is there a common channel for another role, like cloud engineer or SRE?
https://redd.it/1lnb78c
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
What have you found the most useful course you've taken?
For example, when I first was getting into the Cloud, I personally found Adrian Cantrill's course (for Solutions Architect Associate) really useful, both in the sense that it was teaching me about the Cloud, but also in the preliminary phase was teaching about tech in general, such as IPs (and how they're originally in octets), the OSI model, etc.
I'm a bit more advanced now. Some time ago I was studying for the CKA and I found Kodekloud's labs incredibly useful to understand Kubernetes.
Besides courses, obviously we learn on the spot, we have to write research spikes, we create good documentation... but what have you guys found to be the 'golden standard' or not even gold standard, just incredibly good or useful course in our field. (This can be the core of DevOps, or specializations, e.g. you were interested in SRE, so decided to read Google's SRE book, and then go through a XYZ course).
https://redd.it/1lnc7y3
@r_devops
For example, when I first was getting into the Cloud, I personally found Adrian Cantrill's course (for Solutions Architect Associate) really useful, both in the sense that it was teaching me about the Cloud, but also in the preliminary phase was teaching about tech in general, such as IPs (and how they're originally in octets), the OSI model, etc.
I'm a bit more advanced now. Some time ago I was studying for the CKA and I found Kodekloud's labs incredibly useful to understand Kubernetes.
Besides courses, obviously we learn on the spot, we have to write research spikes, we create good documentation... but what have you guys found to be the 'golden standard' or not even gold standard, just incredibly good or useful course in our field. (This can be the core of DevOps, or specializations, e.g. you were interested in SRE, so decided to read Google's SRE book, and then go through a XYZ course).
https://redd.it/1lnc7y3
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Looking for DevOps job in Canada– any leads?
Hey folks,
I’m a DevOps engineer with 5+ years of experience (AWS, GitLab CI/CD, Docker, K8s, Terraform, etc.) currently looking for a new opportunity (open to hybrid/onsite or Remote ).
If you know of any companies hiring or can refer me somewhere, I’d really appreciate it! Happy to DM my resume or chat more.
Thanks in advance! 🙏
https://redd.it/1lncn4v
@r_devops
Hey folks,
I’m a DevOps engineer with 5+ years of experience (AWS, GitLab CI/CD, Docker, K8s, Terraform, etc.) currently looking for a new opportunity (open to hybrid/onsite or Remote ).
If you know of any companies hiring or can refer me somewhere, I’d really appreciate it! Happy to DM my resume or chat more.
Thanks in advance! 🙏
https://redd.it/1lncn4v
@r_devops
Reddit
Looking for DevOps job in Canada– any leads? : r/devops
410K subscribers in the devops community.
AI risk is growing faster than your controls?
Hey guys, I'm the founder of a company called Jozu, which is a model integrity platform. I've been noticing a bit of a trend when talking with companies that are looking at adopting our solution and am curious how prevalent this is.
The TL;DR is that AI models aren't governed like first-class assets (eg application code)
Your artifacts that scattered across Git, S3, HF Hub, MLflow, and Jupyter, your models aren't consistently versioned. Second, It's unclear who signs off on what goes into production, and auditing changes for your customers or regulators is a nightmare.
This is caused by ad-hoc promotion scripts, dependence on tribal knowledge, unclear rollback versioning and processes, fragile change and lineage tracking, and manual auditing across multiple systems.
Since ML maturity varies so much from org to org, that it's hard to know what is and isn't normal.
https://redd.it/1lncd78
@r_devops
Hey guys, I'm the founder of a company called Jozu, which is a model integrity platform. I've been noticing a bit of a trend when talking with companies that are looking at adopting our solution and am curious how prevalent this is.
The TL;DR is that AI models aren't governed like first-class assets (eg application code)
Your artifacts that scattered across Git, S3, HF Hub, MLflow, and Jupyter, your models aren't consistently versioned. Second, It's unclear who signs off on what goes into production, and auditing changes for your customers or regulators is a nightmare.
This is caused by ad-hoc promotion scripts, dependence on tribal knowledge, unclear rollback versioning and processes, fragile change and lineage tracking, and manual auditing across multiple systems.
Since ML maturity varies so much from org to org, that it's hard to know what is and isn't normal.
https://redd.it/1lncd78
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Amazon Devops Role Experience
I recently applied for Devops role in Amazon. I'm not looking for switch but I'm targeting MAANG in coming years so I applied for this position to get at least experience of hiring process and surprisingly my resume got shortlisted and I received an assessment link.
There were user experience, work style and Devops related questions. I did good only in the last section but fortunately I received call from HR after 4 days from assessment. 🤞
She took all the basic details and asked me how good I'm at coding. I showed my stupidity here by being brutally honest. I replied that " I am mostly working on kubernetes and AI ML part in my company so In coding I would rate myself 6/10 "
And here we go.... Instant Regret ! 🥹
I never heard back from HR.
But now that my urge for these companies has already increased, I want to give another shot after few months.
I'm sharing this experience just to know how I can prepare myself and what skills I should develop
to stand out from crowd of experienced people. ✨
Happy Learning !!!
https://redd.it/1lnez95
@r_devops
I recently applied for Devops role in Amazon. I'm not looking for switch but I'm targeting MAANG in coming years so I applied for this position to get at least experience of hiring process and surprisingly my resume got shortlisted and I received an assessment link.
There were user experience, work style and Devops related questions. I did good only in the last section but fortunately I received call from HR after 4 days from assessment. 🤞
She took all the basic details and asked me how good I'm at coding. I showed my stupidity here by being brutally honest. I replied that " I am mostly working on kubernetes and AI ML part in my company so In coding I would rate myself 6/10 "
And here we go.... Instant Regret ! 🥹
I never heard back from HR.
But now that my urge for these companies has already increased, I want to give another shot after few months.
I'm sharing this experience just to know how I can prepare myself and what skills I should develop
to stand out from crowd of experienced people. ✨
Happy Learning !!!
https://redd.it/1lnez95
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
How do you handle trusted software delivery at a global scale?
Hey 👋
Right now I’m working on something pretty exciting (and a bit nerve-wracking, not gonna lie):
We have a global customer base, teams spread across Australia, the US, and Europe, and I need to build an infrastructure that ensures they can quickly and securely fetch container images from a registry that’s geographically close to them.
But speed isn’t enough.
I also need to guarantee that what they pull is exactly what I built, no tampering, no surprises, just trust.
So this isn’t just about performance, but it’s about authenticity and integrity.
When a customer deploys my software, I want them to know:
1. It came from us
2. It hasn’t been touched
3. It’s the version they expected
Still brainstorming the best way to approach this (edge replication? verified signatures? something more elegant?), but would love to hear how others tackled similar challenges.
How do you handle trusted software delivery at a global scale?
https://redd.it/1ln9tqb
@r_devops
Hey 👋
Right now I’m working on something pretty exciting (and a bit nerve-wracking, not gonna lie):
We have a global customer base, teams spread across Australia, the US, and Europe, and I need to build an infrastructure that ensures they can quickly and securely fetch container images from a registry that’s geographically close to them.
But speed isn’t enough.
I also need to guarantee that what they pull is exactly what I built, no tampering, no surprises, just trust.
So this isn’t just about performance, but it’s about authenticity and integrity.
When a customer deploys my software, I want them to know:
1. It came from us
2. It hasn’t been touched
3. It’s the version they expected
Still brainstorming the best way to approach this (edge replication? verified signatures? something more elegant?), but would love to hear how others tackled similar challenges.
How do you handle trusted software delivery at a global scale?
https://redd.it/1ln9tqb
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community