Reddit DevOps
268 subscribers
30.9K links
Reddit DevOps. #devops
Thanks @reddit2telegram and @r_channels
Download Telegram
How are you currently understanding your engineering output?

Hey Devs!

I am an engineering manager at a big tech company and am currently running a 30 person team. The problem here is that I often find it difficult to understand the engineering estimates and hence, actual output of my engineers. Daily stand-ups do not work as these are still estimates and I feel I have a bad control over my resources.

To all CTOs, Engineering Managers/leads, do you feel the same? How are you controlling and managing your teams and how effective has it been? Love to discuss!

https://redd.it/1dvih78
@r_devops
How do you guys handle database migrations and devops stuff?

What approaches do you use to manage database schema changes? How is your DevOps pipeline structured to guarantee comprehensive code testing and automated deployments? What are the strategies you adopt to ensure minimal deployment failures and high-quality code reaching production?

https://redd.it/1dvnwew
@r_devops
Rant: I regret every single moment being in DevOps

Working as a DevOps Architect, started my career a decade back. But every single day I regret if I could go and change anything is to work as a Dev than be in Ops. Worked in several companies as Sysops, PlatOps, SRE. At the end of the day it boils down to the same shit. I can go and write code all day ling than working on shitty YAMLs or some random scripts for automation. The irony is I’m very successful in my current job but deep inside I always hared it. Everyday I need to push myself hard to console myself that u need to do it to get the fat pay cheque at the month end. So anyone who is truly passionate about building stuffs focus on being a dev than a DevOps engineer. Don’t pay the heed to those random advices there is plenty of provisions of automations where u can leverage your coding skills. Trust except few really big orgs there is hardly any. Btw writing those stupid YAML is not coding. It’s always better to start as a Dev and then later move to Cloud Architect or similar roles than do otherwise. In most cases DevOps team budget come from Dev so u are always at their mercy. On top of that On-calls, releases will make your like miserable as crap.

https://redd.it/1dvqvtt
@r_devops
I'm genuinely interested what is the architecture behind managed DBs

Hey guys,


So the title says it all, I am just genuinely interested how server giants do the architecture behind managed DBs(SQL, PgSQL, Redis, etc...)

The reason I am asking is because I am curios to know how do they do it? I assume it is a server that has several virtual machines inside it connected on a public ip for each VM, and each VM is an isolated database, in other words, you cannot SSH into it but each VM contains a DB.

Now some of the providers provide the DB engine and then they provide storage, I also assume that they connect the volume to the engine and from there we proceed to start connecting and working on the DB.

As for failovers, I assume the replicate the DB on a second-by-second basis, or even on every transaction that happens to the DB, but with now the storage being isolated, I guess the DB engine just writes on several disks for replication.

And all that, with a 1GB RAM DB and some 25 SSD storage for 17$/month, so I guess the profit from these technologies is not 100%, it is at least at least 1000%.


Thank you so much for reading my post.

https://redd.it/1dvu0cu
@r_devops
Automating Cloud Infrastructure Incident Response with Runbooks | Open-Source Tools

I've been working with cloud infrastructure for the past 15 years and as new trends come and go, DNS and certificates rotation still remains nr 1 source of middle-of-the-night PagerDuty calls.

There are of course other issues and automating the incident response and resolution steps in runbooks is a no-brainer. However doing so in an automated way without using a proprietary 3rd party system is not as intuitive.

In a recent video I'm showing one possible way of automating runbooks execution with just open source tools.

https://youtu.be/iJS5cFOXcos

How are you automating your incident response process?

https://redd.it/1dvv7uk
@r_devops
Securely Accessing AWS Services from Anywhere with IAM Roles Anywhere

Accessing AWS services securely from anywhere can be a challenge, but a new AWS feature called IAM Roles Anywhere makes it much easier. In my latest blog post, I dive into how IAM Roles Anywhere works and how you can use it to:

• Securely connect to AWS services from any device or location
• Eliminate the need for long-term AWS credentials
• Simplify authentication and authorization for remote access

Check out the full post to learn more about this powerful new AWS capability and how it can benefit your organization:

https://dcgmechanics.medium.com/securely-accessing-aws-services-from-anywhere-with-iam-roles-anywhere-2d61d37eee1f

Any kind of feedback is appreciated!

https://redd.it/1dvwftm
@r_devops
KCL Programming Language v0.9 Released! High Performance, Richer SDKs, Plugins and Integrations.

https://www.kcl-lang.io/blog/2024-07-05-kcl-0.9.0-release

Hi fellas! KCL Programming Language v0.9.0 released! 🙇 Thank you to all community participants! ❤️ Welcome to read and provide feedback! 


https://redd.it/1dvxycv
@r_devops
Troubles with Jaeger

Hey folks. Am I the only one who constantly experiences issues with setting up clusters with Jaeger? For months I am trying to add Jaeger tracing to our stack and invariably I fail to make some apps send traces or even have something show up in Jaeger UI.
I sometimes even fail to get the Bookinfo sample app from Istio to show up traces.
I encounter those issues on both GKE clusters I use at work and also my own Minikube and homelab K3s. I see a lot of value in using it but it’s just constant problems. Am I the only one?
Istio is working very well and I use a lot of its routing features.

https://redd.it/1dvytz8
@r_devops
HELP!?!

PLEASE! Help me I'm very confused how to start with.
I'm about to begin my BCA. And....
If I'll go for Java, then I will have to go for Java Full stack. But seeing every other person do web dev I think it is saturated already.So I was thinking to do smth different.

But at the moment I'm not sure whether I want to go for Ai/ML, Data Science, Devops or anything else other than full stack and all of these require Python.

And some ppl say for going for any of these, we need to do WebDev first..is it so?
So what should I go for Java or Python.

PLEASE someone explain me properly😭

and what all should I do in first year of BCA

https://redd.it/1dvzesw
@r_devops
8 months experience

I started working in this field after spending a year at a big corporation where I handled basic tasks such as password resets and installing packages on machines via SSH. I also have 10 months of experience as a Java developer intern from 7 years ago. Then i got bachelor’s degree in computer science engineering

Today, I feel very behind. In the past 8 months, I've learned to:

- Create cloud functions to open tickets on ServiceNow based on certain events, or to make API calls to retrieve and process information.
- Deploy/backup/restore PaaS instances of PostgreSQL and Redis.
- Manage Kubernetes clusters (I also obtained the CKA certification).
- Use SSH, curl, and Postman effectively for modifications and tests.
- Write small scripts in PowerShell.
- Use Docker to create images, push them to a container registry, and use them.
- Install Helm charts for things like secret operators, and in some cases, modify image pointers to private container registries.
- Update certificates and make changes to the ALT names contained in them.
- Various other tasks I can't recall at the moment.

I'm slow at many tasks and often have to disturb my more experienced colleagues. I feel very insecure about everything. Have you been through this as well?

https://redd.it/1dvu9g1
@r_devops
Looking at DevOps using Goldratt's questions on “technology”.

Recently re-read an article that shows how to apply Goldratt's "value" questions on the topic of what value does DevOps bring. Would love to start a discussion with everyone here about their experience and thoughts on this subject -- https://medium.com/prodopsio/devops-theory-of-constraints-cf1477f9bd1a


https://redd.it/1dw4eem
@r_devops
Gitlab CI: Run job on declined merge request

I'm trying to figure out if it's possible to run a pipeline in Gitlab when a merge request is closed without being merged (i.e. declined or rejected). It doesn't seem like there are any variables that directly determine this, but I'm wondering if there is any way to combine several to get after what I want. Anybody have any ideas?

https://redd.it/1dw8u98
@r_devops
Will I run into issues with Apple Silicon?

Currently I run Linux, macOS Mojave on Intel, Windows, on different machines. I'm thinking of getting an M2 Pro Mac Mini in the near future, but I wonder if I'd have bad luck using it as a development machine. I do mainly web dev and some C/C++ dev. As someone who has worked on lots of various short-term projects that will be untouched for many years and then suddenly I need to run the code again, I wonder if I should be worried. The vibe I get from many "it's fine, don't worry" comments is that I don't need to stress as long as I'm only working on new code - what's the state of all this in 2024?

https://redd.it/1dw9ne3
@r_devops
Python container can't run bash file because it doesn't find it when I run it on Windows, but not on Linux

Had a simple Dockerfile and a simple docker-compose file, I mounted a drive and mapped it to the root of the directory. Then I put an entrypoint.bash file and copied it to a directory inside the container and tried to run it, but when I am on Windows, it doesn't find the file and on Linux it managed to find the file. Is there a reason why that Python container wouldn't work?

https://redd.it/1dw8iyo
@r_devops
How does ArgoCD deploy helm charts?

Hey all,

I'm just starting to work with helm charts and ArgoCD and I noticed something odd. I know ArgoCD deploys our charts successfully to our clusters, but why can't I list the charts it deployed using "helm list -A"? When I or other users try to manually use helm install, I can "list" these installed charts. I assume that because ArgoCD is working on the same charts, I should be able to list it, but clearly I can't. Can someone explain why is that? I'm still a beginner and really curious to know. Thanks.

https://redd.it/1dwbxce
@r_devops
Lists / calendars of virtual DevOps conferences?

Are there any lists or calendars of virtual DevOps conferences, so I can easily work out what might be available for my next expected break in employment (end of contract)?

When I have a break in work (transitioning between companies, layoffs, between contracts), I like to take some of that time to do things that will push my career forward that are also fun for me. Right now, I'm wanting to get some better perspective on the state of the industry more than I'm wanting to build another skill - my resume is already laden with skills. I think I'll get more value from gathering some high-level insight into how people are innovating, what tools are helping them, and how their companies are benefitting from that innovation. I'm thinking a virtual conference might be a good fit.

(an in-person conference is a no-go, my immune system punishes me for being around crowds in ways that would render the value nonexistent)

https://redd.it/1dwdfxg
@r_devops
Does Google Cloud SQL gives you notifications when there are performance issues like slow queries?

Does Google Cloud SQL gives you notifications when there are performance issues like slow queries? I checked the logs, but they don't seem to indicate there is any slow query. Is there a way to look through the logs and find a slow query.

https://redd.it/1dwcpc1
@r_devops
fluxcd ‘git reset’ support ?

Hello guys,
K8S environment with fluxcd + gitlab CE

after running these command:

git reset --hard commit_sha256
git push origin ocloud --force

gitrepository goes to READY FALSE with these error:

failed to checkout and determine revision: unable to clone 'https://xxx.xxx.xxx/xxx-xxx-xxx.git': object not found

024-07-05T21:07:10.792Z error GitRepository/flux-system.flux-system - failed to checkout and determine revision: unable to clone 'https://xxx.xxx.xxx/xxx-xxx-xxx.git': object not found failed to checkout and determine revision: unable to clone 'https://xxx.xxx.xxx/xxx-xxx-xxx.git': object not found
2024-07-05T21:07:10.809Z error GitRepository/flux-system.flux-system - Reconciler error failed to checkout and determine revision: unable to clone 'https://xxx.xxx.xxx/xxx-xxx-xxx.git': object not found

I tried running flux uninstall and re-running bootstrap but the problem persists. Gitrepository remains in ready False

I solve the problem if I recreate a branch from scratch and report the Kubernetes configurations
However, recreating the branch causes problems for our workflow.
How can I solve this problem on this particular branch that is not working?

Steps to reproduce

git reset --hard commit_sha256
git push origin branch_name --force

OS / Distro - Ubuntu 22.04

Flux version - flux: v2.0.0

Flux check
► checking prerequisites
✗ flux 2.0.0 <2.3.0 (new version is available, please upgrade)
Kubernetes 1.27.10 >=1.24.0-0
► checking controllers
helm-controller: deployment ready
ghcr.io/fluxcd/helm-controller:v0.35.0
kustomize-controller: deployment ready
ghcr.io/fluxcd/kustomize-controller:v1.0.0
notification-controller: deployment ready
ghcr.io/fluxcd/notification-controller:v1.0.0
source-controller: deployment ready
ghcr.io/fluxcd/source-controller:v1.0.0
► checking crds
alerts.notification.toolkit.fluxcd.io/v1beta2
buckets.source.toolkit.fluxcd.io/v1beta2
gitrepositories.source.toolkit.fluxcd.io/v1
helmcharts.source.toolkit.fluxcd.io/v1beta2
helmreleases.helm.toolkit.fluxcd.io/v2beta1
helmrepositories.source.toolkit.fluxcd.io/v1beta2
kustomizations.kustomize.toolkit.fluxcd.io/v1
ocirepositories.source.toolkit.fluxcd.io/v1beta2
providers.notification.toolkit.fluxcd.io/v1beta2
receivers.notification.toolkit.fluxcd.io/v1
all checks passed


Thanks in advance to anyone who can help us!

https://redd.it/1dwonk2
@r_devops
How important is cloud service "fit" for startup jobs?

Startups are especially picky on tech stack fit for traditional SWE roles, even moreso in this job market. A Python company will likely choose a great Python developer over a great (other language) developer.

Is the fit requirement (e.g. GCP company = GCP candidate) equally strict for cloud services?

Thank you!

https://redd.it/1dwrqmo
@r_devops
Best way to manage auto backup for self hosted database.

Hi guys, so I am running a mysql db in a container on a vm. I have a db file which I mounted as volume to the db container. I am not really sure of the best practice to manage auto back up for the db.

I have two ideas in mind, feel free to add to them.

1. Have a cron job that auto backup the db after certain period i.e every 12 hours, then another cron job to delete the first 3(the db is kinda large, so it will consume space if I dont auto delete older backups).
2. A cron just that dumps the db to something like s3 buckets.

Not sure which approach is the best and if there are better approach than this.

Would love your input guys.

Ideally would love an approach that wont consume money, but I am also open to paid options.

https://redd.it/1dwrzxy
@r_devops