Reddit DevOps
267 subscribers
1 photo
31K links
Reddit DevOps. #devops
Thanks @reddit2telegram and @r_channels
Download Telegram
How can I get started with a career in Devops

Hello was recently laid off from my last position which has me currently looking for new opportunities.


I currently have a bachelor's in programming and 8 years of experience mostly working as a field service technician handling lower level networking and desktop issues.
My last job was a Tier 2 O365 Admin which I worked at for about 10 months.


I've been working on getting azure certifications currently have The Microsoft Certified Enterprise Administrator Expert cert and the Azure Administrator Associate cert. However, I've heard from several people that certifications are worthless without experience. Would the best strategy to break into the field be to just continue working on Certs while applying?

https://redd.it/ojaea6
@r_devops
Query On granting Timed/ON-DEMAND access to databases

Hello folks,

We are using cloudsql in GCP and I'd like to provide a timed access to the databases to the end users. GCP provides IAM authentication to the cloudsql instance-databases, so each IAM user can login using his gsuite/cloud identity credentials.

I'm trying to make it as an on-demand access, i.e user makes a request via slack, the flask app (slack slash command redirect) takes the parameters to grant relevant access to the respective database. After 4 hours, the access has to be removed.

Currently I'm stuck at the expiration part. Unlike other GCP resources where removing users from IAM would work, here I have to login to the database and remove all the granted privileges for the user.

I'm thinking of store the access granted information in a database, have a script poll the db or python schedule functions watch out for the expiration time and start revoke actions. Any inputs on how to go about this? I'm going with the cloud native approach here, is there any better way of handling this on-demand access?

https://redd.it/ojamno
@r_devops
Any open-source tools to provision virtual Kubernetes clusters on-demand?

Any open-source tools to provision virtual Kubernetes clusters on-demand?

Basically, I know there is a loft sh and their o/s vcluster


Are there any open-source alternatives that allow to:


\- provision clusters in k8s namespaces


\- autoscale- put to "sleep mode" the pod & deployed resources on schedule

Anything you can recommend and are using now?
Why would you recommend it?

https://redd.it/ojbxyf
@r_devops
Reading metrics from not instrumented servers

Hi folks! I’m working on a tool for ad-hoc metric extraction and I’m really curious to hear back from the fellow DevOps people. I often find myself in situations when some metrics aren’t collected. Some servers may be simply not onboarded to the metric collection system yet. Some breakdowns may not exist because of the high-cardinality issue, especially per-pod breakdowns. And some metrics we just may haven’t thought about in advance. So, I hacked a tool called pq to parse and query logs as metrics. It seems to work and I have some ideas for further development. For instance, by adding a tiny HTTP adapter to it, it would be able to mimic Prometheus API, so one would be able to attach Grafana to an exposed port and query metrics from the UI. But before going any further with it, I’d like to hear back from the potential users.

Or just share your ways of solving similar problems, it’s also highly appreciated! Cheers!

https://redd.it/ojbgdd
@r_devops
Reading metrics from not instrumented servers

Hi folks! I’m working on a tool for ad-hoc metric extraction and I’m really curious to hear back from the fellow DevOps people. I often find myself in situations when some metrics aren’t collected. Some servers may be simply not onboarded to the metric collection system yet. Some breakdowns may not exist because of the high-cardinality issue, especially per-pod breakdowns. And some metrics we just may haven’t thought about in advance. So, I hacked a tool called pq to parse and query logs as metrics. It seems to work and I have some ideas for further development. For instance, by adding a tiny HTTP adapter to it, it would be able to mimic Prometheus API, so one would be able to attach Grafana to an exposed port and query metrics from the UI. But before going any further with it, I’d like to hear back from the potential users.

Or just share your ways of solving similar problems, it’s also highly appreciated! Cheers!

https://redd.it/ojbg4n
@r_devops
Connecting to your Homelab without a VPN, SSH keys, or Public VP's using AWS Systems Manager Tutorial

The pandemic has made me enter the joys of home lab. I work daily as a cloud infrastructure engineer by day, so learning the home lab stuff has been rewarding. In addition, I have been trying to incorporate my cloud knowledge and skills into my lab environment by making a hybrid cloud approach.

I wanted to connect to my home lab remotely, and my cloud expertise makes me cringe at VPNs even though they have their place. It's just not "Cloud-native However, and I learned you could service in AWS to connect to private instances without the need for ssh keys, public infrastructure, or VPNs. There is a service called Systems Manager that offers a service "Sessions Manager".

I created a tutorial on how to set this up if anyone is interested.

https://youtu.be/Lm9afXJA7Q0

https://redd.it/oj04v4
@r_devops
How to deploy efficiently on digital ocean

Hi everyone, i've just build a multivendor ecommerce webapp using mern stack its a giant project

Now i want to deploy this webapp on digital ocean

Now i just want to know how do i deploy like deploy directly on digital ocean using nginx
Or
Should i use docker/kubernetes then after containerization go for digital ocean

Or

If you know a better way to deploy im open to suggestions

-should i use one server or two

This webapp is going to get around 5000-10000
Per day visits

https://redd.it/oin806
@r_devops
DevOps, what tools do you use for each and when?any stacks?

What tools do you use and when?
What’s your go to tools for:

Source Code Management,

Deployment,

Project Management,

Notifications & incidents,

Data & Visualization,

Session Replay,

SSO/ TFA

When do you not use a tool that you typically do?
Any “stack” that’s recommended for DevOps?

https://redd.it/ojhsf9
@r_devops
How to deploy efficiently

Hi everyone, i've just build a multivendor ecommerce webapp using mern stack its a giant project

Now i want to deploy this webapp on digital ocean

Now i just want to know how do i deploy like deploy directly on digital ocean using nginx
Or
Should i use docker/kubernetes then after containerization go for digital ocean

Or

If you know a better way to deploy im open to suggestions

-should i use one server or two ?

This webapp is going to get around 5000-10000
Per day visits

https://redd.it/oin71c
@r_devops
10 Python modules to simplify your DevOps journey

I was researching this topic(Python modules for DevOps), and as usual, there is a lot of material available on the internet, but they are all over the place. So here is my version; I have put together a list of 10 Python modules, some of which I used frequently and some case-by-case basis. As this is my version :-) so it may not cover all the modules that other DevOps engineers use daily. Would you mind letting me know if there is any module I am missing from the list in the comment section?

https://devopslearning.medium.com/10-python-modules-to-simplify-your-devops-journey-3641b7398b11

https://redd.it/ojk61p
@r_devops
Runbook software

So I work for a small company and I’ve been asked to look into runbook software that we could use. However, most stuff I find is meant for automation. What the business wants is something where they can keep the individual steps to take and it should allow them to log each manual walkthrough - the person performing the task should be able to tick each step as completed as they go through the list.
I’m really lost in this, are there any simple tools (or Jira/Confluence integrations) that would allow you to create such checklists to then tick off and save each time?

https://redd.it/oimt4d
@r_devops
How do you guys securely document your machines?

I've been running into issues working with my machines where it feels like everything lives in the heads of a few people. Any time I need to ssh into an instance in AWS I have to wait for someone to reply with the IP address, even though my credentials are already whitelisted in that instance which becomes annoying. On top of that, there's been a lot of issues with me thinking things are networked one way when they're actually networked another.

Have you all created any single source of truths like a diagram of all your machines with a list of things like their IP addresses/What credentials are needed to access that you stored securely so it doesn't just live in people's heads?

https://redd.it/ojo0h8
@r_devops
Ci/CD stages

I have read several articles and wanted to make sure my understanding is correct wrt ci/cd stages.

Stage 1 : Source where the pipeline picks up code from the commit which triggered the build.

Stage 2 : Build - where the code is built into a war file / docker image etc

Stage 3 : Test - where the compiled artifact is tested using some sort of unit tests.

Stage 4 : Release - where once the above stage is successful the build artifact is deployed to the environment.

The same stages would then repeat in the higher environment using the same commit I'd code/ or be completely independent and only trigger once the changes are merged into master.

https://redd.it/ojoxat
@r_devops
Automated Change Management for Oracle EBS

Has anyone successfully completed 'continuous deployment/delivery' of patches (customization, standard) to Oracle EBS 12.x without using Oracle AMS?

If yes, what were/are the challenges? Did you had to revamp the automation a lot for changes in underlying Oracle tools, standards?

https://redd.it/ojqlsm
@r_devops
Getting started in a DevOps / WD. what do I search for? What was your first job?

I have a fair understanding of throughout the process of planning to completion ( not perfect at all), what is a good entry position/ what was your first job?

Should I search for “ media” as a category? Or what narrows it down to websites ?

https://redd.it/ojq1c8
@r_devops
I'm stuck career-wise and I could use some advice.

I'm a sysadmin of 4 years, and this is my first job out of college. I majored in CS and decided I wanted to be more on the operations side of things, and just use my coding knowledge for automation and to get a leg up.

Now I'm at a place where I want to move into Linux Admin/DevOps/SRE/Cloud (it seems that the entire field is moving there anyway), but I don't have the professional dev experience to do so. Like, sure I can just practice a language or something in my free time, but the killer is my lack of enterprise coding experience. I have nothing to put on a resume, and to be honest I feel like I'd be totally stumped in a dev interview.

I definitely don't want to take a gigantic pay cut and become a junior dev/intern, so I just feel completely stuck and helpless at the moment. Any advice is appriciated.

https://redd.it/ojs8jw
@r_devops
Datadog ECS network monitoring

I’ve recently deployed Datadog agents as an [ECS daemon service](https://docs.datadoghq.com/agent/docker/), and it works great, exposing almost all the metrics we need. There are two things I can’t figure out from the docs:

1. How to enable the [Network](https://docs.datadoghq.com/agent/docker/) integration. The docs say it’s on by default but I’m not seeing them come through

2. Enable ENA monitoring. Our containers are using AWSVPC networking, so they’re being allocated ENAs, which are handling almost/all of our traffic. Based on [this PR](https://github.com/DataDog/integrations-core/pull/8331) it seems like there is a config collect_aws_ena_metrics in the network conf.yaml, but that doesn’t seem configurable via Docker, and the comment seems to say it only applies to hosts

We could potentially deploy both ECS and host-based agents, but there is some duplication and additional complexity there that would ideally be avoided.

Thanks!

https://redd.it/ojs9wp
@r_devops
PagerDuty Required?

I am currently in a devops role that requires me to be on PagerDuty overnight and on weekends. If I want to continue to be a devops engineer, should I expect this role to always require me to be on PagerDuty? Do all companies expect this or do they understand that is what SREs are for

View Poll

https://redd.it/ojuq7v
@r_devops
I developed a tool that allows you to perform automated security audits and code reviews cloud applications by showing vulnerabilities in an easy-to-follow architectural pathway code.

So, the title speaks for itself.

I've been in permanent contact with DevOps Engineers due to my field of work and they've been finding my tool extremely helpful. In a simple manner, CodeShield (name of my tool) performs automated security audits and code reviews cloud applications by showing vulnerabilities in an easy-to-follow architectural pathway code.

If you're experienced with AWS Cloud-Native Apps, I'm pretty sure you might be interested in the working features it offers you.

The early adopters have been finding it tremendously useful and I could not be more excited for the next phase - consolidation.

If I caught your curiosity, feel free to visit Codeshield.io to freely run a test scan on your code and let me know how was your experience doing so.


Would love to hear your feedback!

https://redd.it/ojjmr6
@r_devops
AWS RDS

Hey guys,
I am a DevOps at a data company and deal mainly today with the CI/CD pipelines of the company, writing infra as code and get the infra of the company to a better state.

We use Postgres for new services which are small and run on as an RDS instance.

The company started out at the beginning with a main database in MySQL which is also an RDS instance. However this instance is hard at work, since it records many events and data. After around 2-3 years of activity it reached 1TB of storage need. It has now been upgraded to scale to 1.5TB as a lifeline.

From my point of view it seems like this might be a problem? I mean, pricing wise it does anybody know if an RDS with a bigger storage is priced differently than having an EC2 with the DB and a big EBS attached? Would this be better at large storages deployments?

In addition, how would you approach such a problem? Clean up events from the db which are less relevant for today into a cheaper storage solution and ETL them somehow for querying purposes?

I have less experience working with such dataset sizes so looking for some guidance

Thx for replying 🤩☺️

https://redd.it/ojwnlk
@r_devops