Reddit DevOps
271 subscribers
11 photos
31.1K links
Reddit DevOps. #devops
Thanks @reddit2telegram and @r_channels
Download Telegram
First day as a Cloud & Devops

I will mainly work with AWS so any advice with best practices , well architected , security , logs , best cheap options etc.. and should i use ECS or EKS for a food order app .

This company didnt had any filled mentioned position so i must do all the things from scratch .

I appreciate your help thank you.

https://redd.it/10vx92a
@r_devops
Devops Apprenticeships?

I didn’t know these existed. Does this seem legit? I’m working help desk and plan was to move onto NOC for a year and then try to get a Linux admin or cloud support job. Wondering if this would be a better path.

https://redd.it/10vz0ck
@r_devops
How do you manage deployments with multiple repos?

Currently working on a project that has multiple repos and git submodules are really killing me. --recurse-submodules does the trick but I must edit first the .gitmodules url from https to ssh.

How do you do this in your projects? Is deploying them as a whole a good idea or deploying on a per repo basis better?

https://redd.it/10w09m5
@r_devops
Single node to get started with kubernetes

I have created a single node cluster on Digital Ocean to start learning kubernetes. The plan is to start off by deploying a simple web app to the cluster. I'm relatively new to kubernetes, so I'm wondering whether s single node is sufficient for learning purposes. I can't afford more than one for now, as I'm quite impoverished. What can be achieved with single node, and what would be the limitations?

https://redd.it/10vw57b
@r_devops
No, Platform Engineering Will NOT Do What You Think It Will Do

I keep seeing "Platform Engineering" everywhere nowadays and people compare it to DevOps.

Here is what I think about it; it does not matter if it is Platform Engineering or DevOps, it is the skill-set and cultural change we really need.

DevOps started as a "cultural change in teams" but when the tools emerged to fill the gaps for DevOps, we started seeing DevOps as a role that can use tools (Jenkins, Gitlab, AWS, GCP, etc ...).

The same will/may happen to Platform Engineering. If Platform Engineering becomes something in next years, we will see tools for Platform Engineers and then we will see job posts to look for people who knows those tools.

What we really need to focus on is not tools and that is why I think Platform Engineering will not solve your next big problem.

I have a longer article on this topic here:
https://blog.demir.io/no-platform-engineering-will-not-do-what-you-think-it-will-do-cd331407a7fa

https://redd.it/10w21dc
@r_devops
Tools to use to make high level Architecture diagrams

Can anyone recommend the best tools to use to make high level animated application/infra architecture diagrams where we can design animated flows for better understanding

https://redd.it/10w4mn9
@r_devops
How can I configure and access ElastiCache for memcached from my Lightsail instance?

i have a WordPress instance on my lightsail , so to make instance handle with height traffic , i will use caching , i prefer to use Memcached than Redis , but the problem is this service is for ec2 , so on not on ec2 , i don't want to connect to it directly , i don't need it , i just need elasticache Memcached , if you know any method to make that possible , and thanks a lot for you help

https://redd.it/10w4t7g
@r_devops
Kajabi technology

I'm not sure what its called, but I have a client that wants to copy how Kajabi works specifically the website builder one.

So I have this project where we host the DNS in our domain let's say its wildcard is *.example.com. So the heads want to replicate the kajabi feature where clients can customize their DNS (with our site content) to company_name.example.com, we made that one work.

Now they want the full Kajabi experience, so if the client wants to use our content but wants the DNS to be on their product name it will be company_name.com (with wildcard as *.example.com instead of just company_name.example.com)

hope you guys can point me in the right direction here.


For ref:

https://kajabi.com/features/websites

https://redd.it/10w3njw
@r_devops
Where do you put your Guides?

I'm creating internal tools for the company I work for (Terraform projects, python tools, internal React pages, etc.)
My team likes to use Confluence pages, but it's a bit of dumping ground of thoughts so it's difficult to find guides quickly (Theres a backlog item! It's just extremely low priority)
Where do you keep your guides for using the tooling? Not just how it works, build stuff, maintaining it, etc.
In a separate wiki/Confluence page? In a readme in the repo? Word document in the repo? SharePoint page?

https://redd.it/10w9khf
@r_devops
Help Please! So I’m coming from a BA role but just landed an interview for DevOps Release Engineer

I’ve searched the sub but am not clear still on how to best prepare for the interview. I understand the DevOps mindset but am concerned about the technical bits.

What hard skills and/or technical talking points should I be ready for? Besides tooling i.e. Jenkins for CI/CD and K8s for deployment etc

I’ve done a little bit of SDET through internship. I know SDLC/STLC, Jira, Confluence, Selenium just rattling a few

I’m also currently reading Continuous Deployment by Jez Humble..

How can I best translate my requirements elicitation knowledge into the new role?

All constructive feedback welcome.

https://redd.it/10wb1dg
@r_devops
Rebuilding my careerv (I got laid off). I could use some advice.

Hey guys,

I got laid off five months ago from a Site Reliability Engineer position. I worked heavily in incident response. My physical and mental health took a massive dive during covid, which led me to be laid off. I had to work on my health before working 12 hours a day to catch up.

I haven't had to deal with the interview process in almost eight years. I am horrible at it. I have been looking for DevOps and SRE roles, but haven't had much luck in interviews.

I have worked in IE and SRE for ten years, so I know not all hope is lost. I love my industry and want to keep at it.

I'd love some good resources and books on DevOps and some thoughts on how to build myself back up.

Here is what I have been working on.

* Wrote a Python Module that builds objects based on responses of the official MLB Stats REST API. I worked and mentored a friend of mine who is a CS major that never was able to get hired. It was a lot of fun and helped build confidence in my Python Skillset.
* AWS studies. I primarily used GCP at my last job, but really didn't get much time to develop and interface with it.
* I have been working on a project to build a Kubernetes cluster from scratch in a free-tier AWS cluster. The first part of the project was to build it from BASH scripts, and now I'm working on ansible configuration management to build it. The idea is to automate the entire deployment, so I can spin up another free tier account when I run out of resources. I also plan to do this with GCP.
* Studying and Reading books on microservices and Kubernetes certifications.

Some thoughts on what I want to begin working on soon

* Create a Docker application that utilizes my Python MLB Module, and write a deployment pipeline that deploys it to a free tier Kubernetes cluster from my ansible scripts.

I wasted some cash on some DevOps books. They just are how-to guides on how to automate certain tasks, which is great, but I can already figure that material out. I'm looking for a good book on the principles of DevOps so I can have better discussions during interviews.

https://redd.it/10wfjyz
@r_devops
kaniko cant copy venv to final container

I am running in to a weird issue where I can't copy a venv between the build container and the final container. I am using

COPY --from=builder /builds/virtual-enviroment/venv /project/venv

and get the error

error building image: error building stage: failed to execute command: resolving src: failed to get fileinfo for /kaniko/0/builds/virtual-enviroment/venv: lstat /kaniko/0/builds/virtual-enviroment/venv: no such file or directory

weirdly seems to work on a run where the layer does not get cached but any subsequent runs it fails with the above error. i have other COPY commands above that are working without issue


this is run as part of a gitlab-ci pipeline. I am running the latest kaniko debug executor (1.9.1) and have also tried downgrading to other versions. I have no .dockerignore file

https://redd.it/10wi3hb
@r_devops
Any Log management alternative for ELK.

Hi Guys, how do quickly identify errors proactively from applications logs? Do you use any tools like ELK? We are finding it difficult to manage ELK. and looking for an alternative. Any pointers?

https://redd.it/10wam49
@r_devops
When is a "broad" skill-set too broad?

For context, I'm a Software Engineer with just 1-2 years of industry experience. The problem is that, I love everything that I pick up.
Before getting the job, I was really into Development so I learned stuff like MERN stack and .NET Core. I got decent at it. But then I got a job at a company as a fresher and they put me in the Platform Engineering team. I love the technologies I get to work with (GitHub Actions, Python Scripting, GCP, Kubernetes, etc.) And then when I'm done with my day job, I really crave working on some personal projects and I make use of every technology I know.
I feel like I'm "good" at a lot of things but not an "expert" at anything. If I was focused on a single path, I would have been way better at it.
Should I try to narrow down my interests?

https://redd.it/10we4rd
@r_devops
Valid Clickops use case?

I'm building a GUI tool for a previously headless Open Source AWS cost optimization tool I'm building.

The idea is to allow new users to 1) quickly estimate their potential savings, 2) generate the basic configuration (a bunch of Ec2 tags) that configures it and enables it on multiple existing Autoscaling groups by mass-tagging them as per the configuration done in the GUI.

For 1 I have a spreadsheet but it's error prone and a bit effort intense. 2 is totally doable in whatever IaC you may be using but it's relatively painful, error prone and time consuming.

I've been contemplating between deploying something like this in my own AWS account or the customer's account but I was concerned about the security and privacy implications, so for now it's a native desktop GUI application that you need to install and configure with access keys or existing profiles from .AWS/config. I have a MVP but it's not ready yet for widespread use.


What do you think about such a tool? Would you find it useful enough to install it as a desktop application?

What if it was not open source itself?(I'm considering keeping it proprietary as part of my monetization strategy for my OSS project, but there's no special secret sauce, it's just a mass tagging tool after all).

Thanks!

https://redd.it/10wdmpj
@r_devops
Terraform state

Guys, I just started digging deeper into Terraform, so please forgive me if my question is silly or dumb (and also forgive my English, not my native language)

All tutorials and beginner materials on Terraform tell you about the magical state file, which is produced after you run your main.tf file. You can even store the state file remotely in a storage account. OK, that's wonderful but all these tutorials don't mention one single thing: state file is produced in the scope of the current "terraform apply" run. I've been playing locally with Terraform and behavior is the following:

You declare a resource in your [main.tf](https://main.tf)
You run terraform apply, tfstate file is generated
You remove the resource from the [main.tf](https://main.tf) file and add another resource, run terraform apply
Terraform will destroy the former resource and provision the new one

Probably I'm the only one who thinks it's counter-intuitive, but initially I thought that state file is the source of truth and has a log structure, so whenever you create or change resources, it will just add new information. No resources destroyed, unless you explicitly ask for it. It turns out to be very different from my thoughts. Hence, the workarounds:

Keep all resources in [main.tf](https://main.tf) file and add new ones in the same file (horrible, lol)
Store as many state files as you have resources and pick them up if you need something to know about your current infrastructure(that's how it's done currently at my company)

Did I miss something? Probably you can point me in the right direction, maybe there are best practices on handling state files? Or, if this is really how Terraform behaves and you can't change it, how about the others, like Pulumi?

Thank you in advance!

https://redd.it/10w5r9y
@r_devops
Prototype review?

I'm looking for developers that are interested in reviewing a DevOps software deployment platform. Please let me know if you are interested and I can send you more info. Thanks!

https://redd.it/10waynd
@r_devops
Do you think being a DevOp engineer is a difficult job to learn for the average man?

I’ve recently been looking into this career but in all honesty, I’m not the brightest person.

https://redd.it/10vr3er
@r_devops
Persisting a config setting in /etc/ssh/sshconfig of container

We have an app that runs as a container in a VM. The app routinely SSHs into an SFTP server to download files and stuff. Anyway, we want to change a value in ssh\
config of the container, particularly the ServerAliveInterval since we've been experiencing sporadic SSH connection timeout errors. Since this is a running container, that value won't persist after a restart. How can we make it persistent?

https://redd.it/10wrf9j
@r_devops