Reddit DevOps
Reddit DevOps. #devops
Thanks @reddit2telegram and @r_channels
DDoS attack - I think

I manage several ecommerce websites and their hosting for work. Over the years I have seen various types of attacks, as well as an increase in AI / bot traffic.

On 3rd July I was alerted to high server activity on one of our sites. Reviewing the server and nginx logs, I could see that requests to the site had gone from an average of 20,000 an hour to 120,000. However, sales had not increased.

In the nginx logs I found a large number of requests to a small group of category pages, and never any requests for CSS / JS, which stinks of bot traffic.

Cherry-picking some IP addresses, I found that each had only ever made one request.
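The checks described above (page requests with no CSS / JS requests from the same IP) can be sketched in Python. This is a minimal sketch, assuming nginx's default `combined` log format; the `ASSET_SUFFIXES` tuple and the function names `summarise_ips` and `suspicious_ips` are illustrative and not taken from the original setup.

```python
import re
from collections import defaultdict

# Matches the start of nginx's default "combined" log format:
# IP - user [time] "METHOD path HTTP/x" ...
LINE_RE = re.compile(r'^(\S+) \S+ \S+ \[[^\]]+\] "(?:\S+) (\S+)[^"]*"')

# Assumed list of static-asset extensions; adjust for the real site.
ASSET_SUFFIXES = (".css", ".js", ".png", ".jpg", ".svg", ".woff2")


def summarise_ips(lines):
    """Return {ip: {"pages": n, "assets": n}} for parseable log lines."""
    stats = defaultdict(lambda: {"pages": 0, "assets": 0})
    for line in lines:
        match = LINE_RE.match(line)
        if not match:
            continue  # skip lines that are not in the expected format
        ip, path = match.groups()
        # Ignore the query string when deciding whether this is an asset.
        bare_path = path.split("?", 1)[0].lower()
        kind = "assets" if bare_path.endswith(ASSET_SUFFIXES) else "pages"
        stats[ip][kind] += 1
    return dict(stats)


def suspicious_ips(stats):
    """IPs that requested pages but never a single static asset."""
    return sorted(ip for ip, s in stats.items() if s["pages"] and not s["assets"])
```

A browser with a warm cache can legitimately skip asset requests, so this is one signal to combine with others rather than proof on its own.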

We immediately enabled Cloudflare's Under Attack mode, which made the traffic drop instantly, adding to the idea that this is bot traffic and not a successful marketing campaign.

I identified patterns in the paths and created a rule in Cloudflare to target them, allowing me to turn off Under Attack mode and keep the website online.

Between then and now I have been reviewing the requests hitting my rule.

A few times I downloaded and analysed 500 requests that hit the rule, and each sample looked similar to this:

- 493 different IP addresses
- 278 ASNs
- 55 countries
- 13 URLs
- 412 user agents
- 500 different query parameters
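A breakdown like the one above can be reproduced from an exported sample. The sketch below assumes each request has already been loaded as a dict; the field names (`ip`, `country`, `path`) and the sample rows are hypothetical and would need mapping to whatever the real export contains.

```python
def distinct_counts(requests, fields):
    """Count distinct values per field across a list of request dicts."""
    # A missing field is counted as the single value None.
    return {field: len({req.get(field) for req in requests}) for field in fields}


# Hypothetical sample rows, for illustration only.
sample = [
    {"ip": "203.0.113.5", "country": "BR", "path": "/category/a"},
    {"ip": "198.51.100.7", "country": "SG", "path": "/category/a"},
    {"ip": "203.0.113.5", "country": "BR", "path": "/category/b"},
]

print(distinct_counts(sample, ["ip", "country", "path"]))
```

When nearly every request has a unique IP, user agent and query string, as in the figures above, per-IP rate limiting has little to work with, which is why a path-based rule was the practical choice here.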


The website sells items to the UK, but a large number of these requests are coming from Brazil, Singapore, Vietnam, India and Bangladesh.


Checking on the rule today (25th July), three weeks in, I can see in Cloudflare that it is still blocking a LOT of requests: it has presented the challenge to 18k requests in the last 24 hours.

I should add that my rule is set to skip known bots.


Is this a DDoS Attack? I have never had one this sophisticated or last this long.

The website is not high value and the requests have been blocked for 3 weeks now yet they still continue to come in.


Any suggestions on additional things I can do to tackle this would also be welcome.



https://redd.it/1m8vf0n
@r_devops
Please help me with NiFi and NiFiKop, which I'm trying to learn!

I've run into a few problems. I'm trying to install a simple HTTP-only NiFi in Azure Kubernetes Service (AKS).
I have a very simple setup, just for testing: a single VM from which I can reach my AKS cluster with k9s or kubectl commands.
The cluster was created like this:

```
az aks create --resource-group rg1 --name aks1 --node-count 3 --enable-cluster-autoscaler --min-count 3 --max-count 5 --network-plugin azure --vnet-subnet-id '/subscriptions/c3a46a89-745e-413b-9aaf-c6387f0c7760/resourceGroups/rg1/providers/Microsoft.Network/virtualNetworks/vnet1/subnets/vnet1-subnet1' --enable-private-cluster --zones 1 2 3
```

I have installed various other things on it as tests and they work, so I don't think the problem is with the cluster itself.

Steps I followed for NiFi:

1. I installed cert-manager:
```
kubectl apply -f https://github.com/jetstack/cert-manager/releases/latest/download/cert-manager.yaml
```

2. ZooKeeper:
```
helm upgrade --install zookeeper-cluster bitnami/zookeeper \
--namespace nifi \
--set resources.requests.memory=256Mi \
--set resources.requests.cpu=250m \
--set resources.limits.memory=256Mi \
--set resources.limits.cpu=250m \
--set networkPolicy.enabled=true \
--set persistence.storageClass=default \
--set replicaCount=3 \
--version "13.8.4"
```
3. Created a service account and a ClusterRoleBinding for NiFiKop:
```
kubectl create serviceaccount nifi -n nifi

kubectl create clusterrolebinding nifi-admin --clusterrole=cluster-admin --serviceaccount=nifi:nifi
```
4. Installed NiFiKop with Helm:
```
helm install nifikop \
oci://ghcr.io/konpyutaika/helm-charts/nifikop \
--namespace=nifi \
--version 1.14.1 \
--set metrics.enabled=true \
--set image.pullPolicy=IfNotPresent \
--set logLevel=INFO \
--set serviceAccount.create=false \
--set serviceAccount.name=nifi \
--set namespaces="{nifi}" \
--set resources.requests.memory=256Mi \
--set resources.requests.cpu=250m \
--set resources.limits.memory=256Mi \
--set resources.limits.cpu=250m
```

5. nifi-cluster.yaml
```
apiVersion: nifi.konpyutaika.com/v1
kind: NifiCluster
metadata:
  name: simplenifi
  namespace: nifi
spec:
  service:
    headlessEnabled: true
    labels:
      cluster-name: simplenifi
  zkAddress: "zookeeper-cluster-headless.nifi.svc.cluster.local:2181"
  zkPath: /simplenifi
  clusterImage: "apache/nifi:2.4.0"
  initContainers:
    - name: init-nifi-utils
      image: esolcontainerregistry1.azurecr.io/nifi/nifi-resources:9
      imagePullPolicy: Always
      command: ["sh", "-c"]
      securityContext:
        runAsUser: 0
      args:
        - |
          rm -rf /opt/nifi/extensions/* && \
          cp -vr /external-resources-files/jars/* /opt/nifi/extensions/
      volumeMounts:
        - name: nifi-external-resources
          mountPath: /opt/nifi/extensions
  oneNifiNodePerNode: true
  readOnlyConfig:
    nifiProperties:
      overrideConfigs: |
        nifi.sensitive.props.key=thisIsABadSensitiveKeyPassword
        nifi.cluster.protocol.is.secure=false

        # Disable HTTPS
        nifi.web.https.host=
        nifi.web.https.port=

        # Enable HTTP
        nifi.web.http.host=0.0.0.0
        nifi.web.http.port=8080

        nifi.remote.input.http.enabled=true
        nifi.remote.input.secure=false

        nifi.security.needClientAuth=false
        nifi.security.allow.anonymous.authentication=false
        nifi.security.user.authorizer: "single-user-authorizer"
  managedAdminUsers:
    - name: myadmin
      identity: [email protected]
  pod:
    labels:
      cluster-name: simplenifi
    readinessProbe:
      exec:
        command:
          - bash
          - -c
          - curl -f https://localhost:8080/nifi-api
      initialDelaySeconds: 20
      periodSeconds: 10
      timeoutSeconds: 5
      failureThreshold: 6
  nodeConfigGroups:
    default_group:
      imagePullPolicy: IfNotPresent
      isNode: true
      serviceAccountName: default
      storageConfigs:
        - mountPath: "/opt/nifi/nifi-current/logs"
          name: logs
          reclaimPolicy: Delete
          pvcSpec:
            accessModes:
              - ReadWriteOnce
            storageClassName: "default"
            resources:
              requests:
                storage: 10Gi
        - mountPath: "/opt/nifi/extensions"
          name: nifi-external-resources
          pvcSpec:
            accessModes:
              - ReadWriteOnce
            storageClassName: "default"
            resources:
              requests:
                storage: 4Gi
      resourcesRequirements:
        limits:
          cpu: "1"
          memory: 2Gi
        requests:
          cpu: "1"
          memory: 2Gi
  nodes:
    - id: 1
      nodeConfigGroup: "default_group"
    - id: 2
      nodeConfigGroup: "default_group"
  propagateLabels: true
  nifiClusterTaskSpec:
    retryDurationMinutes: 10
  listenersConfig:
    internalListeners:
      - containerPort: 8080
        type: http
        name: http
      - containerPort: 6007
        type: cluster
        name: cluster
      - containerPort: 10000
        type: s2s
        name: s2s
      - containerPort: 9090
        type: prometheus
        name: prometheus
      - containerPort: 6342
        type: load-balance
        name: load-balance
    sslSecrets:
      create: true
  singleUserConfiguration:
    enabled: true
    secretKeys:
      username: username
      password: password
    secretRef:
      name: nifi-single-user
      namespace: nifi
```
6. nifi-service.yaml

```
apiVersion: v1
kind: Service
metadata:
  name: nifi-http
  namespace: nifi
spec:
  selector:
    app: nifi
    cluster-name: simplenifi
  ports:
    - port: 8080
      targetPort: 8080
      protocol: TCP
      name: http
```

These are the problems I can't get past. When I try to add any processor in the NiFi interface, or do anything else, I get this error:

Node 0.0.0.0:8080 is unable to fulfill this request due to: Transaction ffb3ecbd-f849-4d47-9f68-099a44eb2c96 is already in progress.

But I hadn't done anything in NiFi that could still be in progress.

The second problem is that, even though I have singleUserConfiguration enabled with the secret applied (I didn't post the secret here, but it is applied in the cluster), it still logs me straight in without asking for a username and password. And I do have these set:

nifi.security.allow.anonymous.authentication=false
nifi.security.user.authorizer: "single-user-authorizer"

I tried asking another person on my team, but he has no idea about NiFi, or doesn't care to help me. I have read the documentation over and over and I just don't understand it anymore. I've been trying this for a week already. Please help me, I'll give you a 6-pack of beer, a burger, a pizza, ANYTHING.

This is a cluster I'm building for a test. It is not production ready and doesn't need to be; I just need it to work. I'll be here if you need more info from me.

https://imgur.com/a/D77TGff Image with the nifi cluster and error

https://redd.it/1m8xlt4
@r_devops
Looking for Real-World Production Terraform or Pulumi Configurations

Hi,

I'm building a tool for simplifying cloud provisioning and deployment workflows, and I'd really appreciate some input from this community.

If you're willing to share, I'm looking for examples of **complex, real-world Terraform or Pulumi configurations used in production**. These can be across any cloud provider and should ideally reflect real organizational use (with all sensitive data redacted, of course).

To make the examples more useful, it would help if you could include:

* A brief description of what the configuration is doing (e.g., multi-region failover, hybrid networking, autoscaling setup, etc.)
* The general **company size or scale** (e.g., startup, mid-size, enterprise)
* Any interesting constraints, edge cases, or reasons why the config was structured that way

Thanks in advance!

https://redd.it/1m8ysnq
@r_devops
Troubleshooting woes?

How have you dealt with troubleshooting in a new role? I recently got a title change from graduate and know there are more responsibilities, but what do I do when my team is too busy to help?
I used to rely a lot on the buddy I was told to shadow, and I can now sort certain errors without her, but I'm worried that needing help at times reflects badly on me, given it's my second year and they only added me to callout this year. What do I do when they're all busy and I'm stuck? I do Google and check AI (AWS Q for code), but systems are different: AI always says the wrong things, while Google says a lot of different things.

https://redd.it/1m927r8
@r_devops
Boot reset a Linux screen without entering a password or the BIOS?

Hello all,

The venue I work for has a couple of 11.6" screens that run Linux 4.4 with Ubuntu 18. The OS was installed at the same time as our narrowcasting software, so the company locked it down on all possible sides, disabled the BIOS and everything else, and didn't give us a password.

We gave them a ring and they said they can't help us out unless we pay a good amount of $$$.

I was wondering if there is a way for me to fully reset the computer and put a fresh Ubuntu on it myself? I have already tried a boot USB in the service input at startup, pressing every button it has at restart, all the standard passwords, etc. Sadly, no result.

We would like to reset it because we are changing narrowcasting software, but the screens themselves are still fine, so we would like to keep using them with the new system.

Has anyone got a clue how to fully reset it? (It's okay if it's 100% wiped.)

Screen / pc used: https://cf-assets.s3.amazonaws.com/LINQ11.6touch.pdf

Specs:
LINQ 11.6” PRO TOUCH SERIES
Architecture ARM64
Ubuntu 18.04.3 LTS
Linux 4.4.167
Digital signage AiO tablet
RK3399 11.6" AiO Tablet
Model: ST116
ST116-RK3399-S

Thank you!

https://redd.it/1m955sq
@r_devops
Git Gud: Setting Up a Better Git Config

I've been slowly refining my `.gitconfig` over time to make Git less frustrating and more productive.

In this blog post, I cover some of the quality-of-life improvements and hidden config gems that have really helped me out, like:

* Making `git commit` show full diffs in the editor
* Sorting branches and tags by most recent activity or version number
* Prettifying diffs with `diff-so-fancy`
* Auto-setting upstream remotes so I don’t have to type `--set-upstream` every time
* Git aliases and shell aliases to save keystrokes
* Enabling background maintenance to reduce repo bloat
* GPG commit signing for that sweet “Verified” badge
* Enabling rerere (yes, it’s a real thing) to auto-resolve repeat merge conflicts
* Bonus: editor tweaks, typo suggestions, whitespace highlighting, and more

It's aimed at developers who already use Git but want to *tune* it to better fit their workflow.

🔗 [**Read it here → Git Gud: Setting Up a Better Git Config**](https://micahkepe.com/blog/gitconfig/)

Would love to hear if there’s anything you think I missed—or if you have your own favorite `.gitconfig` tweaks or aliases.

https://redd.it/1m95els
@r_devops
I was just asked by Google to go through a round of Interviews

To be honest, I'm surprised that my resume passed the algorithm, and I'm equally surprised that my lack of a CS degree also didn't affect the outcome. So, truly, I'm kinda honoured and flattered that they still wanted to go forward.

I've never gone through tech interviews at a FAANG company before - and I heard that they are soul crushing. I just submitted my availability for my first 45-min interview in 2.5 weeks time.

They sent me an email to prepare (shit myself) over some core concepts:

* Arrays and Strings
* Linked Lists
* Trees and Graphs
* Hash Maps/Tables
* Sorting and Searching Algorithms


I'm already sweating bullets. I'm good at coding, but not CS level good. How fucked am I? I feel like I'm pretty fucked. The fucking feels real. I checked out [prepare.sh](https://prepare.sh) and leetcode to see if I can narrow my challenges down but there are still like way too many tests to possibly go through in time.

The pressure from being in front of people to code is already anxiety inducing enough. I'm so over my head.

https://redd.it/1m983mt
@r_devops
I got 4 rejection emails today, one with an internal recommendation too. Can I get a sanity check on my resume please?

I've been on and off looking for a new job for about a year now. I got laid off in May and have ramped up my efforts since then including getting my CKA cert and almost ready for the AWS SysOps cert. I've scored a few interviews over the last year, but nothing since May, and keep getting hit with "We've chosen to go with another candidate". The rejection emails from today included a DevOps position where I have all the skills and experience that were listed on the job position but I got insta-rejected, even with the internal recommendation.

I know the job market is tough right now and that a lot of these openings are being flooded with talented candidates, which means my resume needs to be on point. I've crafted my resume with the help of ChatGPT, but getting some feedback from real people might point out areas that could be improved. If you could find a few spare minutes to review my resume and provide any feedback I would be extremely grateful. Thanks!

Resume: https://imgur.com/a/seh2Wl1

https://redd.it/1m97i8e
@r_devops
Just Launched: JS Deobfuscator

Hey devs 👋

I just launched a free tool called JS Deobfuscator – it helps you convert obfuscated, minified, or packed JavaScript into readable code.

🔧 Features:
• Handles complex proxy/function chains
• String/array unpacking
• Optional beautify-only mode
• Dark/light theme
• One-click download or copy

Would love for you to try it and give me feedback — what works, what doesn’t, and what you’d like to see next!

👉 https://jsdeobfuscator.com

Thanks in advance 🙌


https://redd.it/1m9ch3n
@r_devops
Dev ops pathway.

40m looking to switch careers. I've been working in the rock climbing industry my whole life and would love to settle into something less physical and more reliable for my growing family.

I'm currently studying DevOps through a very lengthy course on Udemy. It's going pretty slowly as I work full time and have a newborn baby.

I have a decent understanding of Linux/bash and have taken a few Python courses previously. I can create and manage virtual machines both manually and through automation. The course will cover a lot of the tools as well.

My question is: what would be good positions to start applying for if I wanted to switch careers before I finish my schooling?
What’s the very beginning of the path to dev ops?

Help desk? System admin? Coffee gopher?

https://redd.it/1m9hvvl
@r_devops
Is it possible to configure WSL2 so that containers expose the same port on both the WSL2 environment and the host system? Specifically, if a container is running on localhost:8000 within WSL2, can it be accessed on localhost:8000 from the Windows host or external networks?

I was able to set up my WSL2 so that the containers run on WSL2, but I can't reach the backend at all from outside of the container, which makes my local setup completely useless.

https://redd.it/1m9hpq5
@r_devops
Career shifting helpppppp

I’m currently working as a Backend Developer at a mid-sized company, but unfortunately, the work environment isn’t ideal.
There’s a lack of structure, and my manager is often unfocused, which leads to constant pressure and confusion across the team. The stress doesn’t come from a heavy workload, but rather from the absence of clear direction and effective leadership.

Recently, I started exploring new opportunities and even began going through some interviews. Then I joined a mentorship program run by a reputable company — and it was a turning point.

During the program, I was introduced to the DevOps field, and I found myself genuinely passionate about it. My mentor guided me on the tools and certifications I need to pursue, but the full transition will likely take around 5 months or so.

The challenge is that I’m struggling to stay motivated in my current role while preparing for this shift. I’m really looking for a more structured, corporate environment — one with real teams, healthy collaboration, and clear goals.

If you have any advice or know of companies open to hiring people who are transitioning between roles, I’d truly appreciate your help and guidance.

This is a tough phase, and any support would mean a lot. Thank you!

https://redd.it/1m9pf5i
@r_devops
Do you use Crossplane? My company today only uses K8s. We have Crossplane configured for some tasks, but not extensively. We are considering whether to continue using it or start using Terraform, as most people are unfamiliar with Crossplane.

Please share your thoughts

https://redd.it/1m9rv9m
@r_devops
Get a Job

Let me tell you my story. I am 18 years old, currently living in Europe and working in fast food to earn some money and survive. At the age of 13, I became interested in computers and programming. At 14, I started writing some small programs in C++. Since I was 15, I have been studying cybersecurity, networks, and everything related to DevOps. My dream and goal is to become an engineer in this field. Please give me some advice. Currently, I try to study this every day for at least an hour, and on weekends for more than 4-5 hours without days off. I had some interviews but was not accepted anywhere.

https://redd.it/1m9seyn
@r_devops
API GATEWAY

I have been tasked with creating a prototype of an API gateway for my company, which we will sell as SaaS. I have not done a project like this before, and here is how I have been thinking about approaching the problem:
1. Use Nginx as a reverse proxy, then write the business logic in Go or C/C++ and use Redis pub/sub for caching.
2. Build a reverse proxy first, then extend it into a gateway.
3. Just start everything from scratch.

I'm a junior and have never encountered anything like this. If there is a better way, please guide me and help align my thinking.
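Whichever option is chosen, the core of a gateway is mapping an incoming request path to an upstream service. Below is a minimal sketch of longest-prefix routing, written in Python for brevity even though the post mentions Go or C/C++. The `ROUTES` table, the upstream URLs and the function name `match_upstream` are all hypothetical.

```python
def match_upstream(path, routes):
    """Return the upstream for the longest route prefix matching path, or None."""
    best = None
    for prefix in routes:
        # A prefix only matches on a whole path-segment boundary,
        # so "/users" matches "/users/42" but not "/usersx".
        if path == prefix or path.startswith(prefix.rstrip("/") + "/"):
            if best is None or len(prefix) > len(best):
                best = prefix
    return routes[best] if best is not None else None


# Hypothetical route table: path prefix -> upstream base URL.
ROUTES = {
    "/users": "http://users.internal:8081",
    "/users/admin": "http://admin.internal:8082",
    "/orders": "http://orders.internal:8083",
}

print(match_upstream("/users/42", ROUTES))
```

Authentication, rate limiting and caching would sit around this lookup, which is roughly what option 1 delegates to Nginx and what option 3 would have to rebuild by hand.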

https://redd.it/1m9w28t
@r_devops
Introducing ReflexCore: Your Open‑Source Cognition Layer for DevSecOps Shells

Hey everyone

I’m excited to share GitsWhy ReflexCore, the free, Apache‑2.0–licensed agent that turns any Bash/Zsh shell into a cognition‑native DevSecOps environment. It:

* Monitors keystroke patterns to infer intent and detect hesitation
* Auto‑tunes system health by flushing entropy pools & cleaning zombie processes
* Logs everything into a local, Fernet‑encrypted vault for later analysis
* Ships with a full test suite & GitHub Actions CI, so it’s production‑ready

https://github.com/gitswhy/reflexcore

https://redd.it/1m9zsuw
@r_devops
Junior DevOps interview prep resources!

Got my first DevOps/cloud technical assessment coming up (take-home case + discussion). The opportunity came through networking, so there's no JD or clearly defined position, and I don't know exactly what they will test. I have AZ-104, GCP Associate, and some other certs but limited real-world experience. Looking for good resources that cover practical troubleshooting scenarios and 'what if X breaks' type questions. Already have the Cracking DevOps Interview book. Any other recommendations for hands-on practice or realistic scenario walkthroughs? Especially interested in take-home case examples or what to expect from the technical discussion afterward.

https://redd.it/1ma4yrm
@r_devops
Do you track vendor SLA breaches?

I've started looking more closely into SLA breaches for the common SaaS services we use (GitHub, JIRA, etc.) because of outages during the first half of the year. Each vendor seems to have its own set of "rules" for what counts as downtime, whether your account qualifies, and how quickly you have to submit a claim.

Is anyone successfully recouping credits, or am I on a fool's errand? Does your DevOps team do this, or do you have an internal team (finance?) doing it? Maybe it's managed by a third-party vendor? Looking for options and advice.
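For a first pass at whether an outage is even worth claiming, the arithmetic behind an uptime SLA can be sketched as below. This is a rough sketch only: the 30-day window and the function names are assumptions, and each vendor's contract defines its own measurement period and what counts as downtime.

```python
def allowed_downtime_minutes(sla_percent, days=30):
    """Downtime budget in minutes for an uptime SLA over a window of `days` days."""
    total_minutes = days * 24 * 60
    return total_minutes * (100 - sla_percent) / 100


def uptime_percent(downtime_minutes, days=30):
    """Achieved uptime percentage given the observed downtime in minutes."""
    total_minutes = days * 24 * 60
    return 100 * (total_minutes - downtime_minutes) / total_minutes


def is_breach(downtime_minutes, sla_percent, days=30):
    """True if the observed downtime exceeds the SLA's downtime budget."""
    return downtime_minutes > allowed_downtime_minutes(sla_percent, days)


# A 99.9% SLA over 30 days allows roughly 43.2 minutes of downtime.
print(round(allowed_downtime_minutes(99.9), 1))
print(is_breach(60, 99.9))
```

Tracking outage minutes per vendor in this form makes it quick to see which incidents cross the threshold before spending time on the vendor-specific claim process.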

https://redd.it/1ma9u3d
@r_devops