Daemon that kills processes using too much CPU

communism@lemmy.ml · edit-2 20 days ago

Daemon that kills processes using too much CPU

ragica@lemmy.ml · edit-2 20 days ago

I used to use earlyoom on an old laptop and it worked well for my purposes.

I hear there is a systemd-oomd, but I never tried it.

Edit: sorry I misread your post to be about memory rather than CPU. Too early on the morning for my brain to work.

communism@lemmy.ml · 20 days ago

Thanks. I’ve had a couple of comments suggesting that it might be a memory leak instead of CPU usage anyway so I’ve installed earlyoom and we’ll see if that can diagnose the problem, if not I’ll look into CPU solutions.

just_another_person@lemmy.world · 20 days ago

It’s not the CPU. All that will do is consume CPU and raise your energy bill.

DigitalDilemma@lemmy.ml · 20 days ago

Never heard of something like that, and I suspect anyone who started creating it soon filed it under “Really bad ideas” alongside “Whoops, why did my kernel just stop?”

sar is the traditional way to watch for high load processes, but do the basics first as that’s not exactly trivial to get going. Things like running htop. Not only will that give you a simple breakdown of memory usage (others have already pointed out swap load which is very likely), but also sorting by cpu usage. htop is more than just a linux taskmgr, it’s a first step triage for stuff like this.

just_another_person@lemmy.world · 20 days ago

Get some sort of resource monitor running on the machine to collect timeseries data about your procs, preferably sent to another machine. Prometheus is simple enough, but SigNoz and Outrace are like DataDog alternatives if you want to go there.
Identify what’s running out of control. Check CPU and Memory (most likely a memory leak)
Check logs to see if something is obviously wrong
Look and see if there is an update for whatever the proc is that addresses this issue
If it’s a systems process, set proper limits

In general, it’s not an out of control CPU that’s going to halt your machine, it’s memory loss. If you have an out of control process taking too much memory, it should get OOMkilled by the kernel, but if you don’t have proper swap configured, and not enough memory, it may not have time to successfully prevent the machine from running out of memory and halting.

Jerkface (any/all)@lemmy.ca · edit-2 19 days ago

Removed by mod

gian · 20 days ago

The kernel has a way to assign resources to each and every process, try to google for “Linux kernel limits” or “linux cgroup cpu limit”.

The problem is knowing which process cause the load, but if you cannot even htop, then I doubt a daemon could do something.

communism@lemmy.ml · 20 days ago

if you cannot even htop, then I doubt a daemon could do something.

The point is that a daemon can catch it before it reaches that point by killing processes that are using too much resources, before all the system resources are used up.

gian · 18 days ago

True only if the resource hog process grows progressively, else the daemon is in the same situation and the kernel limits are the only way since it stops the process before.

But yes, a daemon could be an interesting solution

custard_swollower@lemmy.world · 20 days ago

Open a console with top/htop and check if it will be visible when the system halts.

From my experience it looks like out of memory situation and some process starts swapping like crazy, or a faulty hdd that tries to read some part of the disk over and over again without success.

communism@lemmy.ml · edit-2 20 days ago

Open a console with top/htop and check if it will be visible when the system halts.

That would require me to have a second machine up all the time sshed in with htop open, no? Sometimes this happens on the server while I’m asleep and I don’t really want a second machine running 24/7.