Hi guys! I have a rather beefy machine. AMD Ryzen 7700, 32GB DDR5, GPU 7800XT 16GB, several NVME drives for OS, general data, games. And yet…after a while it becomes completely unresponsive. Mouse freezes, keyboard doesn’t key anything, and the screen gets completely frozen. Meanwhile the disk led gets full activity, almost constantly red. So…While this might be crazy pagination turning the system to a crawl (I have an 8GB swapfile), I want to be able to determine what’s going on. Is there a way I can check any log, or enable any kind of logging that would tell me what happened on the seconds before it became completely unresponsive? Who takes all my memory??
Normal situations where this happens:
Firefox open, multiple windows, lots of tabs. Maybe ~5-8GB of RAM.
Virtmanager running a Windows VM, running a work remote desktop…4GB of RAM
Steam…1GB of RAM
Thunderbird, Deluge, Telegram, Whatsapp…Not much more really.
This shouldn’t even come close to the RAM capacity of this machine. And yet…it really looks like it suffocates without memory. How can I check for issues?
This looks like either a driver issue, but more likely, a hardware issue. Either your nvme, or your RAM, is faulty. Run memcheck (it’s a bootable thing you run to make sure your ram is ok), and I’m sure there are tests for ssds too.
When this happens, can you switch console (ctrl-alt-f1) or restart X (ctrl-alt-backspace) or can you ssh from another PC? you can also in a window have a journalctl or something tailing the logs and see if something is happening there
Haven’t tried remote SSH yet. But switching to text console doesn’t work, unresponsive for that, too.
Ok so I kinda had a similar problem. Difference is that I was using Arch and full disk encryption. System would freeze up if I tried writing big files and disk light would start blinking.
It might not be that to so maybe run “journalctl -b -1” the next boot after your system freezes and check towards the bottom of the log to see if there are any errors, usually red. Another way is to use btop running in the background and when the system gives any sign that it’ll freeze switch to btop and check what’s going on.
Edit: something that came to me is to try to switch to another tty using Ctrl+alt+number, I’m not sure how neon works so try 2 or 3 or 4.
you’re nowhere close to RAM exhaustion. I had similar mishaps on an all-AMD system a few gens back and it manifested itself as micro-stutters that occasionally grew to such manifestations. I think I remember it was fixed via a combination of kernel switches and progressively better performance as new versions of kernel and modules/drivers progressed.
no idea what KDE Neon is based on (Ubuntu LTS?), but I’m guessing you rock pretty old kernels and relatively modern hardware, which is a pain. also you don’t need a swapfile, use zram. or just switch to fedora or sumsuch that takes care of all them things for you.
It may not be the raw RAM usage.
My first suspect is the Windows VM especially if it’s running enterprise security software 4GB is probably not enough for modem Windows and it could be trying to use its page file, thrashing your disk in the process.
Are you able to collect some data from system monitor on paging and disk activity? That could help you narrow it down. You can use btop for a quick terminal option if your gui is non responsive (assuming your could switch to a console). Vmstat is another option that you can run in the background to collect stats over time, but it’s not user friendly.
Nothing much enterprise…It’s running “Windows App”, just a glorified RDP with extra authentication settings for SSO etc. Hence why I gave it only 4GB. It’s not just GUI not being responsive, everything is. It’s a full freeze, and I can’t get to the text consoles either. Most I can aspire to, I think, is to gather data from right before the freeze happens…and check it after I reset the computer.
I see. My concern was with security scanning tools often put on computers by enterprise IT departments but it sounds like that’s not the case here.
In your situation, assuming you’re not finding what you seek with journalctl, I think I would use a tool like vmstat or
sarto collect periodic snapshots of CPU, memory, and io. You can tell it to collect data every X seconds and tee that to a file. After you reboot you can see what happened leading up to the crash. You should be able to import the data into a spreadsheet or something for analysis, but it’s not very intuitive and you’ll need to consult man pages for the options and how to interpret them.There are a lot of good suggestions in this thread. I would lean towards a hardware or driver issue, maybe bad RAM. Unfortunately these things take a lot of trial and error to figure out.
In addition to all the good suggestions already here, consider installing early-oom and configure it to kill the stuff you care less, maybe one of those heavy electron-based clients.
Better use systemd-oomd it comes with systemd already on arch and works pretty well
Neon doesn’t force you to actually update the ubuntu it’s built on unless you manually do it iirc. Update your shit and report back.
Once you decide not to try that, top, btop atop or htop can tell you the amount of ram you’re using. They will all also tell you how your disk writes are doing.
It doesn’t sound like you have a ram issue, it sounds like you have a disk issue. First and foremost, once you’ve verified that you have plenty of memory available using a tool described above, expand your windows vm to 8gb. Windows would aggressively page if it had only 4gb and windows in a vm will also aggressively page when it only has 4gb, except it has to go through kvm to access those qcows.
It sounds like you have way too many tabs open. Close some and see if that helps you out. You can highlight a bunch of them by selecting one and ctrl-shift clicking on another one to get every tab in between. Right click and add to bookmarks then close them.
Next, use spinrite with I think a level 3 scan on all your nvme drives. It shaves a write cycle off the top (you have hundreds of thousands at the very least) but in return makes everything fast again. Flash memory becomes less responsive as read cycles on a block pile up until it’s rewritten.
Try different USB devices. I had a bad mouse do something similar.






