Hi, I have a fedora29 system in our colo that's a few years old now and just goes catatonic and stops responding after a few days. It's happened a few times now, even with different kernels, so I suspect it's a memory or hardware problem.
Is it possible to run memtest without having physical access to the machine to insert a USB stick or CDROM?
After the machine reboots (via IPMI access), there's nothing in the logs and no abrt-cli info on a kernel crash or other info I can find about why it died.
What else can I do to troubleshoot this without having to drive to the colo to check on it?
The last entry from journalctl just before it stopped responding was just a regular nrpe entry, unrelated to the crash.
I've pasted the current dmesg output here: http://pasted.co/4b700ee1
Any ideas greatly appreciated.