On 07/21/2017 06:27 AM, Mike Wright wrote:
On 07/21/2017 06:08 AM, Sherman Grunewagen wrote:
I am getting this block of messages put into /var/log/messages every few minutes (or seconds) since I received this laptop and installed F25. The install is updated as of two days ago. Macine is a new Dell Precision 7520.
Jul 21 02:48:53 new_pons kernel: mce: [Hardware Error]: Machine check events logged Jul 21 02:48:53 new_pons mcelog[953]: mcelog: Family 6 Model 9e CPU: only decoding architectural errors Jul 21 02:48:53 new_pons mcelog[953]: Hardware event. This is not a software error. Jul 21 02:48:53 new_pons mcelog[953]: MCE 0 Jul 21 02:48:53 new_pons mcelog[953]: CPU 2 BANK 0 TSC 2dce95564215 Jul 21 02:48:53 new_pons mcelog[953]: TIME 1500630533 Fri Jul 21 02:48:53 2017 Jul 21 02:48:53 new_pons mcelog[953]: MCG status: Jul 21 02:48:53 new_pons mcelog[953]: MCi status: Jul 21 02:48:53 new_pons mcelog[953]: Error overflow Jul 21 02:48:53 new_pons mcelog[953]: Corrected error Jul 21 02:48:53 new_pons mcelog[953]: Error enabled Jul 21 02:48:53 new_pons mcelog[953]: MCA: Internal parity error Jul 21 02:48:53 new_pons mcelog[953]: STATUS d00000c000010005 MCGSTATUS 0 Jul 21 02:48:53 new_pons mcelog[953]: MCGCAP c0a APICID 4 SOCKETID 0 Jul 21 02:48:53 new_pons mcelog[953]: CPUID Vendor Intel Family 6 Model 158 Jul 21 02:48:53 new_pons mcelog[953]: mcelog: warning: 24 bytes ignored in each record Jul 21 02:48:53 new_pons mcelog[953]: mcelog: consider an update Jul 21 02:48:54 new_pons abrt-dump-journal-oops[1020]: abrt-dump-journal-oops: Found oopses: 1 Jul 21 02:48:54 new_pons abrt-dump-journal-oops[1020]: abrt-dump-journal-oops: Creating problem directories Jul 21 02:48:55 new_pons abrt-dump-journal-oops[1020]: Reported 1 kernel oopses to Abrt
This looks to be a CPU cache memory error that is correctable. I have a quad-core with one core giving correctable cache errors. Turning the core off reduces my net performance significantly so instead I turned off MCE errors (besides, they were crapping all over my xterms).
Your call as to whether your work is critical enough to replace the CPU. Good excuse to tell your SO you have to upgrade ;D
Mike Wright
Thanks. This is a brand new Dell machine (1 week old). Looking around on the web, I see many others with the issue. One person reported that Dell was infuriatingly no help (surprise!), claiming these are "false positive" errors. I'm moving to China in 3 weeks and have little time to pursue this with Dell. But I want to avoid my laptop dying in China, so I'm really not sure what to do. -Sherman