On Fri, 2017-07-21 at 06:08 -0700, Sherman Grunewagen wrote:
I am getting this block of messages put into /var/log/messages every few minutes (or seconds) since I received this laptop and installed F25. The install is updated as of two days ago. Macine is a new Dell Precision 7520.
What does this mean? Do I have a defective CPU?
-Sherman
Jul 21 02:48:53 new_pons kernel: mce: [Hardware Error]: Machine check events logged Jul 21 02:48:53 new_pons mcelog[953]: mcelog: Family 6 Model 9e CPU: only decoding architectural errors Jul 21 02:48:53 new_pons mcelog[953]: Hardware event. This is not a software error. Jul 21 02:48:53 new_pons mcelog[953]: MCE 0 Jul 21 02:48:53 new_pons mcelog[953]: CPU 2 BANK 0 TSC 2dce95564215 Jul 21 02:48:53 new_pons mcelog[953]: TIME 1500630533 Fri Jul 21 02:48:53 2017 Jul 21 02:48:53 new_pons mcelog[953]: MCG status: Jul 21 02:48:53 new_pons mcelog[953]: MCi status: Jul 21 02:48:53 new_pons mcelog[953]: Error overflow Jul 21 02:48:53 new_pons mcelog[953]: Corrected error Jul 21 02:48:53 new_pons mcelog[953]: Error enabled Jul 21 02:48:53 new_pons mcelog[953]: MCA: Internal parity error Jul 21 02:48:53 new_pons mcelog[953]: STATUS d00000c000010005 MCGSTATUS 0 Jul 21 02:48:53 new_pons mcelog[953]: MCGCAP c0a APICID 4 SOCKETID 0 Jul 21 02:48:53 new_pons mcelog[953]: CPUID Vendor Intel Family 6 Model 158 Jul 21 02:48:53 new_pons mcelog[953]: mcelog: warning: 24 bytes ignored in each record Jul 21 02:48:53 new_pons mcelog[953]: mcelog: consider an update Jul 21 02:48:54 new_pons abrt-dump-journal-oops[1020]: abrt-dump- journal-oops: Found oopses: 1 Jul 21 02:48:54 new_pons abrt-dump-journal-oops[1020]: abrt-dump- journal-oops: Creating problem directories Jul 21 02:48:55 new_pons abrt-dump-journal-oops[1020]: Reported 1 kernel oopses to Abrt _______________________________________________
On a Dell Alienware 17, from ~ mid-2014 with an Intel(R) Core(TM) i7-4900MQ CPU @ 2.80GHz with a Fedora 26 on it:
$ dmesg | grep -i mce [ 0.022559] mce: CPU supports 9 MCE banks [ 0.022570] mce: [Hardware Error]: Machine check events logged [ 0.043479] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 6: ee0000000040110a [ 0.043483] mce: [Hardware Error]: TSC 0 ADDR ffb07080 MISC 38a0000086 [ 0.043486] mce: [Hardware Error]: PROCESSOR 0:306c3 TIME 1500639882 SOCKET 0 APIC 0 microcode 22 [ 0.043488] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 7: ee0000000040110a [ 0.043490] mce: [Hardware Error]: TSC 0 ADDR ffb07340 MISC 1b8a0000086 [ 0.043492] mce: [Hardware Error]: PROCESSOR 0:306c3 TIME 1500639882 SOCKET 0 APIC 0 microcode 22 [ 332.751908] mce: [Hardware Error]: Machine check events logged
journalctl, excerpt from the last ~ 12 hrs:
audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=mcelog comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' kernel: mce: CPU supports 9 MCE banks kernel: mce: [Hardware Error]: Machine check events logged kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 6: ee0000000040110a kernel: mce: [Hardware Error]: TSC 0 ADDR ffb07080 MISC 38a0000086 kernel: mce: [Hardware Error]: PROCESSOR 0:306c3 TIME 1500637485 SOCKET 0 APIC 0 microcode 22 kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 7: ee0000000040110a kernel: mce: [Hardware Error]: TSC 0 ADDR ffb07340 MISC 1b8a0000086 kernel: mce: [Hardware Error]: PROCESSOR 0:306c3 TIME 1500637485 SOCKET 0 APIC 0 microcode 22 audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=mcelog comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' mcelog[2377]: Hardware event. This is not a software error. mcelog[2377]: MCE 0 mcelog[2377]: CPU 0 BANK 6 mcelog[2377]: MISC 38a0000086 ADDR ffb07080 mcelog[2377]: TIME 1500637485 Fri Jul 21 13:44:45 2017 mcelog[2377]: MCG status: mcelog[2377]: MCi status: mcelog[2377]: Error overflow mcelog[2377]: Uncorrected error mcelog[2377]: MCi_MISC register valid mcelog[2377]: MCi_ADDR register valid mcelog[2377]: Processor context corrupt mcelog[2377]: MCA: corrected filtering (some unreported errors in same region) mcelog[2377]: Generic CACHE Level-2 Generic Error mcelog[2377]: STATUS ee0000000040110a MCGSTATUS 0 mcelog[2377]: MCGCAP c09 APICID 0 SOCKETID 0 mcelog[2377]: CPUID Vendor Intel Family 6 Model 60 mcelog[2377]: Hardware event. This is not a software error. mcelog[2377]: MCE 1 mcelog[2377]: CPU 0 BANK 7 mcelog[2377]: MISC 1b8a0000086 ADDR ffb07340 mcelog[2377]: TIME 1500637485 Fri Jul 21 13:44:45 2017 mcelog[2377]: MCG status: mcelog[2377]: MCi status: mcelog[2377]: Error overflow mcelog[2377]: Uncorrected error mcelog[2377]: MCi_MISC register valid mcelog[2377]: MCi_ADDR register valid mcelog[2377]: Processor context corrupt mcelog[2377]: MCA: corrected filtering (some unreported errors in same region) mcelog[2377]: Generic CACHE Level-2 Generic Error mcelog[2377]: STATUS ee0000000040110a MCGSTATUS 0 mcelog[2377]: MCGCAP c09 APICID 0 SOCKETID 0 mcelog[2377]: CPUID Vendor Intel Family 6 Model 60 mcelog[2377]: mcelog: warning: 24 bytes ignored in each record mcelog[2377]: mcelog: consider an update kernel: mce: [Hardware Error]: Machine check events logged audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=mcelog comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=mcelog comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' kernel: mce: CPU supports 9 MCE banks kernel: mce: [Hardware Error]: Machine check events logged kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 6: ee0000000040110a kernel: mce: [Hardware Error]: TSC 0 ADDR ffb07080 MISC 38a0000086 kernel: mce: [Hardware Error]: PROCESSOR 0:306c3 TIME 1500639882 SOCKET 0 APIC 0 microcode 22 kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 7: ee0000000040110a kernel: mce: [Hardware Error]: TSC 0 ADDR ffb07340 MISC 1b8a0000086 kernel: mce: [Hardware Error]: PROCESSOR 0:306c3 TIME 1500639882 SOCKET 0 APIC 0 microcode 22 audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=mcelog comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success' mcelog[2093]: Hardware event. This is not a software error. mcelog[2093]: MCE 0 mcelog[2093]: CPU 0 BANK 6 mcelog[2093]: MISC 38a0000086 ADDR ffb07080 mcelog[2093]: TIME 1500639882 Fri Jul 21 14:24:42 2017 mcelog[2093]: MCG status: mcelog[2093]: MCi status: mcelog[2093]: Error overflow mcelog[2093]: Uncorrected error mcelog[2093]: MCi_MISC register valid mcelog[2093]: MCi_ADDR register valid mcelog[2093]: Processor context corrupt mcelog[2093]: MCA: corrected filtering (some unreported errors in same region) mcelog[2093]: Generic CACHE Level-2 Generic Error mcelog[2093]: STATUS ee0000000040110a MCGSTATUS 0 mcelog[2093]: MCGCAP c09 APICID 0 SOCKETID 0 mcelog[2093]: CPUID Vendor Intel Family 6 Model 60 mcelog[2093]: Hardware event. This is not a software error. mcelog[2093]: MCE 1 mcelog[2093]: CPU 0 BANK 7 mcelog[2093]: MISC 1b8a0000086 ADDR ffb07340 mcelog[2093]: TIME 1500639882 Fri Jul 21 14:24:42 2017 mcelog[2093]: MCG status: mcelog[2093]: MCi status: mcelog[2093]: Error overflow mcelog[2093]: Uncorrected error mcelog[2093]: MCi_MISC register valid mcelog[2093]: MCi_ADDR register valid mcelog[2093]: Processor context corrupt mcelog[2093]: MCA: corrected filtering (some unreported errors in same region) mcelog[2093]: Generic CACHE Level-2 Generic Error mcelog[2093]: STATUS ee0000000040110a MCGSTATUS 0 mcelog[2093]: MCGCAP c09 APICID 0 SOCKETID 0 mcelog[2093]: CPUID Vendor Intel Family 6 Model 60 mcelog[2093]: mcelog: warning: 24 bytes ignored in each record mcelog[2093]: mcelog: consider an update kernel: mce: [Hardware Error]: Machine check events logged
--------------------------------------------
IIRC I have these errors since I installed Fedora 24 on this machine, last autumn I think. And F24 worked - after a little extra help via software settings - really good, and considerably better than F26 now on the same machine ...
HTH
Wolfgang