Skip to main content
edited tags
Link
Anthon
  • 81.4k
  • 42
  • 174
  • 228
Removed misleading tags (Q is not about any of them in particular), added error tag (not sure about that, error tag has no description)
Link
Source Link
moujik
  • 93
  • 1
  • 1
  • 3

APEI Generic Hardware Error

Over the past week my server (running Debian Jessie) has rebooted twice. In the syslog I see this before each reboot, and at no other points:

Aug 15 13:32:58 hoshimiya kernel: [296512.005355] {1}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 1
Aug 15 13:32:58 hoshimiya kernel: [296512.005360] {1}[Hardware Error]: It has been corrected by h/w and requires no further action
Aug 15 13:32:58 hoshimiya kernel: [296512.005361] {1}[Hardware Error]: event severity: corrected
Aug 15 13:32:58 hoshimiya kernel: [296512.005362] {1}[Hardware Error]:  Error 0, type: corrected
Aug 15 13:32:58 hoshimiya kernel: [296512.005363] {1}[Hardware Error]:  fru_text: CorrectedErr
Aug 15 13:32:58 hoshimiya kernel: [296512.005364] {1}[Hardware Error]:   section_type: memory error
Aug 15 13:32:58 hoshimiya kernel: [296512.005365] [Firmware Warn]: error section length is too small

Some googling leads me to believe that this is to do with my ECC RAM detecting and recovering from an error. Is this correct? If it's recovering, why does the system reboot? I'd like to prevent the system from rebooting, if at all possible.