The firmware alerts in the following tables are grouped into four general categories for ease of use. The actual source of each alert is shown in the "Source" column.
See the Alert Severity Table for definitions of Alert Severity statuses.
CPU EV7 Category of Firmware Alerts

| Event Description | Source | Severity | Supplied Data |
|---|---|---|---|
| CPU Clock Power Fault | EV7 | Failure | |
| %s %s has faulted (VRM failure) | CMMn, EV7? | Failure | cpu_id, VRM name |
| RIMM SPD Checksum failed for RIMM #%d | CMMn | Warning | Failed RIMM number |
| Error writing the PLL clock ratio registers. | CMMn | Failure | |
| Error programming the VID register. | CMMn | Failure | |
| Too many %s VRMs (%d) have failed. | CMMn | Failure | VRM type, number failed |
| Can't reset EV7 with power off. | EV7 | Failure | |
| CPU has timed out during SROM load.\n | EV7 | Failure | CPU number |
| CPU failed SROM/XSROM load. | EV7 | Failure | |
| CPU has timed out during tepid reset, continuing. | EV7 | Failure | |
| Can't halt EV7 with power off. | EV7 | Failure | |
| SROM port is stuck busy. | EV7 | Failure | |
| Scan dump on CPU timed out waiting for busy. | EV7 | Failure | CPU number |
| Scan dump on CPU timed out. | EV7 | Failure | |
| Can't read CPU EEPROM | EV7 | Failure | |
| Can't write CPU EEPROM | EV7 | Failure | |
| srom_check_status: CPU timed out waiting for SROM load status. | EV7 | Failure | CPU number |
| srom_check_status: CPU can not accept the load image command. | EV7 | Failure | |
| srom_check_status: CPU timed out waiting for SROM load image status. | EV7 | Failure | |
| srom_check_status: CPU timed out on XSROM version command. | EV7 | Failure | |
| Error in load image to EV7 | EV7 | Failure | |
| OCLA %d was found running. Clearing RUN. | EV7 | Failure | OCLA number |
| OCLA %d was found disabled. Setting Enable. | EV7 | Failure | OCLA number |

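Many event descriptions are printf-style format strings, and the "Supplied Data" column lists the values that fill the conversion specifiers in order. The following is a minimal sketch of that relationship, not the firmware's actual implementation; the function name `render_alert` and the sample values are hypothetical.

```python
# Hypothetical helper: fill a printf-style alert format with the values
# listed in the "Supplied Data" column, in the order they appear there.

def render_alert(fmt, supplied=()):
    """Return the alert text for a format string and its supplied data."""
    try:
        # Python's % operator understands the same %s, %d, %x, and %02X
        # specifiers that appear in the event descriptions.
        return fmt % tuple(supplied)
    except (TypeError, ValueError) as exc:
        # Wrong number or type of values: keep the raw format readable
        # instead of raising.
        return f"{fmt} [unformatted: {exc}]"


if __name__ == "__main__":
    # Sample values below are made up for illustration.
    print(render_alert("RIMM SPD Checksum failed for RIMM #%d", [3]))
    print(render_alert("Too many %s VRMs (%d) have failed.", ["core", 2]))
    print(render_alert("CPU Clock Power Fault"))
```

Alerts with an empty "Supplied Data" cell pass through unchanged, and a mismatch between the format and the supplied values yields the raw format with a note rather than an exception.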
Environmental Category of Firmware Alerts

| Event Description | Source | Severity | Supplied Data |
|---|---|---|---|
| Voltage | MBM, PBM, CMMn | OK, | Locator, |
| Temperature | MBM, PBM, CMMn | OK, | Locator, |
| Fan | MBM, PBM | OK, | Locator, |
| Intrusion | MBM, PBM | OK (close), Warning (open) | Locator |
| PS | MBM, PBM | OK, | Locator, |
| WPI/SDI | MBM | OK, | Locator, |
| IOR | PBM | OK, | Locator, |
| EEPROM | MBM, PBM, CMMn | OK, | Locator |
| VRM | CMMn | OK, | Locator |
| Power off drawer due to temp failure | MBM, PBM | Failure | |
| Power off drawer due to insufficient running fans | MBM, PBM | Failure | |
| Power off drawer due to unknown failure | MBM, PBM | Failure | |
| Component has been added | CMMn, PS, IORn | OK | |
| Component has been removed | CMMn, PS, IORn | OK | |
| Insufficient running PS | PS | Warning | |

Operational Category of Firmware Alerts

| Event Description | Source | Severity | Supplied Data |
|---|---|---|---|
| SYS_SERIAL_NUM is not set | Operational | Warning | |
| Running with mixed firmware revisions. | Operational | Warning | |
| %s test failure. | MBM, PBM | Failure | POST test that failed |
| Last reset due to watchdog timeout | MBM, PBM, CMMn | Warning | |
| Server management group is transitioning. | Operational | Warning | |
| Server management group is stable. | Operational | OK | |
| Power switch state changed | Operational | OK | New state |
| Error log entry | MBM, PBM | Warning | |

Partition Category of Firmware Alerts

| Event Description | Source | Severity | Supplied Data |
|---|---|---|---|
| IP Cable missing between cab:%d drw:%d port:%s and cab:%d drw:%d port:%s | MBM | Warning | Cabinet, drawer, port |
| Logging PAL EV7 Logout | EV7 | Failure | |
| Test %02X [%s] failed on cpu [NS: %d EW: %d] | EV7 | Failure | Test number, test name, CPU NS, CPU EW |
| Unable to disable Zbox | EV7 | Failure | |
| Disabled CPU/IO | EV7, IORn | Failure | |
| Disabled Zbox1 | EV7 | Failure | |
| Disabled RAID (remap) | EV7 | Failure | |
| Disabled Memory | EV7 | Failure | |
| Disabled: IP Cable cab:%02X drw:%X CPU:%x %s wrap:%d; (%x,%x) to (%x,%x) | EV7 | Failure | cab, drawer, CPU, string, wrap, ns1, ew1, ns2, ew2 |
| Other end of IP Cable not found - cab:%02X drw:%X CPU:%x %s wrap:%d; (%x,%x) | EV7 | Failure | cab, drawer, CPU, string, wrap, ns1, ew1 |
| IO Configured without CPU Memory | EV7 | Warning | |
| Adjusting maximum EV7 CPU count to match assigned PIDs. HP:%d, max PIDs: %d | Operational | Warning | HP number, new max CPUs |
| Partition is unroutable. Fallback Rectangle (%d,%d) (%d,%d) num_RboxReqs: %d | Operational | Failure | ns1, ew1, ns2, ew2, num_RboxReqs |
| Halt on error. HP:%d | Operational | Failure | HP number |
| Can't power on: OCP Switch is off. | Operational | Failure | |
| Can't power on: Drawer will exceed 4 EV7s for ES47 | Operational | Failure | |
| Preparing to power on partition. HP: %s | Operational | OK | HP number |
| No eligible CPUs have memory required to be a primary. | Operational | Failure | |
| Preparing to power off partition. HP: %s | Operational | OK | HP number |
| Resetting partition. HP: %s | Operational | OK | HP number |
| FPGA Load fault | PBM | Failure | |
| Time update distribution failed for hp: %d sp:%d | Operational | Failure | HP number, SP number |
| Partition powered on. HP: %s | Operational | OK | HP number |
| Partition powered off. HP: %s | Operational | OK | HP number |
| Partition reset. HP: %s | Operational | OK | HP number |
| Partition configuration changed. | Operational | OK | |
| CPU Speeds are mixed. | Operational | Failure | HP number |
| Memory range check is disabled | Operational | OK | HP number |

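When alerts are read back from a log, the supplied data can be recovered by matching the message against its format. The sketch below does this for the "IP Cable missing" alert only. It assumes the logged text follows the format string exactly and that port names contain no whitespace; the function name, dictionary keys, and sample port values are hypothetical.

```python
import re

# Pattern derived from the format string
# "IP Cable missing between cab:%d drw:%d port:%s and cab:%d drw:%d port:%s".
# Assumption: port names contain no whitespace.
IP_CABLE_MISSING = re.compile(
    r"IP Cable missing between "
    r"cab:(?P<cab1>\d+) drw:(?P<drw1>\d+) port:(?P<port1>\S+) and "
    r"cab:(?P<cab2>\d+) drw:(?P<drw2>\d+) port:(?P<port2>\S+)"
)


def parse_ip_cable_missing(message):
    """Return the two cable ends as dicts, or None if the text does not match."""
    match = IP_CABLE_MISSING.search(message)
    if match is None:
        return None
    ends = []
    for n in ("1", "2"):
        ends.append({
            "cabinet": int(match["cab" + n]),
            "drawer": int(match["drw" + n]),
            "port": match["port" + n],
        })
    return tuple(ends)


if __name__ == "__main__":
    # The port values "N" and "S" are made up for illustration.
    text = "IP Cable missing between cab:0 drw:1 port:N and cab:0 drw:2 port:S"
    print(parse_ip_cable_missing(text))
```

The same approach extends to other alerts by writing one pattern per format string, with hexadecimal specifiers such as %02X matched by `[0-9A-Fa-f]+` instead of `\d+`.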