B    Firmware Alerts

The tables in this appendix list every alert that the firmware generates, along with the source of each alert, its possible severity levels, and the data supplied in the alert packet. Section 3.3.3.2 describes firmware alerts.

Table B-1:  Firmware Alerts — Environmental Group

Event Description | Source | Severity | Supplied Data
Voltage | MBM, PBM, CMMn | OK, Warning, Failure, Non-Present, Unknown | Locator, Voltage reading
Temperature | MBM, PBM, CMMn | OK, Warning, Failure, Non-Present, Unknown | Locator, Temperature reading
Fan | MBM, PBM | OK, Warning, Failure, Non-Present, Unknown | Locator, Fan RPM value
Intrusion | MBM, PBM | OK (close), Warning (open) | Locator
PS | MBM, PBM | OK, Failure, Non-Present, Unknown | Locator, Specific Error: 0=PS type; 1=AC; 2=POK; 3=PSFail; 4=PFRL; 5=overtemp; 6=AC RMS or AC1; 7=Fan
WPI/SDI | MBM | OK, Failure, Unknown | Locator, Specific Error: 0=PS Type; 1=VAUX or 9V_A; 2=Vcc or 9V_B
IOR | PBM | OK, Failure, Non-Present, Unknown | Locator, Specific Error: 1=Converter failure; 2=BP short; 3=1.8V; 4=2.5V; 5=3.3V; 6=IO7 1.5V; 7=BP 1.5V
EEPROM | MBM, PBM, CMMn | OK, Warning, Failure, Non-Present, Unknown | Locator, Temperature reading
EEPROM | MBM, PBM, CMMn | OK, Warning, Failure, Non-Present, Unknown | Locator
VRM | CMMn | OK, Warning, Failure, Non-Present, Unknown | Locator
Power off drawer due to temp failure | MBM, PBM | Failure |
Power off drawer due to insufficient running fans | MBM, PBM | Failure |
Power off drawer due to unknown failure | MBM, PBM | Failure |
Component has been added | CMMn, PS, IORn | OK |
Component has been removed | CMMn, PS, IORn | OK |
Insufficient running PS | PS | Warning |
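
The Specific Error codes listed for the PS, WPI/SDI, and IOR events in Table B-1 can be turned into a small lookup, which may help when reading alert packets by hand. The following is a minimal sketch only: the code values and their meanings are transcribed from the table, but the names `SPECIFIC_ERRORS` and `describe_specific_error` are hypothetical, and how the code is actually encoded in the alert packet is not specified here.

```python
# Specific Error codes transcribed from Table B-1.
# The dictionary and function names are hypothetical, for illustration only.
SPECIFIC_ERRORS = {
    "PS": {
        0: "PS type",
        1: "AC",
        2: "POK",
        3: "PSFail",
        4: "PFRL",
        5: "overtemp",
        6: "AC RMS or AC1",
        7: "Fan",
    },
    "WPI/SDI": {
        0: "PS Type",
        1: "VAUX or 9V_A",
        2: "Vcc or 9V_B",
    },
    "IOR": {
        1: "Converter failure",
        2: "BP short",
        3: "1.8V",
        4: "2.5V",
        5: "3.3V",
        6: "IO7 1.5V",
        7: "BP 1.5V",
    },
}


def describe_specific_error(event: str, code: int) -> str:
    """Return the table text for a Specific Error code, or a fallback string."""
    codes = SPECIFIC_ERRORS.get(event)
    if codes is None:
        return f"no Specific Error codes listed for {event}"
    # Codes absent from the table (for example 0 for IOR) get a fallback.
    return codes.get(code, f"unlisted code {code} for {event}")


print(describe_specific_error("PS", 5))       # overtemp
print(describe_specific_error("WPI/SDI", 1))  # VAUX or 9V_A
```

The fallback strings are a design choice of this sketch: the table does not say what the firmware reports for codes it does not list.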

Table B-2:  Firmware Alerts — Operational Group

Event Description | Source | Severity | Supplied Data
SYS_SERIAL_NUM is not set | Operational | Warning |
Running with mixed firmware revisions | Operational | Warning |
%s test failure | MBM, PBM | Failure | POST test that failed
Last reset due to watchdog timeout | MBM, PBM, CMMn | Warning |
Server management group is transitioning | Operational | Warning |
Server management group is stable. | Operational | OK |
Power switch state changed | Operational | OK | New state
Error log entry | MBM, PBM | Warning |

Table B-3:  Firmware Alerts — Partition Group

Event Description | Source | Severity | Supplied Data
IP Cable missing between cab:%d drw:%d port:%s and cab:%d drw:%d port:%s | MBM | Warning | Cabinet, drawer, port
Logging PAL EV7 Logout | EV7 | Failure |
Test %02X [%s] failed on cpu [NS: %d EW: %d] | EV7 | Failure | test number, test name, cpu ns, cpu ew
Unable to disable Zbox | EV7 | Failure |
Disabled CPU/IO | EV7, IORn | Failure |
Disabled Zbox1 | EV7 | Failure |
Disabled RAID (remap) | EV7 | Failure |
Disabled Memory | EV7 | Failure |
Disabled: IP Cable cab:%02X drw:%X CPU:%x %s wrap:%d; (%x,%x) to (%x,%x) | EV7 | Failure | cab, drawer, cpu, string, wrap, ns1, ew1, ns2, ew2
Other end of IP Cable not found - cab:%02X drw:%X CPU:%x %s wrap:%d; (%x,%x) | EV7 | Failure | cab, drawer, cpu, string, wrap, ns1, ew1
IO Configured without CPU Memory | EV7 | Warning |
Adjusting maximum EV7 CPU count to match assigned PIDs. HP:%d, max PIDs: %d | Operational | Warning | HP number, new max cpus
Partition is unroutable. Fallback Rectangle (%d,%d) (%d,%d) num_RboxReqs: %d | Operational | Failure | ns1, ew1, ns2, ew2, numRboxReqs
Halt on error. HP:%d | Operational | Failure | HP number
Can't power on: OCP Switch is off. | Operational | Failure |
Can't power on: Drawer will exceed 4 EV7s for ES47 | Operational | Failure |
Preparing to power on partition. HP: %s | Operational | OK | HP number
No eligible CPUs have memory required to be a primary. | Operational | Failure |
Preparing to power off partition. HP: %s | Operational | OK | HP number
Resetting partition. HP: %s | Operational | OK | HP number
FPGA Load fault | PBM | Failure | HP number
Time update distribution failed for hp: %d sp:%d | Operational | Failure | hp number, sp number
Partition powered on. HP: %s | Operational | OK | HP number
Partition powered off. HP: %s | Operational | OK | HP number
Partition reset. HP: %s | Operational | OK | HP number
Partition configuration changed. | Operational | OK |
CPU Speeds are mixed. | Operational | Failure | HP number
Memory range check is disabled | Operational | OK | HP number
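
Several event descriptions in Table B-3 are printf-style templates whose conversion specifiers (`%d`, `%s`, `%02X`) are filled in from the Supplied Data column, in order. As a rough illustration of how a template and its supplied data combine, the sketch below uses Python's `%` operator, which follows the same conventions for these specifiers. The template strings are copied from the table; the dictionary keys, the `render` helper, and all argument values are made up for the example.

```python
# Templates copied from Table B-3. The keys and the sample values passed
# to render() are hypothetical and exist only for this illustration.
TEMPLATES = {
    "test_failed": "Test %02X [%s] failed on cpu [NS: %d EW: %d]",
    "halt_on_error": "Halt on error. HP:%d",
    "time_update_failed": "Time update distribution failed for hp: %d sp:%d",
}


def render(key, *supplied_data):
    """Fill a template with its supplied data, in table order."""
    return TEMPLATES[key] % supplied_data


# Supplied data order for this event: test number, test name, cpu ns, cpu ew.
print(render("test_failed", 0x1A, "memory", 2, 3))
# Test 1A [memory] failed on cpu [NS: 2 EW: 3]

print(render("halt_on_error", 1))
# Halt on error. HP:1
```

Passing the wrong number of values raises `TypeError`, which is a convenient check that the Supplied Data column and the template agree.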

Table B-4:  Firmware Alerts — EV7 Group

Event Description | Source | Severity | Supplied Data
CPU Clock Power Fault | EV7 | Failure |
%s %s has faulted (VRM failure) | CMMn, EV7? | Failure | cpu_id, vrm name
RIMM SPD Checksum failed for RIMM #%d | CMMn | Warning | failed RIMM number
Error writing the PLL clock ratio registers. | CMMn | Failure |
Too many %s VRMs (%d) have failed | CMMn | Failure | vrm type, number failed
Can't reset EV7 with power off | EV7 | Failure |
CPU has timed out during SROM load.\n | EV7 | Failure | cpu number
CPU failed SROM/XSROM load | EV7 | Failure |
CPU has timed out during tepid reset, continuing | EV7 | Failure |
Can't halt EV7 with power off | EV7 | Failure |
SROM port is stuck busy | EV7 | Failure | cpu number
Scan dump on CPU timed out waiting for busy | EV7 | Failure |
Scan dump on CPU timed out | EV7 | Failure |
Can't read CPU EEPROM | EV7 | Failure |
Can't write CPU EEPROM | EV7 | Failure |
srom_check_status: CPU timed out waiting for SROM load status | EV7 | Failure | cpu number
srom_check_status: CPU cannot accept the load image command. | EV7 | Failure |
srom_check_status: CPU timed out waiting for SROM load image status | EV7 | Failure |
srom_check_status: CPU timed out on XSROM version command. | EV7 | Failure |
Error in load image to EV7 | EV7 | Failure |
OCLA %d was found running. Clearing RUN | EV7 | Failure | ocla
OCLA %d was found disabled. Setting Enable | EV7 | Failure | ocla
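
When alert text from Table B-4 shows up in a log, it can be useful to recover the supplied data from the rendered message. One possible approach, sketched below, is to convert a template into a regular expression and match it against the message. This is an assumption-laden example rather than anything the firmware provides: `template_to_regex` and `extract_fields` are hypothetical helpers, only the `%d`, `%s`, `%x`, and `%X` specifiers are handled, and the sample messages are invented.

```python
import re

# Matches the conversion specifiers used in the tables, e.g. %d, %s, %02X.
CONVERSION = re.compile(r"%0?\d*([dsxX])")

# Capture group used for each conversion type (hypothetical choices).
FIELD_PATTERNS = {
    "d": r"(-?\d+)",
    "s": r"(.+?)",
    "x": r"([0-9a-fA-F]+)",
    "X": r"([0-9a-fA-F]+)",
}


def template_to_regex(template):
    """Build a regex whose groups line up with the template's specifiers."""
    pieces = []
    pos = 0
    for match in CONVERSION.finditer(template):
        # Literal text between specifiers is escaped so it matches verbatim.
        pieces.append(re.escape(template[pos:match.start()]))
        pieces.append(FIELD_PATTERNS[match.group(1)])
        pos = match.end()
    pieces.append(re.escape(template[pos:]))
    return re.compile("".join(pieces))


def extract_fields(template, message):
    """Return the captured fields as strings, or None if nothing matches."""
    match = template_to_regex(template).fullmatch(message)
    return match.groups() if match else None


# Templates are from Table B-4; the messages are made-up examples.
print(extract_fields("RIMM SPD Checksum failed for RIMM #%d",
                     "RIMM SPD Checksum failed for RIMM #3"))
print(extract_fields("Too many %s VRMs (%d) have failed",
                     "Too many CPU VRMs (2) have failed"))
```

Fields come back as strings in template order, so they can be paired with the names in the Supplied Data column (for example, vrm type and number failed).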