Firmware Alerts Table

Firmware Alerts in the following table are grouped into four general categories for ease of use.  The actual source of that particular alert is displayed in the "Source" column.

CPU EV7

Environmental

Operational

Partition

See the Alert Severity Table for definitions of Alert Severity statuses.

CPU EV7 Category of Firmware Alerts

Event Description

Source

Severity

Supplied Data

CPU Clock Power Fault

EV7

Failure

 

%s %s has faulted (VRM failure)

CMMn, EV7?

Failure

cpu_id, vrm name

RIMM SPD Checksum failed for RIMM #%d

CMMn

Warning

Failed RIMM number

Error writing the PLL clock ratio registers.

CMMn

Failure

 

Error programming the VID register.

CMMn

Failure

 

Too many %s VRMs (%d) have failed.

CMMn

Failure

Vrm type, number failed

Can't reset EV7 with power off.

EV7

Failure

 

CPU has timed out during SROM load.\n

EV7

Failure

CPU number

CPU failed SROM/XSROM load.

EV7

Failure

 

CPU has timed out during tepid reset, continuing.

EV7

Failure

 

Can't halt EV7 with power off.

EV7

Failure

 

SROM port is stuck busy.

EV7

Failure

 

Scan dump on CPU timed out waiting for busy.

EV7

Failure

CPU number

Scan dump on CPU timed out.

EV7

Failure

 

Can't read CPU EEPROM

EV7

Failure

 

Can't write CPU EEPROM

EV7

Failure

 

srom_check_status: CPU timed out waiting for SROM load status.

EV7

Failure

CPU number

srom_check_status: CPU can not accept the load image command.

EV7

Failure

 

srom_check_status: CPU timed out waiting for SROM load image status.

EV7

Failure

 

srom_check_status: CPU timed out on XSROM version command.

EV7

Failure

 

Error in load image to EV7

EV7

Failure

 

OCLA %d was found running. Clearing RUN.

EV7

Failure

ocla

OCLA %d was found disabled. Setting Enable.

EV7

Failure

ocla

 

Environmental Category of Firmware Alerts

Event Description

Source

Severity

Supplied Data

Voltage

MBM, PBM, CMMn

OK,
Warning, Failure,
Non-Present, Unknown

Locator,
Voltage reading

Temperature

MBM, PBM, CMMn

OK,
Warning, Failure,
Non-Present, Unknown

Locator,
Temperature reading

Fan

MBM, PBM

OK,
Warning, Failure,
Non-Present, Unknown

Locator,
Fan RPM value

Intrusion

MBM, PBM

OK (close), Warning (open)

Locator

PS

MBM, PBM

OK,
Failure,
Non-Present, Unknown

Locator,
Specific Error:
0=PS type;
1=AC;
2=POK;
3=PSFail;
4=PFRL;
5=overtemp;
6=AC RMS or AC1;
7=Fan

WPI/SDI

MBM

OK,
Failure, Unknown

Locator,
Specific Error:
0=PS Type;
1=VAUX or 9V_A; 2=Vcc or 9V_B

IOR

PBM

OK,
Failure,
Non-Present, Unknown

Locator,
Specific Error: 1=Converter failure; 2=BP short; 3=1.8V; 4=2.5V;
5=3.3V;
6=IO7 1.5V;
7=BP 1.5V

EEPROM

MBM, PBM, CMMn

OK,
Failure,
Non-Present, Unknown

Locator

VRM

CMMn

OK,
Failure,
Non-Present, Unknown

Locator

Power off drawer due to temp failure

MBM, PBM

Failure

 

Power off drawer due to insufficient running fans

MBM, PBM

Failure

 

Power off drawer due to unknown failure

MBM, PBM

Failure

 

Component has been added

CMMn, PS, IORn

OK

 

Component has been removed

CMMn, PS, IORn

OK

 

Insufficient running PS

PS

Warning

 

 

Operational Category of Firmware Alerts

Event Description

Source

Severity

Supplied Data

SYS_SERIAL_NUM is not set

Operational

Warning

 

Running with mixed firmware revisions.

Operational

Warning

 

%s test failure.

MBM, PBM

Failure

POST test that failed

Last reset due to watchdog timeout

MBM, PBM, CMMn

Warning

 

Server management group is transitioning.

Operational

Warning

 

Server management group is stable.

Operational

OK

 

Power switch state changed

Operational

OK

New state

Error log entry

MBM, PBM

Warning

 

 

Partition Category of Firmware Alerts

Event Description

Source

Severity

Supplied Data

IP Cable missing between cab:%d drw:%d port:%s and cab:%d drw:%d port:%s

MBM

Warning

Cabinet, drawer, port

Logging PAL EV7 Logout

EV7

Failure

 

Test %02X [%s] failed on cpu [NS: %d EW: %d]

EV7

Failure

Test number, test name, CPU ns, cpu ew

Unable to disable Zbox

EV7

Failure

 

Disabled CPU/IO

EV7, IORn

Failure

 

Disabled Zbox1

EV7

Failure

 

Disabled RAID (remap)

EV7

Failure

 

Disabled Memory

EV7

Failure

 

Disabled: IP Cable cab:%02X drw:%X CPU:%x %s wrap:%d; (%x,%x) to (%x,%x)

EV7

Failure

cab, drawer, CPU, string, wrap, ns1, ew1, ns2, ew2

Other end of IP Cable not found - cab:%02X drw:%X CPU:%x %s wrap:%d; (%x,%x)

EV7

Failure

cab, drawer, cpu, string, wrap, ns1, ew1

IO Configured without CPU Memory

EV7

Warning

 

Adjusting maximum EV7 CPU count to match assigned PIDs. HP:%d, max PIDs: %d

Operational

Warning

HP number, new max CPUs

Partition is unroutable.  Fallback Rectangle (%d,%d) (%d,%d)   num_RboxReqs: %d

Operational

Failure

ns1, ew1, ns2, ew2, numRboxRegqs

Halt on error. HP:%d

Operational

Failure

HP number

Can't power on: OCP Switch is off.

Operational

Failure

 

Can't power on: Drawer will exceed 4 EV7s for ES47

Operational

Failure

 

Preparing to power on partition. HP: %s

Operational

OK

HP number

No eligible CPUs have memory required to be a primary.

Operational

Failure

 

Preparing to power off partition. HP: %s

Operational

OK

HP number

Resetting partition. HP: %s

Operational

OK

HP number

FPGA Load fault

PBM

Failure

 

Time update distribution failed for hp: %d sp:%d

Operational

Failure

HP number, SP number

Partition powered on. HP: %s

Operational

OK

HP number

Partition powered off. HP: %s

Operational

OK

HP number

Partition reset. HP: %s

Operational

OK

HP number

Partition configuration changed.

Operational

OK

 

CPU Speeds are mixed.

Operational

Failure

HP number

Memory range check is disabled

Operational

OK

HP number