advertisement
Memory Subsystem
Byte Field
12 Sensor Number
13 Event Direction and
Event Type
14 Event Data 1
15 Event Data 2
16 Event Data 3
Description
13h
[7] Event direction
0b = Assertion Event
1b = Deassertion Event
[6:0] Event Type = 09h (digital Discrete)
[7:6] – 10b = OEM code in Event Data 2
[5:4] – 00b = Unspecified Event Data 3
[3:0] – Event Trigger Offset as described in Table 57
Not used
Not used
Table 57: Sparing Configuration Status Sensor Event Trigger Offset – Next Steps
Hex
Event Trigger Offset
Description
01h The system has configured into
Spare Channel RAS mode.
00h The system has configured out of
Spare Channel RAS mode
Description
Sparing mode is enabled in setup.
Sparing mode is disabled, either from setup or due to error in which case post error
8500 also occurs.
Informational event only.
Next Steps
1. If this event is accompanied by a post error 8500, there was a problem applying the sparing configuration to the memory. Check for other errors related to the memory and troubleshoot accordingly.
2. If there is no post error then sparing mode was simply disabled in BIOS setup and this should be considered informational only.
7.1.4 Sparing Redundancy State Sensor
This sensor provides the RAS Redundancy state for the Spare Channel Mode.
50 Intel order number G74211-002 Revision 1.1
System Event Log Troubleshooting Guide for Intel
®
Byte
8
9
Field
Generator ID
11 Sensor Type
12 Sensor Number
13 Event Direction and
Event Type
14 Event Data 1
15 Event Data 2
Table 58: Sparing Redundancy State Sensor Typical Characteristics
Description
0001h = BIOS POST
0ch = Memory
11h
[7] Event direction
0b = Assertion Event
1b = Deassertion Event
[6:0] Event Type = 0Bh (Generic Discrete)
[7:6] – 10b = OEM code in Event Data 2
[5:4] – 10b = OEM code in Event Data 3
[3:0] – Event Trigger Offset as described in Table 59
[7:4] – If Domain Instance Type (ED3) is set to Local, this field specifies the 0-based Socket ID of the processor that contains the sparing domain local sub-instances.
A value of 1110b indicates that the sparing configuration specified in Bits [3:0] applies globally to all sockets in the system.
If Domain Instance Type (ED3) is set to Global, this field specifies the 0-based Socket
ID of the second participant processor in this sparing domain global instance.
A value of 1111b indicates that this field is unused and does not contain valid data.
[3:0] – If Domain Instance Type (ED3) is set to Local, this field specifies the sparing domain local sub-instances – which channels are included in this sub-instance:
0000b – Reserved
0001b – {Ch A, Ch B, Ch C} (only configuration possible on Intel
®
S5500/S5520
Server Boards)
0010b-1110b – Reserved
If Domain Instance Type (ED3) is set to Global, this field specifies the 0-based Socket
ID of the first participant processor in this sparing domain global instance.
A value of 1111b indicates that this field is unused and does not contain valid data.
Memory Subsystem
Revision 1.1 Intel order number G74211-002 51
Memory Subsystem
Byte Field
16 Event Data 3
Description
[7] – Domain Instance Type
0b: Local memory sparing domain instance. This SEL pertains to a local memory sparing domain that is restricted to memory sparing pairs within a processor socket only.
1b: Global memory sparing domain instance. This SEL pertains to a global memory sparing domain that pertains to memory sparing between processor sockets.
[6:4] – Reserved
[3:0] – 0-based Instance ID of this sparing domain
Table 59: Sparing Redundancy State Sensor Event Trigger Offset – Next Steps
Hex
Event Trigger Offset
Description
01h Memory is configured in Spare
Channel Mode, and the memory is operating in the fully redundant state, with the spare channel inactive and available.
00h Memory is configured in Spare
Channel Mode, and the memory has lost redundancy and is operating in the degraded state, with the spare channel active and used to replace a failed channel.
Description
System boots with spare channel mode active, one entry per processor.
Spare channel replaces failing channel, one SEL entry for processor with failing memory to signify loss of redundancy.
Informational event.
Next Steps
This event should be accompanied by memory errors indicating the source of the issue. Troubleshoot accordingly (probably replace affected DIMM).
52 Intel order number G74211-002 Revision 1.1
advertisement
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Related manuals
advertisement
Table of contents
- 11 Introduction
- 11 Purpose
- 11 Industry Standard
- 11 Intelligent Platform Management Interface (IPMI)
- 12 Baseboard Management Controller (BMC)
- 13 Intelligent Power Node Manager Version
- 14 Basic Decoding of a SEL Record
- 14 Default Values in the SEL Records
- 18 Sensor Cross Reference List
- 18 BMC owned Sensors (GID = 0020h)
- 22 BIOS POST owned Sensors (GID = 0001h)
- 22 BIOS SMI owned Sensors (GID = 0033h)
- 24 Hot Swap Controller Firmware owned Sensors (GID = 00C0h/00C2h)
- 25 Node Manager / ME Firmware owned Sensors (GID = 002Ch or 602Ch)
- 26 Microsoft* OS owned Events (GID = 0041)
- 26 Linux* Kernel Panic Events (GID = 0021)
- 27 Power Subsystems
- 27 Voltage Sensors
- 31 Power Unit
- 31 Power Unit Status Sensor
- 32 Power Unit Redundancy Sensor
- 34 Power Supply
- 34 Power Supply Status Sensors
- 35 Power Supply AC Power Input Sensors
- 36 Power Supply Current Output % Sensors
- 37 Power Supply Temperature Sensors
- 39 Cooling Subsystem
- 39 Fan Sensors
- 39 Fan Speed Sensors
- 40 Fan Presence and Redundancy Sensors
- 43 Temperature Sensors
- 43 Regular Temperature Sensors
- 45 Thermal Margin Sensors
- 46 Processor Thermal Control % Sensors
- 47 Discrete Thermal Sensors
- 49 Processor Subsystem
- 49 Processor Status Sensor
- 50 Catastrophic Error Sensor
- 51 Catastrophic Error Sensor – Next Steps
- 51 CPU Missing Sensor
- 52 CPU Missing Sensor – Next Steps
- 52 QuickPath Interconnect Error Sensors
- 52 QPI Correctable Error Sensor
- 53 QPI Non-Fatal Error Sensor
- 54 QPI Fatal and Fatal
- 56 Memory Subsystem
- 56 Memory RAS Mirroring and Sparing
- 56 Mirroring Configuration Status
- 57 Mirrored Redundancy State Sensor
- 59 Sparing Configuration Status
- 60 Sparing Redundancy State Sensor
- 63 ECC and Address Parity
- 63 Memory Correctable and Uncorrectable ECC Error
- 65 Memory Address Parity Error
- 68 PCI Express* and Legacy PCI Subsystem
- 68 PCI Express* Errors
- 68 PCI Express* Correctable Errors
- 69 PCI Express* Fatal Errors
- 71 Legacy PCI Errors
- 73 System BIOS Events
- 73 System Events
- 73 System Boot
- 73 Timestamp Clock Synchronization
- 74 System Firmware Progress (Formerly Post Error)
- 75 System Firmware Progress (Formerly Post Error) – Next Steps
- 81 Chassis Subsystem
- 81 Physical Security
- 81 Chassis Intrusion
- 81 LAN Leash Lost
- 83 FP (NMI) Interrupt
- 83 FP (NMI) Interrupt – Next Steps
- 84 Button Press Events
- 85 Miscellaneous Events
- 85 IPMI Watchdog
- 87 SMI Timeout
- 87 SMI Timeout – Next Steps
- 88 System Event Log Cleared
- 88 System Event – PEF Action
- 89 System Event – PEF Action – Next Steps
- 90 Hot Swap Controller Events
- 90 HSC Backplane Temperature Sensor
- 91 HSC Drive Slot Status Sensor
- 92 HSC Drive Slot Status Sensor – Next Steps
- 92 HSC Drive Presence Sensor
- 93 HSC Drive Presence Sensor – Next Steps
- 95 Manageability Engine (ME) Events
- 95 Node Manager Exception Event
- 96 Node Manager Exception Event – Next Steps
- 96 Node Manager Health Event
- 97 Node Manager Health Event – Next Steps
- 98 Node Manager Operational Capabilities Change
- 99 Node Manager Operational Capabilities Change – Next Steps
- 100 Node Manager Alert Threshold Exceeded
- 101 Node Manager Alert Threshold Exceeded – Next Steps
- 101 ME Firmware Health Event
- 102 ME Firmware Health Event – Next Steps
- 103 Microsoft Windows* Records
- 103 Boot-up Event Records
- 104 Shutdown Event Records
- 107 Bug Check / Blue Screen Event Records
- 109 Linux* Kernel Panic Records