advertisement
System Event Log Troubleshooting Guide for Intel
®
Miscellaneous Events
11.2 SMI Timeout
SMI stands for system management interrupt and is an interrupt that gets generated so the processor can service server management events
(typically memory or PCI errors, or other forms of critical interrupts), in order to log them to the SEL. If this interrupt times out, the system is frozen.
Table 78: SMI Timeout Sensor Typical Characteristics
Byte Field
11 Sensor Type
12 Sensor Number
13 Event Direction and
Event Type
14 Event Data 1
15 Event Data 2
16 Event Data 3
11.2.1 SMI Timeout – Next Steps
This event normally only occurs after another more critical event.
Description
F3h = SMI Timeout
06h
[7] Event direction
0b = Assertion Event
1b = Deassertion Event
[6:0] Event Type = 03h (“digital” Discrete)
[7:6] – 00b = Unspecified Event Data 2
[5:4] – 00b = Unspecified Event Data 3
[3:0] – Event Trigger Offset = 1 = State Asserted
Not used
Not used
1. Check the SEL for any critical interrupts, memory errors, bus errors, PCI errors, or any other serious errors.
2. If these are not present, the system locked up before it was able to log the original issue. In this case, low level debug is normally required.
Revision 1.1 Intel order number G74211-002 77
advertisement
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Related manuals
advertisement
Table of contents
- 11 Introduction
- 11 Purpose
- 11 Industry Standard
- 11 Intelligent Platform Management Interface (IPMI)
- 12 Baseboard Management Controller (BMC)
- 13 Intelligent Power Node Manager Version
- 14 Basic Decoding of a SEL Record
- 14 Default Values in the SEL Records
- 18 Sensor Cross Reference List
- 18 BMC owned Sensors (GID = 0020h)
- 22 BIOS POST owned Sensors (GID = 0001h)
- 22 BIOS SMI owned Sensors (GID = 0033h)
- 24 Hot Swap Controller Firmware owned Sensors (GID = 00C0h/00C2h)
- 25 Node Manager / ME Firmware owned Sensors (GID = 002Ch or 602Ch)
- 26 Microsoft* OS owned Events (GID = 0041)
- 26 Linux* Kernel Panic Events (GID = 0021)
- 27 Power Subsystems
- 27 Voltage Sensors
- 31 Power Unit
- 31 Power Unit Status Sensor
- 32 Power Unit Redundancy Sensor
- 34 Power Supply
- 34 Power Supply Status Sensors
- 35 Power Supply AC Power Input Sensors
- 36 Power Supply Current Output % Sensors
- 37 Power Supply Temperature Sensors
- 39 Cooling Subsystem
- 39 Fan Sensors
- 39 Fan Speed Sensors
- 40 Fan Presence and Redundancy Sensors
- 43 Temperature Sensors
- 43 Regular Temperature Sensors
- 45 Thermal Margin Sensors
- 46 Processor Thermal Control % Sensors
- 47 Discrete Thermal Sensors
- 49 Processor Subsystem
- 49 Processor Status Sensor
- 50 Catastrophic Error Sensor
- 51 Catastrophic Error Sensor – Next Steps
- 51 CPU Missing Sensor
- 52 CPU Missing Sensor – Next Steps
- 52 QuickPath Interconnect Error Sensors
- 52 QPI Correctable Error Sensor
- 53 QPI Non-Fatal Error Sensor
- 54 QPI Fatal and Fatal
- 56 Memory Subsystem
- 56 Memory RAS Mirroring and Sparing
- 56 Mirroring Configuration Status
- 57 Mirrored Redundancy State Sensor
- 59 Sparing Configuration Status
- 60 Sparing Redundancy State Sensor
- 63 ECC and Address Parity
- 63 Memory Correctable and Uncorrectable ECC Error
- 65 Memory Address Parity Error
- 68 PCI Express* and Legacy PCI Subsystem
- 68 PCI Express* Errors
- 68 PCI Express* Correctable Errors
- 69 PCI Express* Fatal Errors
- 71 Legacy PCI Errors
- 73 System BIOS Events
- 73 System Events
- 73 System Boot
- 73 Timestamp Clock Synchronization
- 74 System Firmware Progress (Formerly Post Error)
- 75 System Firmware Progress (Formerly Post Error) – Next Steps
- 81 Chassis Subsystem
- 81 Physical Security
- 81 Chassis Intrusion
- 81 LAN Leash Lost
- 83 FP (NMI) Interrupt
- 83 FP (NMI) Interrupt – Next Steps
- 84 Button Press Events
- 85 Miscellaneous Events
- 85 IPMI Watchdog
- 87 SMI Timeout
- 87 SMI Timeout – Next Steps
- 88 System Event Log Cleared
- 88 System Event – PEF Action
- 89 System Event – PEF Action – Next Steps
- 90 Hot Swap Controller Events
- 90 HSC Backplane Temperature Sensor
- 91 HSC Drive Slot Status Sensor
- 92 HSC Drive Slot Status Sensor – Next Steps
- 92 HSC Drive Presence Sensor
- 93 HSC Drive Presence Sensor – Next Steps
- 95 Manageability Engine (ME) Events
- 95 Node Manager Exception Event
- 96 Node Manager Exception Event – Next Steps
- 96 Node Manager Health Event
- 97 Node Manager Health Event – Next Steps
- 98 Node Manager Operational Capabilities Change
- 99 Node Manager Operational Capabilities Change – Next Steps
- 100 Node Manager Alert Threshold Exceeded
- 101 Node Manager Alert Threshold Exceeded – Next Steps
- 101 ME Firmware Health Event
- 102 ME Firmware Health Event – Next Steps
- 103 Microsoft Windows* Records
- 103 Boot-up Event Records
- 104 Shutdown Event Records
- 107 Bug Check / Blue Screen Event Records
- 109 Linux* Kernel Panic Records