Using LCD Panel Interface. Dell Chassis Management Controller Version 5.20 for PowerEdge M1000E

Using LCD Panel Interface

You can use the LCD panel on the chassis to perform configuration and diagnostics, and to obtain status information about the chassis and its contents.

The following figure illustrates the LCD panel. The LCD screen displays menus, icons, pictures, and messages.

Figure 11. LCD Display

1. LCD screen
2. Selection ("check") button
3. Scroll buttons (4)
4. Status indicator LED

Related links

LCD Navigation

Diagnostics

LCD Hardware Troubleshooting

Front Panel LCD Messages

LCD Error Messages

LCD Module and Server Status Information

LCD Navigation

The right side of the LCD panel contains five buttons: four arrow buttons (up, down, left, and right) and a center button.

To move between screens, use the right (next) and left (previous) arrow buttons. At any time while using the panel, you can return to a previous screen.

To scroll through options on a screen, use the down and up arrow buttons.

To select and save an item on a screen and move to the next screen, use the center button.


The up, down, left, and right arrow buttons change the selected menu items or icons on the screen. The selected item is shown with a light blue background or border.

When messages displayed on the LCD screen are longer than what fits on the screen, use the left and right arrow buttons to scroll the text left and right.

The icons described in the following table are used to navigate between LCD screens.

Table 45. LCD Panel Navigational Icons

Icon Normal Icon Highlighted Icon Name and Description

Back — Highlight and press the center button to return to the previous screen.

Accept/Yes — Highlight and press the center button to accept a change and return to the previous screen.

Skip/Next — Highlight and press the center button to skip any changes and go to the next screen.

No — Highlight and press the center button to answer "No" to a question and go to the next screen.

Rotate — Highlight and press the center button to switch between the front and rear graphical views of the chassis.

NOTE: The amber background indicates that the opposite view has errors.

Component Identify — Blinks the blue LED on a component.

NOTE: There is a blinking blue rectangle around this icon when Component Identify is enabled.

A status indicator LED on the LCD panel provides an indication of the overall health of the chassis and its components.

• Solid blue indicates good health.

• Blinking amber indicates that at least one component has a fault condition.

• Blinking blue is an ID signal, used to identify one chassis in a group of chassis.
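The three LED patterns listed above can be treated as a small lookup table, for example in a script that records what a technician sees on the front panel. The sketch below is only an illustration: the names `ChassisLedState`, `LED_MEANING`, and `describe_led` are hypothetical and are not part of any Dell tool.

```python
from enum import Enum


class ChassisLedState(Enum):
    """LED patterns described for the LCD panel status indicator."""
    SOLID_BLUE = "solid blue"
    BLINKING_AMBER = "blinking amber"
    BLINKING_BLUE = "blinking blue"


# Meanings taken from the bullet list above.
LED_MEANING = {
    ChassisLedState.SOLID_BLUE: "good health",
    ChassisLedState.BLINKING_AMBER: "at least one component has a fault condition",
    ChassisLedState.BLINKING_BLUE: "ID signal identifying this chassis in a group",
}


def describe_led(color: str, blinking: bool) -> str:
    """Return the documented meaning of an observed LED pattern."""
    pattern = f"{'blinking' if blinking else 'solid'} {color.lower()}"
    for state, meaning in LED_MEANING.items():
        if state.value == pattern:
            return meaning
    # Other patterns (for example solid amber or off) are not covered by this list.
    return "not described in this section"
```

Patterns outside the list, such as solid amber or an unlit LED, are covered under LCD Hardware Troubleshooting, so the helper deliberately returns a neutral string for them.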

Related links

Main Menu

LCD Setup Menu

Language Setup Screen

Default Screen

Graphical Server Status Screen

Graphical Module Status Screen

Enclosure Menu Screen

Module Status Screen

Enclosure Status Screen

IP Summary Screen


Main Menu

From the Main menu, you can navigate to one of the following screens:

• LCD Setup Menu — select the language to use and the LCD screen that displays when no one is using the LCD.

• Server — displays status information for servers.

• Enclosure — displays status information for the chassis.

Use the up and down arrow buttons to highlight an item.

Press the center button to activate your selection.

LCD Setup Menu

The LCD Setup menu displays a menu of items that can be configured:

• Language Setup — choose the language you want to use for LCD screen text and messages.

• Default Screen — choose the screen that displays when there is no activity on the LCD panel.

Use the up and down arrow buttons to highlight an item in the menu or highlight the Back icon if you want to return to the Main menu.

Press the center button to activate your selection.

Language Setup Screen

The Language Setup screen allows you to select the language used for LCD panel messages. The currently active language is highlighted with a light blue background.

1. Use the up, down, left, and right arrow buttons to highlight the desired language.

2. Press the center button. The Accept icon appears and is highlighted.

3. Press the center button to confirm the change. The LCD Setup menu is displayed.

Default Screen

The Default Screen allows you to change the screen that the LCD panel displays when there is no activity at the panel. The factory default screen is the Main Menu. You can choose from the following screens to display:

• Main Menu

• Server Status (front graphical view of the chassis)

• Module Status (rear graphical view of the chassis)

• Custom (Dell logo with chassis name)

The currently active default screen is highlighted in light blue.

1. Use the up and down arrow buttons to highlight the screen you want to set to the default.

2. Press the center button. The Accept icon is highlighted.

3. Press the center button again to confirm the change. The Default Screen is displayed.


Graphical Server Status Screen

The Graphical Server Status screen displays icons for each server installed in the chassis and indicates the general health status for each server. The server health is indicated by the color of the server icon:

• Gray — server is off with no errors

• Green — server is on with no errors

• Yellow — server has one or more non-critical errors

• Red — server has one or more critical errors

• Black — server is not present
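The color rules above can be written as a small decision function. This is a minimal sketch, not firmware logic: the function name and parameters are hypothetical, and the precedence (absence first, then critical errors, then non-critical errors, then power state) is an assumption inferred from the list.

```python
def server_icon_color(present: bool, powered_on: bool,
                      critical_errors: int = 0,
                      noncritical_errors: int = 0) -> str:
    """Map a server's state to the icon color described in the list above."""
    if not present:
        return "black"      # server is not present
    if critical_errors > 0:
        return "red"        # one or more critical errors
    if noncritical_errors > 0:
        return "yellow"     # one or more non-critical errors
    # No errors: the color only reflects the power state.
    return "green" if powered_on else "gray"
```

For example, `server_icon_color(True, False)` gives `"gray"` (off with no errors), while any critical error yields `"red"` regardless of power state under the assumed precedence.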

A blinking light blue rectangle around a server icon indicates that the server is highlighted.

To view the Graphical Module Status screen, highlight the rotate icon, and press the center button.

To view the status screen for a server, use the arrow buttons to highlight the desired server, and press the center button. The Server Status screen displays.

To return to the Main Menu, use the arrow buttons to highlight the Back icon, and press the center button.

Graphical Module Status Screen

The Graphical Module Status screen displays all modules installed in the rear of the chassis and provides summary health information for each module. Module health is indicated by the color of each module icon as follows:

• Gray — module is off or on standby with no errors

• Green — module is on with no errors

• Yellow — module has one or more non-critical errors

• Red — module has one or more critical errors

• Black — module is not present

A blinking light blue rectangle around a module icon indicates that the module is highlighted.

To view the Graphical Server Status screen, highlight the rotate icon, and press the center button.

To view the status screen for a module, use the up, down, left, and right arrow buttons to highlight the desired module, and press the center button. The Module Status screen displays.

To return to the Main Menu, use the arrow buttons to highlight the Back icon, and press the center button. The Main Menu displays.

Enclosure Menu Screen

From this screen, you can navigate to the following screens:

• Module Status screen

• Enclosure Status screen

• IP Summary screen

• Main Menu

Use the navigation buttons to highlight the desired item (highlight the Back icon to return to the Main Menu) and press the center button. The selected screen displays.

Module Status Screen

The Module Status screen displays information and error messages about a module. For messages that can appear on this screen, see LCD Module and Server Status Information and LCD Error Messages.

Use the up and down arrow keys to move through messages. Use the left and right arrow keys to scroll messages that do not fit on the screen.

Highlight the Back icon and press the center button to return to the Graphical Module Status screen.


Enclosure Status Screen

The Enclosure Status screen displays information and error messages about the enclosure. For messages that can appear on this screen, see LCD Error Messages.

Use the up and down arrow keys to move through messages. Use the left and right arrow keys to scroll messages that do not fit on the screen.

Highlight the Back icon and press the center button to return to the Enclosure Menu screen.

IP Summary Screen

The IP Summary screen shows IP information for the CMC and for the iDRAC of each installed server.

Use the up and down arrow buttons to scroll through the list. Use the left and right arrow buttons to scroll selected messages that are longer than the screen.

Use the up and down arrow buttons to select the Back icon and press the center button to return to the Enclosure menu.
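The screens described in the preceding sections form a small hierarchy rooted at the Main Menu. The sketch below models one reading of that hierarchy as a dictionary and finds the shortest sequence of screens to a target. The link table is an assumption pieced together from the section text (in particular, that the Server and Enclosure menu items open the Graphical Server Status screen and the Enclosure Menu); `SCREEN_LINKS` and `path_to` are hypothetical names.

```python
from collections import deque

# Forward links between screens, as read from the sections above (assumed).
SCREEN_LINKS = {
    "Main Menu": ["LCD Setup Menu", "Graphical Server Status", "Enclosure Menu"],
    "LCD Setup Menu": ["Language Setup", "Default Screen"],
    "Graphical Server Status": ["Server Status", "Graphical Module Status"],
    "Graphical Module Status": ["Module Status", "Graphical Server Status"],
    "Enclosure Menu": ["Module Status", "Enclosure Status", "IP Summary"],
}


def path_to(target, start="Main Menu"):
    """Breadth-first search for the shortest screen sequence from start to target."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        for nxt in SCREEN_LINKS.get(path[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None  # the target is not reachable in this model
```

Under these assumptions, `path_to("IP Summary")` passes through the Enclosure Menu, which matches the Back behavior described for the IP Summary screen.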

Diagnostics

The LCD panel helps you to diagnose problems with any server or module in the chassis. If there is a problem or fault with the chassis or any server or other module in the chassis, the LCD panel status indicator blinks amber. On the Main Menu an icon with an amber background displays next to the menu item—Server or Enclosure—that leads to the faulty server or module.

By following the amber icons through the LCD menu system, you can display the status screen and error messages for the item that has the problem.

Error messages on the LCD panel can be removed by removing the module or server that is the cause of the problem or by clearing the hardware log for the module or server. For server errors, use the iDRAC Web interface or command line interface to clear the server’s System Event Log (SEL). For chassis errors, use the CMC Web interface or command line interface to clear the hardware log.
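When the logs are cleared from the command line, the RACADM invocation can be assembled by a small helper. The sketch below only builds the argument list and does not run anything. It assumes remote RACADM is installed on the management station and uses the `-r`, `-u`, and `-p` remote options with the `getsel` and `clrsel` subcommands; confirm these against the RACADM reference guide for your firmware version. The helper names, the address 192.0.2.10, and the credentials are placeholders.

```python
def remote_racadm(host: str, user: str, password: str, subcommand: str) -> list:
    """Build (but do not execute) a remote RACADM command line as an argv list."""
    return ["racadm", "-r", host, "-u", user, "-p", password, subcommand]


def view_log_command(host: str, user: str, password: str) -> list:
    # getsel: list the event log entries before deciding to clear them (assumed subcommand).
    return remote_racadm(host, user, password, "getsel")


def clear_log_command(host: str, user: str, password: str) -> list:
    # clrsel: clear the event log on the addressed controller (assumed subcommand).
    return remote_racadm(host, user, password, "clrsel")
```

The returned list could be passed to `subprocess.run(...)`. Pointing `host` at a CMC address targets the chassis hardware log, and pointing it at an iDRAC address targets that server's SEL.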

LCD Hardware Troubleshooting

If you are experiencing issues with the LCD in relation to your use of CMC, use the following hardware troubleshooting items to determine if there is an LCD hardware or connection issue.

Figure 12. Removing and Installing LCD Module

1. Cable cover
2. LCD module
3. Ribbon cable
4. Hinges (2)
5. Screws (2)

Table 46. LCD Hardware Troubleshooting Items

Symptom | Issue | Recovery Action
Alert screen message CMC Not Responding and LED is blinking amber. | Loss of communication from CMC to the LCD front panel. | Check that CMC is booting; then, reset CMC using GUI or RACADM commands.
Alert screen message CMC Not Responding and LED is solid amber or is off. | LCD module communications is stuck during a CMC fail-over or reboots. | Review the hardware log using the GUI or RACADM commands. Look for a message that states: Can not communicate with LCD controller. Reseat the LCD module ribbon cable.
Screen text is scrambled. | Defective LCD screen. | Replace the LCD module.
LED and LCD is off. | The LCD cable is not connected properly or is faulty; or the LCD module is faulty. | Review the hardware log using the GUI or RACADM commands. Look for a message that states "The LCD module cable is not connected, or is improperly connected." or "The control panel cable is not connected, or is improperly connected." Reseat cables.
LCD screen message No CMC Found. | No CMC is present in the chassis. | Insert a CMC into the chassis or reseat existing CMC if present.

Front Panel LCD Messages

This section contains two subsections that list error and status information that is displayed on the front panel LCD.

Error messages on the LCD have a format that is similar to the System Event Log (SEL) viewed from the CLI or Web interface.

The tables in the error section list the error and warning messages that are displayed on the various LCD screens and the possible cause of each message. Text enclosed in angle brackets (< >) indicates that the text may vary.

Status information on the LCD includes descriptive information about the modules in the chassis. The tables in this section describe the information that is displayed for each component.
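Because the variable parts of each message are written in angle brackets, a message template from the tables can be turned into a pattern for matching concrete log lines. The following is a minimal sketch of that idea; `template_to_regex` and `match_message` are hypothetical helper names, and real LCD or log output may differ in spacing or punctuation.

```python
import re

# A placeholder is any run of text enclosed in angle brackets, e.g. <number>.
PLACEHOLDER = re.compile(r"<[^<>]+>")


def template_to_regex(template: str) -> "re.Pattern":
    """Convert a documented message template into an anchored regular expression."""
    literal_parts = PLACEHOLDER.split(template)
    # Escape the fixed text and put a non-greedy capture group where each placeholder was.
    pattern = "(.+?)".join(re.escape(part) for part in literal_parts)
    return re.compile("^" + pattern + "$")


def match_message(template: str, message: str):
    """Return the placeholder values if the message fits the template, else None."""
    found = template_to_regex(template).match(message)
    return list(found.groups()) if found else None
```

For instance, matching "Fan 3 is removed." against the template "Fan <number> is removed." yields the single value "3", and a line that does not fit the template yields `None`.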

LCD Error Messages

Table 47. CMC Status Screens

Severity | Message | Cause
Critical | The CMC <number> battery failed. | CMC CMOS battery is missing or has no voltage.
Critical | CMC <number> LAN heartbeat was lost. | The CMC NIC connection has been removed or is not connected.
Warning | A firmware or software incompatibility detected between iDRAC in slot <number> and CMC. | Firmware between the two devices does not match in order to support one or more features.
Warning | A firmware or software incompatibility detected between system BIOS in slot <number> and CMC. | Firmware between the two devices does not match in order to support one or more features.
Warning | A firmware or software incompatibility detected between CMC 1 and CMC 2. | Firmware between the two devices does not match in order to support one or more features.

Table 48. Enclosure/Chassis Status Screen

Severity | Message | Cause
Critical | Fan <number> is removed. | This fan is required for proper cooling of the enclosure/chassis.
Warning | Power supply redundancy is degraded. | One or more PSU have failed or removed and the system can no longer support full PSU redundancy.
Critical | Power supply redundancy is lost. | One or more PSU have failed or removed and the system is no longer redundant.
Critical | The power supplies are not redundant. Insufficient resources to maintain normal operations. | One or more PSU have failed or removed and the system lacks sufficient power to maintain normal operations. This could cause servers to power down.
Warning | The control panel ambient temperature is greater than the upper warning threshold. | Chassis/Enclosure intake temperature exceeded the warning threshold.
Critical | The control panel ambient temperature is greater than the upper warning threshold. | Chassis/Enclosure intake temperature exceeded the warning threshold.
Critical | CMC redundancy is lost. | CMC no longer redundant. This happens if the standby CMC is removed.
Critical | All event logging is disabled. | The Chassis/Enclosure cannot store events to the logs. This usually indicates a problem with the control panel or control panel cable.
Warning | Log is full. | Chassis has detected that only one more entry can be added to the CEL (hardware log) before it is full.
Warning | Log is almost full. | Chassis event log is 75% full.
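The two log-capacity rows in Table 48 describe simple threshold conditions: "Log is almost full" at 75% usage, and "Log is full" when only one more entry fits. The sketch below expresses those two conditions directly. The function name, the entry counts, and the exact comparison operators are assumptions; the firmware's actual rule is not documented here.

```python
def log_capacity_message(entries_used: int, capacity: int):
    """Return (severity, message) for the chassis event log, or None if no alert applies."""
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    free_entries = capacity - entries_used
    if free_entries <= 1:
        # Only one more entry (or none) can be added before the log is full.
        return ("Warning", "Log is full.")
    if entries_used / capacity >= 0.75:
        # The log has reached 75% of its capacity.
        return ("Warning", "Log is almost full.")
    return None
```

With a hypothetical capacity of 100 entries, 75 used entries produce the "almost full" warning and 99 used entries produce the "full" warning.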

Table 49. Fan Status Screens

Severity | Message | Cause
Critical | Fan <number> RPM is operating less than the lower critical threshold. | The speed of the specified fan is not sufficient to provide enough cooling to the system.
Critical | Fan <number> RPM is operating greater than the upper critical threshold. | The speed of the specified fan is too high, usually due to a broken fan blade.

Table 50. IOM Status Screens

Severity | Message | Cause
Warning | A fabric mismatch detected on I/O module <number>. | The IO module fabric does not match that of the server or the redundant I/O module.
Warning | A link tuning failure detected on I/O module <number>. | The IO module could not be set to correctly use the NIC on one or more servers.
Critical | A failure is detected on I/O module <number>. | The I/O module has a fault. The same error can also happen if the I/O module is thermal-tripped.

Table 51. iKVM Status Screen

Severity | Message | Cause
Warning | Console is not available for Local KVM. | Minor failure, such as corrupted firmware.
Critical | Local KVM can not detect any hosts. | USB host enumeration failure.
Critical | OSCAR, on screen display is not functional for the Local KVM. | OSCAR failure.
Non-Recoverable | Local KVM is not functional, and is powered off. | Serial RIP failure or USB host chip failure.

Table 52. PSU Status Screens

Severity | Message | Cause
Critical | Power supply <number> failed. | The PSU has failed.
Critical | The power input for power supply <number> is lost. | Loss of AC power or AC cord unplugged.
Warning | Power supply <number> is operating at 110 volts, and could cause a circuit breaker fault. | Power supply is plugged into a 110 volt source.

Table 53. Server Status Screen

Severity | Message | Cause
Warning | The system board ambient temperature is less than the lower warning threshold. | Server temperature is getting cool.
Critical | The system board ambient temperature is less than the lower critical threshold. | Server temperature is getting cold.
Warning | The system board ambient temperature is greater than the upper warning threshold. | Server temperature is getting warm.
Critical | The system board ambient temperature is greater than the upper critical threshold. | Server temperature is getting too hot.
Critical | The system board Current Latch current is outside of the allowable range. | Current crossed a failing threshold.
Critical | The system board battery failed. | CMOS battery is not present or has no voltage.
Warning | The storage battery is low. | ROMB battery is low.
Critical | The storage battery failed. | CMOS battery is not present or has no voltage.
Critical | The CPU <number> <voltage sensor name> voltage is outside of the allowable range. |
Critical | The system board <voltage sensor name> voltage is outside of the allowable range. |
Critical | The mezzanine card <number> <voltage sensor name> voltage is outside of the allowable range. |
Critical | The storage <voltage sensor name> voltage is outside of the allowable range. |
Critical | CPU <number> has an internal error (IERR). | CPU failure.
Critical | CPU <number> has a thermal trip (overtemperature) event. | CPU overheated.
Critical | CPU <number> configuration is unsupported. | Incorrect processor type or in wrong location.
Critical | CPU <number> is absent. | Required CPU is missing or not present.
Critical | Mezz B<slot number> Status: Add-in Card sensor for Mezz B<slot number>, install error was asserted. | Incorrect Mezzanine card installed for IO fabric.
Critical | Mezz C<slot number> Status: Add-in Card sensor for Mezz C<slot number>, install error was asserted. | Incorrect Mezzanine card installed for IO fabric.
Critical | Drive <number> is removed. | Storage Drive was removed.
Critical | Fault detected on Drive <number>. | Storage Drive failed.
Critical | The system board fail-safe voltage is outside of the allowable range. | This event is generated when the system board voltages are not at normal levels.
Critical | The watchdog timer expired. | The iDRAC watchdog timer expires and no action is set.
Critical | The watchdog timer reset the system. | The iDRAC watchdog detected that the system has crashed (timer expired because no response was received from Host) and the action is set to reboot.
Critical | The watchdog timer powered off the system. | The iDRAC watchdog detected that the system has crashed (timer expired because no response was received from Host) and the action is set to power off.
Critical | The watchdog timer power cycled the system. | The iDRAC watchdog detected that the system has crashed (timer expired because no response was received from Host) and the action is set to power cycle.
Critical | Log is full. | The SEL device detects that only one entry can be added to the SEL before it is full.
Warning | Persistent correctable memory errors detected on a memory device at location <location>. | Correctable ECC errors reach a critical rate.
Warning | Persistent correctable memory error rate has increased for a memory device at location <location>. |
Critical | Multi-bit memory errors detected on a memory device at location <location>. | An uncorrectable ECC error was detected.
Critical | An I/O channel check NMI was detected on a component at bus <number> device <number> function <number>. | A critical interrupt is generated in the I/O Channel.
Critical | An I/O channel check NMI was detected on a component at slot <number>. | A critical interrupt is generated in the I/O Channel.
Critical | A PCI parity error was detected on a component at bus <number> device <number> function <number>. | Parity error was detected on the PCI bus.
Critical | A PCI parity error was detected on a component at slot <number>. | Parity error was detected on the PCI bus.
Critical | A PCI system error was detected on a component at bus <number> device <number> function <number>. | PCI error detected by device.
Critical | A PCI system error was detected on a component at slot <number>. | PCI error detected by device.
Critical | Persistent correctable memory error logging disabled for a memory device at location <location>. | Single bit error logging is disabled when too many SBE get logged for a memory device.
Critical | All event logging is disabled. |
Non-Recoverable | CPU protocol error detected. | The processor protocol entered a non-recoverable state.
Non-Recoverable | CPU bus parity error detected. | The processor bus PERR entered a non-recoverable state.
Non-Recoverable | CPU initialization error detected. | The processor initialization entered a non-recoverable state.
Non-Recoverable | CPU machine check detected. | The processor machine check entered a non-recoverable state.
Critical | Memory redundancy is lost. |
Critical | A bus fatal error was detected on a component at bus <number> device <number> function <number>. | Fatal error is detected on the PCIe bus.
Critical | A software NMI was detected on a component at bus <number> device <number> function <number>. | Chip error is detected.
Critical | Failed to program virtual MAC address on a component at bus <number> device <number> function <number>. | Flex address could not be programmed for this device.
Critical | Device option ROM on mezzanine card <number> failed to support Link Tuning or FlexAddress. | Option ROM does not support Flex address or link tuning.
Critical | Failed to get Link Tuning or FlexAddress data from iDRAC. |

NOTE: For information on other server related LCD messages, see "Server User Guide".
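The four ambient-temperature rows in Table 53 follow a common sensor pattern: two warning thresholds nested inside two critical thresholds. The sketch below shows how a reading could be mapped to the documented message text. The class, the function, and above all the threshold values in `EXAMPLE` are hypothetical; actual thresholds are defined by the server hardware.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class AmbientThresholds:
    lower_critical: float
    lower_warning: float
    upper_warning: float
    upper_critical: float


PREFIX = "The system board ambient temperature is "


def classify_ambient(temp_c: float, t: AmbientThresholds) -> Optional[Tuple[str, str]]:
    """Return (severity, message) for a reading, or None when it is within range."""
    # Check the critical bounds first so they take precedence over warnings.
    if temp_c < t.lower_critical:
        return ("Critical", PREFIX + "less than the lower critical threshold.")
    if temp_c > t.upper_critical:
        return ("Critical", PREFIX + "greater than the upper critical threshold.")
    if temp_c < t.lower_warning:
        return ("Warning", PREFIX + "less than the lower warning threshold.")
    if temp_c > t.upper_warning:
        return ("Warning", PREFIX + "greater than the upper warning threshold.")
    return None


# Illustrative values in degrees Celsius only; not taken from any Dell specification.
EXAMPLE = AmbientThresholds(lower_critical=3, lower_warning=8,
                            upper_warning=42, upper_critical=47)
```

With these made-up thresholds, a reading of 45 produces the upper warning message and a reading of 50 produces the upper critical message.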

LCD Module and Server Status Information

The tables in this section describe status items that are displayed on the front panel LCD for each type of component in the chassis.

Table 54. CMC Status

Item | Description
Example: CMC1, CMC2 | Name or Location.
No Errors | If there are no errors then the message “No Errors” is displayed, else error messages are listed. Critical errors are listed first, followed by warnings.
Firmware Version | Only displays on an active CMC. Displays Standby for the standby CMC.
IP4 <enabled, disabled> | Displays current IPv4 enabled state only on an active CMC.
IP4 Address: <address, acquiring> | Only displays if IPv4 is enabled only on an active CMC.
IP6 <enabled, disabled> | Displays current IPv6 enabled state only on an active CMC.
IP6 Local Address: <address> | Only displays if IPv6 is enabled only on an active CMC.
IP6 Global Address: <address> | Only displays if IPv6 is enabled only on an active CMC.
MAC: <address> | Displays the CMC's MAC address.
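The status tables state that critical errors are listed first, followed by warnings, and that "No Errors" is shown when there is nothing to report. A stable sort on a severity rank reproduces that ordering. In this sketch the rank table and function name are hypothetical, and placing Non-Recoverable ahead of Critical is an assumption, since the text does not say where that severity is listed.

```python
# Lower rank is displayed first. The position of Non-Recoverable is assumed.
SEVERITY_RANK = {"Non-Recoverable": 0, "Critical": 1, "Warning": 2}


def order_for_display(messages):
    """Order (severity, text) pairs as a status screen would list them."""
    if not messages:
        return ["No Errors"]
    # sorted() is stable, so messages of equal severity keep their original order.
    ranked = sorted(messages, key=lambda item: SEVERITY_RANK[item[0]])
    return [text for _severity, text in ranked]
```

Given two warnings and one critical message, the critical message moves to the top and the warnings keep their relative order.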

Table 55. Chassis or Enclosure Status

Item | Description
User Define Name | Example: “Dell Rack System”. You can set this option through the CMC Command Line Interface (CLI) or Web interface.
Error Messages | If there are no errors then the message “No Errors” is displayed, else error messages are listed. Critical errors are listed first, followed by warnings.
Model Number | Example: "PowerEdgeM1000e".
Power Consumption | Current power consumption in watts.
Peak Power | Peak power consumed in watts.
Minimum Power | Minimum power consumed in watts.
Ambient Temperature | Current ambient temperature in degrees Celsius.
Service Tag | The factory-assigned service tag.
CMC redundancy mode | Non-Redundant or Redundant.
PSU redundancy mode | Non-Redundant, AC Redundant, or DC Redundant.

Table 56. Fan Status

Item | Description
Name/Location | Example: Fan1, Fan2, and so on.
Error Messages | If no error then "No Errors" is shown; otherwise error messages are listed, critical errors first, then warnings.
RPM | Current fan speed in RPM.

Table 57. PSU Status

Item | Description
Name/Location | Example: PSU1, PSU2, and so on.
Error Messages | If there are no errors then the message “No Errors” is displayed, else error messages are listed. Critical errors are listed first, followed by warnings.
Status | Offline, Online, or Standby.
Maximum Wattage | Maximum Wattage that PSU can supply to the system.

Table 58. IOM Status

Item | Description
Name/Location | Example: IOM A1, IOM B1, and so on.
Error Messages | If there are no errors then the message “No Errors” is displayed, else error messages are listed. Critical errors are listed first, followed by warnings. For more information see LCD Error Messages.
Status | Off or On.
Model | Model of the IOM.
Fabric Type | Networking type.
IP address | Only shows if IOM is On. This value is zero for a pass through type IOM.
Service Tag | The factory-assigned service tag.

Table 59. iKVM Status

Item | Description
Name | iKVM.
No Error | If there are no errors then the message “No Errors” is displayed, else error messages are listed. Critical errors are listed first, followed by warnings. For more information see LCD Error Messages.
Status | Off or On.
Model/Manufacture | A description of the iKVM model.
Service Tag | The factory-assigned service tag.
Part Number | The Manufacturer part number.
Firmware Version | iKVM firmware version.
Hardware Version | iKVM hardware version.

NOTE: This information is dynamically updated.

Table 60. Server Status

Item | Description
Example: Server 1, Server 2, etc. | Name/Location.
No Errors | If there are no errors then the message “No Errors” is displayed, else error messages are listed. Critical errors are listed first, followed by warnings. For more information see LCD Error Messages.
Slot Name | Chassis slot name. For example, SLOT-01. NOTE: You can set this table through the CMC CLI or Web interface.
Name | Name of the server, which the user can set through Dell OpenManage. The name is displayed only if iDRAC has finished booting, and the server supports this feature, else iDRAC booting messages are displayed.
Model Number | Displays if iDRAC finished booting.
Service Tag | Displays if iDRAC finished booting.
BIOS Version | Server BIOS firmware version.
Last POST Code | Displays the last server BIOS POST code messages string.
iDRAC Firmware Version | Displays if iDRAC finished booting. NOTE: iDRAC version 1.01 is displayed as 1.1. There is no iDRAC version 1.10.
IP4 <enabled, disabled> | Displays the current IPv4 enabled state.
IP4 Address: <address, acquiring> | Only displays if IPv4 is enabled.
IP6 <enabled, disabled> | Only displays if iDRAC supports IPv6. Displays current IPv6 enabled state.
IP6 Local Address: <address> | Only displays if iDRAC supports IPv6 and IPv6 is enabled.
IP6 Global Address: <address> | Only displays if iDRAC supports IPv6 and IPv6 is enabled.
FlexAddress enabled on Fabrics | Only displays if the feature is installed. Lists the fabrics enabled for this server (that is, A, B, C).

The information in the table is dynamically updated. If the server does not support this feature, then the following information does not appear, else Server Administrator options are as follows:

• Option “None” = No strings must be displayed on the LCD.

• Option “Default” = No Effect.

• Option “Custom” = Allows you to enter a string name for the server.


The information is displayed only if iDRAC has completed booting. For more information on this feature, see the Chassis Management Controller for Dell PowerEdge M1000e RACADM Command Line Reference Guide at dell.com/support/manuals.
