HP Smart Array 6400 Series Controller Support Guide


B Physical Disk Installation and Replacement

This appendix discusses the procedure for replacing physical disks in an array. This appendix addresses the following topics:

• “Overview”

• “Physical Disk Failure”

• “Compromised Fault Tolerance”

• “Automatic Data Recovery”

• “Physical Disk Replacement Overview”

• “Physical Disk Failure During Rebuild”

Overview

Each SCSI channel on a Smart Array Controller can support up to 14 physical disks. Disks can be Ultra320 or Ultra160.

Each physical disk on a SCSI bus must have a unique ID value from 0 to 15 (except ID 7, which is reserved for controller use). This value is set automatically on hot-pluggable disk drives in the storage systems that are supported by the Smart Array 6400 Series Controller.
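The ID rules above can be sketched as a small check. This is only an illustration for manually configured buses, since hot-pluggable drives receive their IDs automatically; the function name `check_channel_ids` and the message wording are hypothetical and not part of any HP tool.

```python
CONTROLLER_ID = 7           # reserved for controller use, per the text above
MAX_DISKS_PER_CHANNEL = 14  # per-channel limit stated in the text above


def check_channel_ids(ids):
    """Return a list of problems found in the SCSI IDs assigned on one channel."""
    problems = []
    if len(ids) > MAX_DISKS_PER_CHANNEL:
        problems.append(
            f"{len(ids)} disks exceed the limit of {MAX_DISKS_PER_CHANNEL}"
        )
    seen = set()
    for scsi_id in ids:
        if not 0 <= scsi_id <= 15:
            problems.append(f"ID {scsi_id} is outside the range 0 to 15")
        elif scsi_id == CONTROLLER_ID:
            problems.append(f"ID {scsi_id} is reserved for the controller")
        elif scsi_id in seen:
            problems.append(f"ID {scsi_id} is used more than once")
        seen.add(scsi_id)
    return problems
```

An empty list means the assignment satisfies the three rules; any other result names the offending ID.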

When replacing disk drives, consider the following:

• Do not terminate the disk drives. HP servers and internal cabling provide the required termination of the SCSI bus.

• Do not use disk drives with different capacities in the same array. The excess capacity of larger disk drives cannot be used by the array and is wasted.

• Do not use hot-pluggable and non-hot-pluggable disk drives on the same SCSI bus.

Physical Disk Failure

When a physical disk fails, the logical drive it belongs to is affected. Each logical drive connected to a Smart Array Controller can be configured with a different RAID level. Logical drives can be affected differently by a physical disk failure, depending on their configured RAID level.

The effects of physical disk failure for each RAID level are:

RAID 0 Cannot tolerate disk drive failure. If any physical disk in the array fails, the logical drive also fails.

RAID 1 Tolerates one physical disk failure.

RAID 1+0 Tolerates multiple physical disk failures if no failed disks are mirrored to one another.

RAID 5 Tolerates one physical disk failure.

RAID ADG Tolerates simultaneous failure of two physical disks.

If more physical disks fail than the RAID level supports, fault tolerance is compromised and the logical drive fails. All requests from the operating system are rejected with unrecoverable errors.

To recover from this situation, see “Compromised Fault Tolerance” (page 70).
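The per-level rules above can be expressed as a short decision function. This is a minimal sketch of the stated rules only, not controller firmware logic; the function name, the disk labels, and the `mirror_pairs` argument are hypothetical names introduced for illustration.

```python
def logical_drive_survives(raid_level, failed_disks, mirror_pairs=()):
    """Return True if the logical drive stays available after the given failures.

    raid_level   -- one of "0", "1", "1+0", "5", "ADG"
    failed_disks -- collection of failed disk labels (hypothetical, e.g. "1:0")
    mirror_pairs -- RAID 1+0 only: iterable of (disk, mirror) label tuples
    """
    failed = set(failed_disks)
    if raid_level == "0":
        # No fault tolerance: any failure takes the logical drive down.
        return len(failed) == 0
    if raid_level in ("1", "5"):
        return len(failed) <= 1
    if raid_level == "ADG":
        return len(failed) <= 2
    if raid_level == "1+0":
        # Survives as long as no disk and its mirror have both failed.
        return not any(a in failed and b in failed for a, b in mirror_pairs)
    raise ValueError(f"unknown RAID level: {raid_level!r}")
```

For example, with mirror pairs `("1:0", "1:1")` and `("1:2", "1:3")`, losing `1:0` and `1:2` leaves a RAID 1+0 drive available, while losing `1:0` and `1:1` does not.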

Recognizing Disk Failure

The LEDs on the front of each physical disk are visible through the front of the StorageWorks disk and MSA 30 enclosures. When a physical disk is configured as part of an array and attached to a powered-on controller, you can determine the status of the disk from the illumination pattern of the LEDs.

Figure B-1 illustrates a typical set of status LEDs. Table B-1 describes the meanings of the LED combinations.


Figure B-1 Physical Disk Status LED Indicators

Table B-1 Physical Disk Status from LED Illumination Pattern

Activity (1): On, Off, or Flashing; Online (2): On or Off; Fault (3): Flashing

Interpretation: A predictive failure alert has been received for this disk. Replace the disk as soon as possible.

Activity (1): On, Off, or Flashing; Online (2): On; Fault (3): Off

Interpretation: The disk is online and configured as part of an array.

If the array is configured for fault tolerance, all other disks in the array are online, and a predictive failure alert is received or a disk capacity upgrade is in progress, the disk can be replaced online.

Activity (1): On or Flashing; Online (2): Flashing; Fault (3): Off

Interpretation: The disk is rebuilding or undergoing capacity expansion.

Do not remove the disk. Removing a disk may terminate the current operation and cause data loss.

Activity (1): On; Online (2): Off; Fault (3): Off

Interpretation: The disk is being accessed, but one of the following conditions applies:

• It is not configured as part of an array.

• It is a replacement disk and rebuild has not yet started.

• It is spinning up during the POST sequence.

Do not remove the disk.

Activity (1): Flashing; Online (2): Flashing; Fault (3): Flashing

Interpretation: One of the following conditions applies:

• The drive is part of an array being selected by saconfig.

• sautil is upgrading the drive firmware.

Do not remove the disk. Removing the disk can cause data loss in non-fault-tolerant configurations.

Activity (1): Off; Online (2): Off; Fault (3): On

Interpretation: The disk has failed and has been placed offline.

You can replace the disk.

Activity (1): Off; Online (2): Off; Fault (3): Off

Interpretation: One of the following conditions applies:

• It is not configured as part of an array.

• It is part of an array, but it is a replacement drive that is not being accessed or being rebuilt yet.

• It is configured as an online spare.

If the drive is connected to an array controller, you can replace the drive online.
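The LED combinations in Table B-1 lend themselves to a lookup table. The sketch below encodes the seven rows as sets of allowed states; the function name and the short summaries are illustrative abbreviations, not the official wording, and any pattern the table does not list is reported as such rather than guessed.

```python
ON, OFF, FLASHING = "on", "off", "flashing"
ANY = frozenset({ON, OFF, FLASHING})

# Each row: allowed Activity (1), Online (2), Fault (3) states, then a summary.
LED_TABLE = [
    (ANY, {ON, OFF}, {FLASHING},
     "predictive failure alert; replace as soon as possible"),
    (ANY, {ON}, {OFF},
     "online and configured as part of an array"),
    ({ON, FLASHING}, {FLASHING}, {OFF},
     "rebuilding or expanding capacity; do not remove"),
    ({ON}, {OFF}, {OFF},
     "accessed but unconfigured, awaiting rebuild, or spinning up; do not remove"),
    ({FLASHING}, {FLASHING}, {FLASHING},
     "selected by saconfig or firmware upgrade by sautil; do not remove"),
    ({OFF}, {OFF}, {ON},
     "failed and offline; can be replaced"),
    ({OFF}, {OFF}, {OFF},
     "unconfigured, idle replacement, or online spare"),
]


def interpret_leds(activity, online, fault):
    """Return the Table B-1 summary for one LED pattern."""
    for act, onl, flt, meaning in LED_TABLE:
        if activity in act and online in onl and fault in flt:
            return meaning
    return "pattern not listed in Table B-1"
```

Calling `interpret_leds("off", "off", "on")` returns the failed-disk summary, which is the only pattern in the table that states outright that the disk has failed.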

Other ways to recognize that a physical disk has failed are as follows:

• The amber LED lights up on the front of supported StorageWorks disk enclosures if failed drives are inside.

NOTE: Other problems, such as fan failure, redundant power supply failure, or over-temperature conditions, also cause this LED to light up.

• Event Monitoring Services (EMS) sends an alert message when a physical or logical drive failure occurs. For more information, see “Event Monitoring Service” (page 55).

Use the sautil <device_file> command to confirm physical disk failures.

The LOGICAL DRIVE SUMMARY section of the sautil <device_file> command output lists the status of all logical drives known to the RAID firmware.

The SCSI DEVICE SUMMARY section of the sautil <device_file> command output lists all configured disks and all unassigned disks known to the RAID firmware.

The LOGICAL DRIVE sections of the sautil <device_file> command output provide additional information on each logical drive.

For example, in the following sautil <device_file> command output excerpt, spare disk 1:2 is being substituted for failed disk 1:1, which is why the logical drive is in the RECOVERING state.

---- LOGICAL DRIVE SUMMARY ---------------------------------------------------
#  RAID  Size      Status
0  1+0   17361 MB  RECOVERING

---- SCSI DEVICE SUMMARY -----------------------------------------------------
Location  Ch  ID  Type       Capacity  Status
external  1   0   DISK       18.2 GB   OK
N/A       1   1   N/A        N/A       FAILED
external  1   2   DISK       18.2 GB   SPARE (activated)
external  1   3   DISK       18.2 GB   UNASSIGNED
external  1   7   PROCESSOR

---- LOGICAL DRIVE 0 ---------------------------------------------------------
Logical Drive Device File........... c4t0d0
Fault Tolerance Mode................ RAID 1+0 (Disk Mirroring)
Logical Drive Size.................. 17361 MB
Logical Drive Status................ RECOVERING
# of Participating Physical Disks... 2
Participating Physical Disk(s)...... Ch:ID
                                     1: 0
                                     1: 1 <-- NOT RESPONDING
Participating Spare Disk(s)......... Ch:ID
                                     1: 2 <-- activated for 1:1
Stripe Size......................... 64 KB
Logical Drive Cache Status.......... cache enabled
Configuration Signature............. 0xA9848C3B
Media Exchange Detected?............ no

For more information about the sautil command, see “The sautil Command” (page 56).
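When many controllers must be checked, the SCSI DEVICE SUMMARY section can be scanned with a script. The following is a minimal sketch that assumes the output has already been captured as text and that it follows the layout of the excerpt above (whitespace-separated columns, status last); the function name `failed_disks` is hypothetical and the parser is not an HP-supplied tool.

```python
def failed_disks(sautil_output):
    """Return "Ch:ID" strings for disks marked FAILED in the SCSI DEVICE SUMMARY."""
    failed = []
    in_summary = False
    for line in sautil_output.splitlines():
        stripped = line.strip()
        if stripped.startswith("----"):
            # Section headers look like "---- NAME ----"; track the current one.
            in_summary = "SCSI DEVICE SUMMARY" in stripped
            continue
        fields = stripped.split()
        # Device rows have numeric Ch and ID in the second and third columns;
        # this also skips the column-heading row.
        if (in_summary and len(fields) >= 4
                and fields[1].isdigit() and fields[2].isdigit()):
            if fields[-1] == "FAILED":
                failed.append(f"{fields[1]}:{fields[2]}")
    return failed


# Sample text copied from the excerpt above (abbreviated).
SAMPLE = """\
---- SCSI DEVICE SUMMARY -----------------------------------------------------
Location Ch ID Type Capacity Status
external 1 0 DISK 18.2 GB OK
N/A 1 1 N/A N/A FAILED
external 1 2 DISK 18.2 GB SPARE (activated)
---- LOGICAL DRIVE 0 ---------------------------------------------------------
Logical Drive Status................ RECOVERING
"""

print(failed_disks(SAMPLE))  # prints ['1:1']
```

Splitting on whitespace rather than fixed column offsets keeps the sketch tolerant of spacing differences, at the cost of relying on the status being the last field of each device row.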
