SGI® Altix® ICE 8000 System User`s Guide

SGI® Altix® ICE 8000 System User`s Guide
SGI® Altix® ICE 8000 System
User’s Guide
Document Number 007-4986-001
COPYRIGHT
© 2007 SGI. All rights reserved; provided portions may be copyright in third parties, as indicated elsewhere herein. No permission is granted to copy, distribute,
or create derivative works from the contents of this electronic documentation in any manner, in whole or in part, without the prior written permission of SGI.
LIMITED RIGHTS LEGEND
The software described in this document is "commercial computer software" provided with restricted rights (except as to included open/free source) as specified
in the FAR 52.227-19 and/or the DFAR 227.7202, or successive sections. Use beyond license provisions is a violation of worldwide intellectual property laws,
treaties and conventions. This document is provided with limited rights as defined in 52.227-14.
The electronic (software) version of this document was developed at private expense; if acquired under an agreement with the USA government or any
contractor thereto, it is acquired as “commercial computer software” subject to the provisions of its applicable license agreement, as specified in (a) 48 CFR
12.212 of the FAR; or, if acquired for Department of Defense units, (b) 48 CFR 227-7202 of the DoD FAR Supplement; or sections succeeding thereto.
Contractor/manufacturer is SGI, 1140 E. Arques Avenue, Sunnyvale, CA 94085.
TRADEMARKS AND ATTRIBUTIONS
Altix, SGI, and the SGI logo are registered trademarks of SGI, in the United States and/or other countries worldwide.
Intel, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. UNIX is a
registered trademark in the United States and other countries, licensed exclusively through X/Open Company, Ltd. Infiniband is a trademark of the InfiniBand
Trade Association. Voltaire is a registered trademark of Voltaire Inc.
Linux is a registered trademark of Linus Torvalds.
All other trademarks mentioned herein are the property of their respective owners.
Record of Revision
Version
Description
-001
July, 2007
First Release
007-4986-001
iii
Contents
Contents
1.
List of Figures .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
List of Tables
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Audience.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Important Information .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Chapter Descriptions
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Related Publications .
Conventions .
. ix
.
.
.
. xi
.
.
.
. xiii
.
.
.
.
. xiii
.
.
.
.
. xiv
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. xv
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.xvii
Product Support .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.xvii
Reader Comments .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
xviii
Operation Procedures .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
1
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
1
ESD Precaution .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
1
Safety Precautions .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
2
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
3
Console Connections
007-4986-001
.
.
Precautions .
2.
.
.
.
Powering the System On and Off
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
4
Preparing to Power On .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
5
Powering On and Off .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
8
Console Management Power (cpower) Commands
.
.
.
.
.
.
.
.
.
.
.
.
8
.
.
. 11
Monitoring Your Server .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
System Management .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 13
Using the 1U Console Option
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 14
Levels of System and Chassis Control .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 15
v
Contents
Chassis Controller Interaction
3.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 15
Chassis Manager Interconnects .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 16
Chassis Management Control (CMC) Functions
.
.
.
.
.
.
.
.
.
.
.
.
.
. 17
Chassis Management Control Front Panel Display (Future Features) .
.
.
.
.
.
.
.
. 17
System Power Status .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 18
System Overview
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 19
System Models
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 20
System Architecture .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 22
Memory Controller HUB
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 22
System Bus Interface
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 22
Memory Control Sub-system .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 22
Memory DIMM Subsystem
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 23
ESB-2 I/O Controller .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 23
System Features and Major Components
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 25
.
Modularity and Scalability .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 25
Administrative Server .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 26
Rack Leader Controller .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 26
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 27
Login Server Function .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 27
Batch Server Node Function
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 27
Multiple Chassis Manager Connections
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 28
Reliability, Availability, and Serviceability (RAS) .
.
.
.
.
.
.
.
.
.
.
.
.
. 30
Service Nodes
.
System Components .
4.
vi
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 31
Bay (Unit) Numbering .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 32
Rack Numbering.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 33
Optional System Components .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 33
.
.
Rack Information
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 35
Overview .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 35
Altix ICE 8000 Rack (42U) .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 36
Technical Specifications .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 39
.
.
.
007-4986-001
Contents
5.
6.
A.
ICE Administration/Controller Modules .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 41
Overview
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 41
Administrative/Controller Servers .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 42
Rack Leader Controller Server .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 44
.
.
Basic Troubleshooting .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 45
Troubleshooting Chart .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 46
LED Status Indicators
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 47
IRU Power Supply LEDs .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 47
Compute/Memory Blade LEDs.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 48
Technical Specifications and Pinouts .
System-level Specifications .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 49
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 49
Physical and Power Specifications .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 50
Environmental Specifications
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 51
I/O Port Specifications .
B.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 52
Ethernet Port
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 53
Serial Ports .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 54
Safety Information and Regulatory Specifications .
.
.
.
.
.
.
.
.
.
.
.
.
. 57
Safety Information .
.
.
.
.
.
.
.
.
.
.
.
.
. 57
.
.
.
.
.
.
.
.
Regulatory Specifications
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 59
CMN Number .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 59
CE Notice and Manufacturer’s Declaration of Conformity
.
.
.
.
.
.
.
.
.
.
. 59
Electromagnetic Emissions .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 60
FCC Notice (USA Only) .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 60
Industry Canada Notice (Canada Only)
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 61
VCCI Notice (Japan Only).
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 61
Chinese Class A Regulatory Notice
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 61
.
Korean Class A Regulatory Notice
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 61
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 62
Electrostatic Discharge .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 62
Laser Compliance Statements .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 63
Lithium Battery Statements.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 64
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 65
Shielded Cables .
Index
007-4986-001
.
.
.
.
.
.
.
.
.
vii
List of Figures
List of Figures
007-4986-001
Figure 1-1
Flat Panel Rackmount Console Option .
.
.
.
.
3
Figure 1-2
Administrative Controller Video Console Connection Example
.
.
.
4
Figure 1-3
IRU Power Supply Cable Location Example
.
.
.
.
.
.
.
5
Figure 1-4
Single-Phase Rack PDU Circuit Breaker Switches .
.
.
.
.
.
.
6
Figure 1-5
Three-Phase PDU Example .
.
.
.
.
.
.
.
7
Figure 1-6
IRU Chassis Management Board Location Example
.
.
.
.
.
. 12
Figure 2-1
SGI Altix ICE 8000 System Network Access (Example)
.
.
.
.
. 14
Figure 2-2
Chassis Manager Interconnection Diagram Example
.
.
.
.
.
. 16
Figure 2-3
Chassis Management Control Interface Front Panel .
.
.
.
.
.
. 17
Figure 3-1
SGI Altix ICE 8000 Series System (Single Rack) .
.
.
.
.
.
. 20
Figure 3-2
IRU and Rack Components Example
.
.
.
.
.
. 21
Figure 3-3
Functional Block Diagram of the Individual Rack Unit (IRU) .
.
.
. 24
Figure 3-4
Administration and Rack Leader Control Cabling to Chassis Managers
. 29
Figure 3-5
Altix ICE 8000 IRU System Components Example .
. 32
Figure 4-1
Altix ICE 8000 Rack Example .
.
.
.
.
.
.
.
.
.
. 37
Figure 4-2
Tall (42U) Altix Rack and Front Lock .
.
.
.
.
.
.
.
.
. 38
Figure 5-1
ICE System Administration Hierarchy Example Block Diagram
.
.
. 42
Figure 5-2
Administrative/Controller Server Control Panel Diagram .
.
.
.
. 43
Figure 5-3
1U Administration/Controller Server Front and Rear Panel .
.
.
.
. 44
Figure 6-1
Compute Blade Status LED Locations .
.
.
.
.
.
.
.
.
. 48
Figure A-1
Ethernet Port .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 53
Figure A-2
Serial Port Connector.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 54
Figure B-1
VCCI Notice (Japan Only) .
.
.
.
.
.
.
.
.
.
.
.
. 61
Figure B-2
Chinese Class A Regulatory Notice .
.
.
.
.
.
.
.
.
.
. 61
Figure B-3
Korean Class A Regulatory Notice .
.
.
.
.
.
.
.
.
.
. 61
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
ix
List of Tables
List of Tables
007-4986-001
Table 1-1
cpower option descriptions .
.
.
.
.
.
.
.
.
.
.
.
.
8
Table 1-2
cpower example command strings .
.
.
.
.
.
.
.
.
.
.
9
Table 4-1
Tall Altix Rack Technical Specifications
.
.
.
.
.
.
.
.
. 39
Table 5-1
System administrative server control panel functions
.
.
.
.
.
. 43
Table 6-1T
roubleshooting Chart .
.
.
.
.
.
.
.
.
.
.
.
.
.
. 46
Table 6-2
Power Supply LED States
.
.
.
.
.
.
.
.
.
.
.
.
. 47
Table A-1
Altix ICE 8000 System Configuration Ranges .
.
.
.
.
.
.
. 49
Table A-2
Altix ICE 8000 Physical Specifications .
.
.
.
.
.
.
.
.
. 50
Table A-3
Environmental Specifications
.
.
.
.
.
.
.
.
.
.
.
. 51
Table A-4
Ethernet Pinouts .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 53
Table A-5
Serial Port Pinout.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. 55
xi
About This Guide
This guide provides an overview of the architecture, general operation and descriptions of the
major components that compose the SGI® Altix® ICE 8000 blade systems. It also provides the
standard procedures for powering on and powering off the system, basic troubleshooting
information, and important safety and regulatory specifications.
Audience
This guide is written for owners, system administrators, and users of SGI Altix ICE 8000
computer systems.
It is written with the assumption that the reader has a good working knowledge of computers and
computer systems.
Important Information
Warning: To avoid problems that could void your warranty, your SGI or other approved
system support engineer (SSE) should perform all the set up, addition, or replacement of
parts, cabling, and service of your SGI Altix ICE 8000 system, with the exception of the
following items that you can perform yourself:
007-4986-001
•
Using your system console or network access workstation to enter commands and perform
system functions such as powering on and powering off, as described in this guide.
•
Adding and replacing disk drives in optional storage modules used with your system and
using the ESI/ops panel (operating panel) on optional mass storage.
xiii
About This Guide
Chapter Descriptions
The following topics are covered in this guide:
xiv
•
Chapter 1, “Operation Procedures,” provides instructions for powering on and powering off
your system.
•
Chapter 2, “System Management,” describes the function of the chassis management
controllers (CMC) and provides overview instructions for operating the controllers.
•
Chapter 3, “System Overview,” provides environmental and technical information needed to
properly set up and configure the blade systems.
•
Chapter 4, “Rack Information,” describes the system’s rack features.
•
Chapter 5, “ICE Administration/Controller Modules” describes all the controls, connectors
and LEDs located on the front of the stand-alone administrative, rack leader and other
support server nodes. An outline of the server functions is also provided.
•
Chapter 6, “Basic Troubleshooting,” provides recommended actions if problems occur on
your system.
•
Appendix A, “Technical Specifications and Pinouts‚" provides physical, environmental, and
power specifications for your system. Also included are the pinouts for the non-proprietary
connectors.
•
Appendix B, “Safety Information and Regulatory Specifications‚" lists regulatory
information related to use of the blade cluster system in the United States and other
countries. It also provides a list of safety instructions to follow when installing, operating, or
servicing the product.
007-4986-001
Related Publications
Related Publications
The following documents are relevant to and can be used with the Altix ICE 8000 systems:
•
SGI Altix ICE System Administrator's Guide, (P/N 007-4993-00x)
This guide discuses the ICE 8000 system’s general system configuration and software
administration operations on SGI Altix ICE 8000 systems.
•
Superserver 6015B User’s Manual (P/N 860-0473-00x)
This guide discuses the use, maintenance and operation of the 1U server primarily used as the
administrative server and as the rack leader controller (RLC) server. This stand-alone compute
node may be used as an RLC, as a login, or batch server, or other type of support server used with
the Altix ICE 8000 systems.
•
Guide to Programming Environments and Tools Available on SGI Altix XE Systems
(P/N 007-4901-00x)
This guide describes how to use the software tools run with SGI ProPack 5 for Linux operating
systems on Altix XE systems. It explains how to perform general system configuration and
operations and describes programming environments and tools available.
•
Man pages (online)
Man pages locate and print the titled entries from the online reference manuals.
You can obtain SGI documentation, release notes, or man pages in the following ways:
•
See the SGI Technical Publications Library at http://docs.sgi.com. Various formats are
available. This library contains the most recent and most comprehensive set of online books,
release notes, man pages, and other information.
•
The release notes, which contain the latest information about software and documentation in
this release, are in a file named README.SGI in the root directory of the SGI ProPack for
Linux Documentation CD.
•
You can also view man pages by typing man <title> on a command line.
SGI systems include a set of Linux man pages, formatted in the standard UNIX “man page” style.
Important system configuration files and commands are documented on man pages. These are
found online on the internal system disk (or DVD) and are displayed using the man command. For
example, to display a man page, type the request on a command line:
man commandx
007-4986-001
xv
About This Guide
References in the documentation to these pages include the name of the command and the section
number in which the command is found. For additional information about displaying man pages
using the man command, see man(1). In addition, the apropos command locates man pages
based on keywords. For example, to display a list of man pages that describe disks, type the
following on a command line:
apropos disk
For information about setting up and using apropos, see apropos(1).
xvi
007-4986-001
Conventions
Conventions
The following conventions are used throughout this document
:
Convention
Meaning
Command
This fixed-space font denotes literal items such as commands, files,
routines, path names, signals, messages, and programming language
structures.
variable
The italic typeface denotes variable entries and words or concepts being
defined. Italic typeface is also used for book titles.
user input
This bold fixed-space font denotes literal items that the user enters in
interactive sessions. Output is shown in nonbold, fixed-space font.
[]
Brackets enclose optional portions of a command or directive line.
...
Ellipses indicate that a preceding element can be repeated.
man page(x)
Man page section identifiers appear in parentheses after man page names.
GUI element
This font denotes the names of graphical user interface (GUI) elements such
as windows, screens, dialog boxes, menus, toolbars, icons, buttons, boxes,
fields, and lists.
Product Support
SGI provides a comprehensive product support and maintenance program for its products, as
follows:
007-4986-001
•
If you are in North America, contact the Technical Assistance Center at
+1 800 800 4SGI or contact your authorized service provider.
•
If you are outside North America, contact the SGI subsidiary or authorized distributor in
your country.
xvii
About This Guide
Reader Comments
If you have comments about the technical accuracy, content, or organization of this document,
contact SGI. Be sure to include the title and document number of the manual with your comments.
(Online, the document number is located in the front matter of the manual. In printed manuals, the
document number is located at the bottom of each page.)
You can contact SGI in any of the following ways:
•
Send e-mail to the following address: techpubs@sgi.com
•
Contact your customer service representative and ask that an incident be filed in the SGI
incident tracking system.
•
Send mail to the following address:
Technical Publications
SGI
1140 East Arques Avenue, M/S 50-1-946
Sunnyvale, California 94085
SGI values your comments and will respond to them promptly.
xviii
007-4986-001
Chapter 1
1. Operation Procedures
This chapter explains how to operate your new system in the following sections:
•
“Precautions” on page 1
•
“Console Connections” on page 3
•
“Powering the System On and Off” on page 4
•
“Monitoring Your Server” on page 11
Precautions
Before operating your system, familiarize yourself with the safety information in the following
sections:
•
“ESD Precaution” on page 1
•
“Safety Precautions” on page 2
ESD Precaution
Caution: Observe all ESD precautions. Failure to do so can result in damage to the equipment.
Wear an SGI-approved wrist strap when you handle an ESD-sensitive device to eliminate possible
ESD damage to equipment. Connect the wrist strap cord directly to earth ground.
007-4986-001
1
1: Operation Procedures
Safety Precautions
Warning: Before operating or servicing any part of this product, read the “Safety
Information” on page 57.
Danger: Keep fingers and conductive tools away from high-voltage areas. Failure to
follow these precautions will result in serious injury or death. The high-voltage areas of the
system are indicated with high-voltage warning labels.
!
Caution: Power off the system only after the system software has been shut down in an orderly
manner. If you power off the system before you halt the operating system, data may be corrupted.
Warning: If a lithium battery is installed in your system as a soldered part, only qualified
SGI service personnel should replace this lithium battery. For a battery of another type,
replace it only with the same type or an equivalent type recommended by the battery
manufacturer, or an explosion could occur. Discard used batteries according to the
manufacturer’s instructions.
2
007-4986-001
Console Connections
Console Connections
The flat panel console option (see Figure 1-1) has the following listed features:
1.
Slide Release - Move this tab sideways to slide the console out. It locks the drawer closed
when the console is not in use and prevents it from accidentally sliding open.
2. Handle - Used to push and pull the module in and out of the rack.
3. LCD Display Controls - The LCD controls include On/Off buttons and buttons to control
the position and picture settings of the LCD display.
4. Power LED - Illuminates blue when the unit is receiving power.
1
2
3
4
Figure 1-1
007-4986-001
Flat Panel Rackmount Console Option
3
1: Operation Procedures
A console is defined as a connection to the system (to the administration controller or rack leader
controller server) that provides administrative access to the cluster. SGI offers a rackmounted flat
panel console option that attaches to the administrative or rack leader server’s video, keyboard and
mouse connectors.
A console can also be a LAN-attached personal computer, laptop or workstation (RJ45 Ethernet
connection).
In certain limited cases, a dumb terminal could be used. An RS-232 style serial connection can be
used to communicate directly with the administrative controller server. This connection is
typically used for service purposes or for system console access in smaller systems, or where an
external ethernet connection is not used or available.
The flat panel rackmount or other optional VGA console connects to the administration controller
server video and keyboard/mouse connectors as shown in Figure 1-2.
Mouse
Keyboard
Figure 1-2
VGA
port
Administrative Controller Video Console Connection Example
Powering the System On and Off
This section explains how to power on and power off individual rack units, or your entire Altix
ICE system, as follows:
•
“Preparing to Power On” on page 5
•
“Powering On and Off” on page 8
Entering commands from a system console, you can power on and power off individual IRUs,
blade-based nodes, and stand-alone controller servers, or the entire system.
4
007-4986-001
Powering the System On and Off
When using the SGI cluster manager software, you can monitor and manage your server from a
remote location (see the SGI Altix ICE System Administrator’s Guide).
You may also monitor and manage your server with tools such as the Voltaire or Intel message
passing interface (MPI). For details, see the documentation for the particular tool.
Preparing to Power On
To prepare to power on your system, follow these steps:
1.
Check to ensure that the cabling between the rack’s power distribution units (PDUs) and the
wall power-plug receptacle is secure.
2. For each individual IRU that you want to power on, make sure that the power cables are
plugged into all the IRU power supplies correctly, as shown in Figure 1-3. Setting the circuit
breakers on the PDUs to the “On” position will apply power to the IRU and will start the
chassis manager in each IRU. Note that the chassis manager in each IRU stays powered on
as long as there is power coming into the unit. Turn off the PDU breaker switch that supplies
voltage to the IRU if you want to remove all power from the unit.
Figure 1-3
007-4986-001
IRU Power Supply Cable Location Example
5
1: Operation Procedures
3. If you plan to power on a server that includes optional mass storage enclosures, make sure
that the power switch on the rear of each PSU/cooling module (one or two per enclosure) is
in the 1 (on) position.
4. Make sure that all PDU circuit breaker switches (see the examples in Figure 1-4 and
Figure 1-5) are turned on to provide power when the system is booted up.
Power
distribution
unit (PDU)
Power
source
Figure 1-4
Single-Phase Rack PDU Circuit Breaker Switches
Figure 1-5 shows an example of the three-phase PDU.
6
007-4986-001
Powering the System On and Off
Power
distribution
unit (PDU)
Figure 1-5
007-4986-001
Three-Phase PDU Example
7
1: Operation Procedures
Powering On and Off
The power-on and off procedure varies with your system setup. See the SGI Altix ICE System
Administrator's Guide for a more complete description of system commands.
Note: The cpower commands are normally run through the administration controller server. If
you have a terminal connected to an administrative controller server with a serial interface, you
should be able execute these commands.
Console Management Power (cpower) Commands
This section provides an overview of the console management power (cpower) commands for the
SGI Altix ICE system.
The cpower commands allows you to power-up, power-down, reset, and show the power status
of multiple or single system components or individual racks.
The cpower command is, as follows:
cpower <option...> <target_type> <action> <target>
The cpower command accepts the following arguments as described in Table 1-1.
Table 1-1
Argument
cpower option descriptions
Description
Option
8
--noleader
Do not include rack leader nodes. Valid with rack and system domains only.
--noservice
Do not include service nodes.
--ipmi
Uses ipmitool to communicate.
--ssh
Uses ssh to communicate.
--intelplus
Use the “-o intelplus option” for ipmitool [default].
--verbose
Print additional information on command progress.
--noexec
Display but do not execute commands that affect power.
007-4986-001
Powering the System On and Off
Table 1-1 (continued)
Argument
cpower option descriptions
Description
Target_type
--node
Apply the action to a node or nodes. Nodes can be blade compute nodes (inside an IRU),
administration controller server nodes, rack leader controller nodes or service nodes.
--iru
Apply the action at the IRU level
--rack
Apply the action to all components in a rack
--system
Apply the action to the system. You must not specify a target with this type.
--all
Allows the use of wildcards in the target name
Action
--status
Shows the power status of the target [default]
--up | --on
Powers up the target
--down | --off
Powers down the target
--cycle
Power cycles the target
--reboot
Reboot the target, even if it is already booted. Wait for all targets to boot.
--shutdown
Shutdown the target, but do not power it off. Wait for targets to shut down.
--help
Usage and help text
Note: If you include a rack leader controller in your wildcard specification, and a command that
may take it offline, you will see a warning intended to prevent accidental resets of the RLC, as that
could make the rack unreachable.
Table 1-2
007-4986-001
cpower example command strings
Command
Status/result
# cpower --system --up
Powers up all nodes in the system.
# cpower --rack r1
Determines the power status of all nodes in rack 1 (including
the RLC), except CMCs.
9
1: Operation Procedures
Table 1-2 (continued)
cpower example command strings
Command
Status/result
# cpower --all --status r*i0* Provides power status of all IRU0 compute nodes in the
system.
# cpower --all --status "*"
Provides status of every compute node in the system.
# cpower --boot --rack r1
Boots any nodes in rack 1 not already online.
# cpower --system --down
Completely powers down every node in the system. Use only
if you want to shut down all nodes (see the next example).
# cpower --shutdown --system
--noleader --noservice
Shuts down all the IRU compute nodes, but not the
administrative controller server, rack leader controller or
other service nodes.
# cpower --boot r1i0n8
Command tries to specifically boot rack 1, IRU0, node 8.
# cpower --shutdown --off
--rack r1
Using --shutdown and --off together is the only time you can
use more than one command on the cpower command line.
This combination will shutdown, then power off all of the
computer nodes in parallel, then shutdown the rack leader
controller. Use --noleader if you want the RLC to remain on.
See the SGI Altix ICE System Administrator's Guide for more information on cpower commands.
See the section “System Power Status” on page 18 for additional related console information.
10
007-4986-001
Monitoring Your Server
Monitoring Your Server
You can monitor your Altix ICE 8000 server from the following sources:
•
An optional flat panel rackmounted monitor with PS2 keyboard/mouse can be connected to
the administration server node for basic monitoring and administration of the Altix system.
See the section “Console Connections” on page 3 for more information. SLES 10 or higher
is required for this option.
•
You can attach an optional LAN-connected console via secure shell (ssh) to an Ethernet port
adapter on the administration controller server. You will need to connect either a local or
remote workstation/PC to the IP address of the administration controller server to access and
monitor the system via IPMI.
See the Console Management section in the SGI Altix ICE System Administrator's Guide for
more information on the open source console management package.
These console connections enable you to view the status and error messages generated by
your Altix ICE 8000 system. You can also use these consoles to input commands to manage
and monitor your system. See the section “System Power Status” on page 18, for additional
information.
Figure 1-6 on page 12 shows a front view of a system IRU.
Note: The chassis management display shown in Figure 1-6 on page 12 is for future functional
enhancement. Contact your SGI support representative after 10-31-07 for updates.
007-4986-001
11
1: Operation Procedures
Switch blade 0
Power supplies
Chassis manager
Unused
switch blade
slot
Figure 1-6
Switch blade 1
Power supplies
Chassis management
display
Unused
switch blade
slot
IRU Chassis Management Board Location Example
PCI/PCI-X and PCIe based I/O sub-systems are sited in the administrative controller server and
rack leader controller systems used with the system IRUs. These are the primary configurable I/O
system interfaces for the Altix ICE 8000 systems. See the server’s user guide for information on
installing optional I/O cards or other components.
12
007-4986-001
Chapter 2
2. System Management
This chapter describes the interaction and functions of system controllers in the following
sections:
•
“Levels of System and Chassis Control” on page 15
•
“Chassis Management Control (CMC) Functions” on page 17
•
“System Power Status” on page 18
Each IRU has one chassis manager, which is located directly below compute blade slot 0. The
chassis manager supports power-up and power-down of the IRU’s compute/memory node blades
and environmental monitoring of all units within the IRU.
Note that 1U stand-alone servers such as the administration controller, rack leader controller, or
other service nodes and mass storage enclosures do not have a chassis manager.
Figure 2-1 shows an example remote LAN-connected console used to monitor a single-rack Altix
ICE 8000 system.
007-4986-001
13
2: System Management
Remote
workstation
monitor
Local Area Network (LAN)
Altix ICE 8200
Local Area Network (LAN)
Cat-5 Ethernet
Figure 2-1
SGI Altix ICE 8000 System Network Access (Example)
Using the 1U Console Option
The SGI optional 1U console is a rackmountable unit that includes a built-in keyboard/touchpad,
and uses a 17-inch (43-cm) LCD flat panel display of up to 1024 x 768 pixels. The 1U console
attaches to the administrative controller server using PS/2 and HD15M connectors or to a KVM
14
007-4986-001
Levels of System and Chassis Control
switch (not provided by SGI). The 1U console is basically a “dumb” VGA terminal, it cannot be
used as a workstation or loaded with any system administration program.
Note: While the 1U console is normally plugged into the administrative controller server in the
ICE system, it can also be connected to a rack leader controller server in the system for terminal
access purposes.
The 27-pound (12.27-kg) console automatically goes into sleep mode when the monitor cover is
closed down.
Levels of System and Chassis Control
The chassis management control network configuration of your ICE 8000 series machine will
depend on the size of the system and the control options selected. Typically, any system with
multiple IRUs will be interconnected by the chassis managers in each IRU.
Note: Mass storage option enclosures are not monitored by the IRU’s chassis manager. Most
optional mass storage enclosures have their own internal microcontrollers for monitoring and
controlling all elements of the disk array. See the owner’s guide for your mass storage option for
more information on this topic.
Chassis Controller Interaction
In all Altix ICE 8000 clusters all the system chassis management controllers communicate with
each other in the following ways:
007-4986-001
•
All enclosures within a system communicate with each other through their chassis manager
connections (CMC) (note the chassis managers are enlarged for clarity in Figure 2-2).
•
The CMC does the environmental management for an IRU as well as power control, and
provides an ethernet network infrastructure for the management of the system.
15
2: System Management
Chassis Manager Interconnects
The chassis manager in the lower IRU connects to the administration and leader servers, see
Figure 2-2.
Admin node
Leader node
Optional additional
rack connections
Figure 2-2
16
Chassis Manager Interconnection Diagram Example
007-4986-001
Chassis Controller Interaction
Chassis Management Control (CMC) Functions
The following list summarizes the control and monitoring functions that the CMC performs. Most
functions are common across multiple IRUs.
•
Controls voltage margining within the IRU
•
Controls and monitors IRU fan speeds
•
Reads system identification (ID) PROMs
•
Monitors voltage levels and reports failures
•
Monitors the On/Off power sequence
•
Monitors system resets
Chassis Management Control Front Panel Display (Future Features)
Figure 2-3 shows the chassis management controller display panel on the IRU.
CMC interface LCD panel
Figure 2-3
Chassis Management Control Interface Front Panel
Note: The CMC front panel display is a future feature enhancement item. Contact your SGI
support representative after 10-31-07 for information on software or firmware updates.
007-4986-001
17
2: System Management
System Power Status
The cpower command is the main interface for all power management commands. You can
request power status, power-on and power-off the system with commands entered via the
administrative controller server or rack leader controller in the system rack. The cpower
commands are communicating with BMCs using the IPMI protocol.
The cpower commands may require several seconds to several minutes to complete, depending
on how many IRUs are being queried for status, powered-up, or shut down.
# cpower --all --status "*"
This command gives the status of all compute nodes in the system.
To power on or power off a specific IRU, enter the following commands:
# cpower --up r1i*
The system should respond by powering up all the IRU nodes in rack 1. Note that --on is the same
as --up. This command does not power-up the system administration (server) controller, rack
leader controller server or other service nodes.
# cpower --down r1i*
This command powers off all the nodes in every IRU in rack 1. Note that --down is the same as
--off. This command does not power-down the system administration (server) controller, rack
leader controller server or other service nodes.
See “Console Management Power (cpower) Commands” on page 8 for additional information on
power-on, power-off and power status commands. The SGI Altix ICE System Administrator's
Guide, (P/N 007-4993-00x) has extensive infomation on these topics.
18
007-4986-001
Chapter 3
3. System Overview
This chapter provides an overview of the physical and architectural aspects of your SGI Altix
Integrated Compute Environment (ICE) 8000 series system. The major components of the Altix
ICE systems are described and illustrated.
Because the system is modular, it combines the advantages of lower entry-level cost with global
scalability in processors, memory, InfiniBand connectivity and I/O. You can install and operate
the Altix ICE 8000 series system in your lab or server room. Each 42U SGI rack holds from one
to four 10U-high individual rack units (IRUs) that support up to sixteen compute/memory cluster
sub modules known as “blades.” These blades are single printed circuit boards (PCBs) with
ASICS, processors, memory components and I/O chip sets mounted on a mechanical carrier. The
blades slide directly in and out of the IRU enclosures. Every processor node blade contains at least
two dual-inline memory modules (DIMM) memory units.
Each blade supports two processor sockets that can have two or 4 processor cores. A maximum
system size of 64 compute/memory blades (512 cores) per rack is supported at the time this
document was published. Optional chilled water cooling may be required for large processor
count rack systems. Customers wishing to emphasize memory capacity over processor count can
choose blades configured with only one processor installed per blade. Contact your SGI sales or
service representative for the most current information on these topics.
The SGI Altix ICE 8000 series systems can run parallel programs using a message passing tool
like the Message Passing Interface (MPI). The ICE blade system uses a distributed memory
scheme as opposed to a shared memory system like that used in the SGI Altix 450 or Altix 4700
high-performance compute servers. Instead of passing pointers into a shared virtual address space,
parallel processes in an application pass messages and each process has its own dedicated
processor and address space. This chapter consists of the following sections:
007-4986-001
•
“System Models” on page 20
•
“System Architecture” on page 22
•
“System Features and Major Components” on page 25
•
“System Components” on page 31
19
3: System Overview
System Models
The basic enclosure within the Altix ICE system is the 10U high (17.5 inch or 44.45 cm)
“individual rack unit” (IRU). The IRU enclosure supports a maximum of 16 compute/memory
blades, up to eight power supplies, one chassis manager interface and two InfiniBand architecture
I/O fabric switch interface blades. Each IRU comes with 4x or optional 12x InfiniBand fabric
switch blades.
The 42U rack for this server houses all IRU enclosures, option modules, and other components;
up to 128 processor sockets (512 processor cores) in a single rack. Note that optional water chilled
rack cooling may be required for systems with high processor counts.
Figure 3-1 shows an example configuration of a single-rack Altix ICE 8000 server.
Figure 3-1
SGI Altix ICE 8000 Series System (Single Rack)
The system requires a minimum of one 42U tall rack with two single-phase power distribution
units (PDUs) per IRU installed in the rack. Each single-phase PDU has 5 outlets (eight are
required to support the eight power supplies that can be installed in each IRU).
20
007-4986-001
System Models
The three-phase PDU has 18 outlets (16 connections are required to support up to two IRUs
installed in the rack).
You can also add additional RAID and non-RAID disk storage to your rack system. Figure 3-2 on
page 21 shows an IRU and Rack. The chassis management display panel shown is used for future
functional enhancements.
42U High Rack
Rack leader controller
Independent Rack Unit (IRU)
Power supplies
Power supplies
Admin server
IRU
IRU
Chassis manager
InfiniBand
switch blade
Figure 3-2
007-4986-001
Chassis
management
display
InfiniBand
switch blade
IRU and Rack Components Example
21
3: System Overview
System Architecture
The Altix ICE 8000 series of computer systems are based on an InfiniBand I/O fabric which
operates in a manner similar to a fibre channel switch attached network (SAN). This concept is
enhanced by using the technologies described in the following subsections.
Memory Controller HUB
The Memory Controller HUB (MCH) is a single flip chip ball grid array (FCBGA) which supports
the following core platform functions:
•
System bus interface for the processors
•
Memory control sub-system
•
PCI Express ports
•
Fully buffered DIMM (FBD) thermal management
•
Memory (DIMM) sub-system
•
ESB-2 I/O controller
These functions are elaborated in the following subsections.
System Bus Interface
The system bus is configured for symmetric multi-processing across two independent front side
bus interfaces that connect the dual-core or quad-Core Intel Xeon processors. Each front side bus
on the MCH uses a 64-bit wide data bus. The 1333 MHz data bus is capable of addressing up to
64 GB of memory. The MCH is the priority agent for both front side bus interfaces, and is
optimized for one processor on each bus.
Each cluster node board supports two dual-core or quad-core Intel Xeon processors, with score
frequencies starting at 2.33 GHz. Previous generations of Intel Xeon processors are not supported
on the node board.
Memory Control Sub-system
The MCH provides four channels of Fully Buffered DIMM (FB-DIMM) memory. Each channel
can support up to 2 Dual Ranked Fully Buffered (DDR2) DIMMs. FB-DIMM memory channels
are organized into two branches with a capability to support RAID 1 (mirroring). The MCH can
22
007-4986-001
System Architecture
support up to 8 DIMMs or a maximum memory size of 32 GB physical memory in non-mirrored
mode and 16 GB physical memory in a mirrored (RAID 1) configuration.
Using all four channels a maximum read bandwidth of 21 GB/s for four FB-DIMM channels is
possible. This option also provides up to 10.7 GB/s of write memory bandwidth for four
FB-DIMM channels.
Memory DIMM Subsystem
A minimum of one dual-inline-memory module (DIMM) set (2 DIMMs) is required for each
blade. Blades are supported with 2, 4, 6, or 8 installed DIMMs. A maximum of four DIMM sets
(8 total DIMMs) can be installed in a compute blade. Each set of DIMMs (pair) on a blade must
be the same capacity and functional speed. When possible, it is generally recommended that all
blades within an IRU use the same number and capacity (size) DIMMs.
Each blade in the IRU may have a different total DIMM capacity. For example, one blade may
have eight DIMMs, and another may have only two. Note that while this difference in capacity is
acceptable functionally, it may have impacts on compute “load balancing” within the system.
ESB-2 I/O Controller
The ESB-2 is a multi-function device that provides the following four distinct functions:
•
IO controller
•
PCI-X bridge
•
Gb Ethernet controller
•
Baseboard Management Controller (BMC)
Each function within the ESB-2 has its own set of configuration registers. Once configured, each
appears to the system as a distinct hardware controller. The primary role of the ESB-2 is to provide
the Gigabit Ethernet interface between the Chassis Management Controller (CMC) and the
Baseboard Management Controller (BMC). Each blade‘s node board uses the following features:
•
Dual GbE MAC
•
Baseboard Management Controller (BMC)
•
Power management
Figure 3-3 on page 24 shows a functional block diagram of the Altix ICE 8000 series system IRU
compute/memory blades, InfiniBand interface, and component interconnects.
007-4986-001
23
3: System Overview
Compute 8
Compute 9
Compute 10
Compute 11
Compute 12
Compute 13
Compute 14
Compute 15
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
1
13
2
1
2
1
2
1
2
1
2
1
2
1
2
1
2
19
16
24
23
2
6
4
20
20
4
6
8
12
10
14
15
11
7
9
3
15
11
7
9
22
18
17
16
19
13
Figure 3-3
1
2
1
2
1
2
1
2
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
IB0 IB1
Compute 7
2
Compute 6
1
Compute 5
2
Compute 4
1
Compute 3
2
Compute 2
1
Compute 1
2
Compute 0
1
24
23
24
3
1
5
21
18
2
8
12
10
14
21
5
1
17
22
Functional Block Diagram of the Individual Rack Unit (IRU)
007-4986-001
System Features and Major Components
System Features and Major Components
The main features of the Altix ICE 8000 series server systems are introduced in the following
sections:
•
“Modularity and Scalability” on page 25
•
“Reliability, Availability, and Serviceability (RAS)” on page 30
•
“Bay (Unit) Numbering” on page 32
Modularity and Scalability
The Altix ICE 8000 series systems are modular systems. The components are primarily housed in
building blocks referred to as individual rack units (IRUs). However, other “free-standing” Altix
compute servers are used to administer, access and enhance the ICE 8000 systems. Additional
optional mass storage may be added to the system along with additional IRUs. You can add
different types of stand-alone module options to a system rack to achieve the desired system
configuration. You can configure and scale IRUs around processing capability, memory size or
InfiniBand fabric I/O capability. The air-cooled IRU enclosure has redundant, hot-swap fans and
redundant, hot-swap power supplies. The water-chilled rack option expands a single rack’s
compute density with added heat dissipation capability for the IRU components.
A number of free-standing (non-blade) compute and I/O servers are used with Altix ICE 8000
systems in addition to the standard two-socket blade-based compute nodes. These free-standing
units are:
•
Administrative controller server
•
System rack leader controller (RLC) server
•
Service nodes with the following functions:
–
Fabric management service (incorporated as part of the RLC)
–
Login server
–
Batch server
As a general rule, each ICE 8000 system will have at least one administrative controller server,
one rack leader controller server and one service node. These are all stand-alone 1U servers. The
following subsections further define the free-standing unit functions described in the previous list.
007-4986-001
25
3: System Overview
Administrative Server
As a general rule, there is one stand-alone administration controller server and I/O unit per system
rack. The administrative controller is a non-blade Altix 1U server system. The administration
controller server is used to install ICE system software, administer that software and monitor
information from all the nodes in the system.
A significant operating factor for the administrative controller server is the file system structure.
If the administration unit is NFS-mounting a network storage system outside the ICE system, input
data and output results will need to pass through the administration server for each job. Multiple
administration servers distribute this load. The exact number of administration controller servers
an ICE system requires for maximum performance is size and application dependent.
Another factor is the number of interactive logins. Since the administrative controller server is the
only server in the ICE 8000 that is connected to the external network, this is where interactive
logins occur. Some ICE systems are configured with dedicated “login servers” for this purpose.
In this case you might configure multiple “service nodes” but have all but one devoted to
interactive logins as “login nodes”, see the “Login Server Function” on page 27.
Rack Leader Controller
A rack leader controller (RLC) server is generally used by administrators to provision and manage
the system using SGI’s cluster management (CM) software. There is generally only one leader
controller per rack and it is a non-blade “stand-alone” 1U server. The rack leader controller is
guided and monitored by the administrative server. It in turn monitors, pulls and stores data from
the compute nodes of all the IRUs within the rack. The rack leader then consolidates and forwards
data requests received from the IRU’s blade compute nodes to the administrative server. The
leader controller may also supply boot and root file sharing images to the compute nodes in the
IRUs.
This RLC server is the point of submittal for all message passing interface (MPI) applications run
in the system. An MPI job is started from the RLC and the sub-processes are distributed to the ICE
system’s compute nodes.
For large systems or systems that run many MPI jobs, multiple RLC servers may be used to
distribute the load. The first RLC in the ICE system is the “master” controller server. Additional
RLCs are slaved to the first RLC (which is usually in rack 001). The second RLC runs the same
fabric management image as the primary “master” RLC and can “fail over” and continue to
support the ICE system’s fabric management without halting the overall system.
26
007-4986-001
System Features and Major Components
Under most ICE configurations the fabric management function is incorporated in a combination
within the rack leader controller (RLC) node. See the “Rack Leader Controller” subsection for
more detail. The fabric management node software function is monitored by and communicates
directly with the RLC server. If a separate fabric management unit is desired or required, it
functions to host the fabric management software and monitors the overall functionality of the ICE
system’s InfiniBand fabric communications. It supplies this information to the RLC server
periodically or upon request. As with the rack leader controller server, only one per rack is
supported and it is a stand-alone 1U server node.
Service Nodes
The service functionality of the service nodes listed in this subsection are all services that can
technically run on a single hardware server unit. Or, in the case of the fabric management function,
it can be co-resident on the rack leader controller node. As the system scales, you can add more
1U servers (nodes) and dedicate them to these service functions if the size of the system requires
it. However you can also have a smaller system where many of the services are combined on just
a single 1U service node.
Login Server Function
The login server function within the ICE system can be functionally combined with the batch
server node and/or gateway node function in some configurations. One or more per system are
supported. Very large systems with high levels of user logins may use one or more dedicated 1U
login server nodes. The login node functionality is generally used to create and compile programs,
and additional 1U login server nodes can be added as the total number of user logins increase.
Batch Server Node Function
The batch server function may be combined with login and/or gateway service nodes for many
configurations. Additional batch server nodes can be added as the total number of user logins
increase. This server node runs batch scheduler portable-batch system/load-sharing facility
(PBS/LSF) programs. Users login or connect to this node to submit jobs to the ICE 8000 system
compute nodes. If required, the batch server function can be an optional 1U or 5U stand-alone
server within the ICE system. One or more batch nodes are supported per system, based on system
size and functional requirement.
007-4986-001
27
3: System Overview
Multiple Chassis Manager Connections
In certain multiple-IRU configurations the chassis managers in each IRU may be interconnected
and wired to the administrative server and the rack leader controller (RLC) server. Figure 3-4 on
page 29 shows an example diagram of the interconnects. Note that the scale of the CMC drawings
is adjusted to clarify the interconnect locations.
28
007-4986-001
System Features and Major Components
Admin node
Leader node
Optional additional
rack connections
Figure 3-4
007-4986-001
Administration and Rack Leader Control Cabling to Chassis Managers
29
3: System Overview
Reliability, Availability, and Serviceability (RAS)
The Altix ICE 8000 server series components have the following features to increase the
reliability, availability, and serviceability (RAS) of the systems.
•
•
•
•
30
Power and cooling:
–
IRU power supplies are redundant and can be hot-swapped under most circumstances.
Note that this might not be possible in a “fully loaded” IRU.
–
A rack-level water chilled cooling option is available for systems with high-density
configurations.
–
IRUs have overcurrent protection at the blade and power supply level.
–
Fans are redundant and can be hot-swapped.
–
Fans run at multiple speeds in the IRUs. Speed increases automatically when
temperature increases or when a single fan fails.
System monitoring:
–
Chassis managers monitor the internal voltage, power and temperature of the IRUs.
–
Each IRU and each blade/node installed has failure LEDs that indicate the failed part;
LEDs are readable at the front of the IRU.
–
Systems support remote console and maintenance activities.
Error detection and correction
–
External memory transfers are protected by cyclical redundancy correction (CRC) error
detection. If a memory packet does not checksum, it is retransmitted.
–
Nodes within each IRU exceed SECDED standards by detecting and correcting 4-bit
and 8-bit DRAM failures.
–
Detection of all double-component 4-bit DRAM failures occur within a pair of DIMMs.
–
32-bits of error checking code (ECC) are used on each 256 bits of data.
–
Automatic retry of uncorrected errors occurs to eliminate potential soft errors.
Power-on and boot:
–
Automatic testing occurs after you power on the system nodes. (These power-on
self-tests or POSTs are also referred to as power-on diagnostics or PODs).
–
Processors and memory are automatically de-allocated when a self-test failure occurs.
–
Boot times are minimized.
007-4986-001
System Components
System Components
The Altix ICE 8000 series system features the following major components:
•
42U rack. This is a custom rack used for both the compute and I/O rack in the Altix ICE
8000 system. Up to 4 IRUs can be installed in each rack. There is 2U of space reserved for
the 1U administrative controller server and 1U rack leader controller server.
•
Individual Rack Unit (IRU). This enclosure contains the compute/memory blades, chassis
manager, InfiniBand fabric I/O blades and front-access power supplies for the Altix ICE
8000. The enclosure is 10U high. Figure 3-5 on page 32 shows the Altix ICE 8000 IRU
system components. Note that the chassis management display is a future enhanced feature
component.
•
Compute/memory blade. Holds one or two processor sockets (dual or quad-core) and 2, 4,
6 or 8 memory DIMMs.
•
1U Administrative server with PCIe/PCI-X expansion. This server node supports an
optional console, administrative software and three PCI Express option cards.
•
1U (Rack leader controller). The 1U rack leader server can also be used as an optional
login, batch, or fabric functional node.
•
5U (Batch server controller). The optional 5U batch controller server node is offered with
certain configurations needing higher performance batch node access for the ICE system. It
offers multiple I/O options and higher performance processors than the 1U server nodes.
Note: PCIe options may be limited, check with your SGI sales or support representative.
007-4986-001
31
3: System Overview
Switch blade 0
Power supplies
Chassis manager
Unused
switch blade
slot
Figure 3-5
Switch blade 1
Power supplies
Chassis management
display
Unused
switch blade
slot
Altix ICE 8000 IRU System Components Example
Bay (Unit) Numbering
Bays in the racks are numbered using standard units. A standard unit (SU) or unit (U) is equal to
1.75 inches (4.445 cm). Because IRUs occupy multiple standard units, IRU locations within a rack
are identified by the bottom unit (U) in which the IRU resides. For example, in a 42U rack, an IRU
positioned in U01 through U10 is identified as U01.
32
007-4986-001
System Components
Rack Numbering
Each rack is numbered with a three-digit number sequentially beginning with 001. A rack contains
IRU enclosures, optional mass storage enclosures, administrative and rack leader server nodes,
and potentially other options. In a single compute rack system, the rack number is always 001.
Optional System Components
Availability of optional components for the SGI ICE 8000 systems may vary based on new
product introductions or end-of-life components. Some options are listed in this manual, others
may be introduced after this document goes to production status. Check with your SGI sales or
support representative for the most current information on available product options not discussed
in this manual.
007-4986-001
33
Chapter 4
4. Rack Information
This chapter describes the physical characteristics of the tall (42U) ICE 8000 racks in the
following sections:
•
“Overview” on page 35
•
“Altix ICE 8000 Rack (42U)” on page 36
•
“Technical Specifications” on page 39
Overview
At the time this document was published only the tall (42U) Altix ICE 8000 rack shown in
Figure 4-2 was approved for use with the ICE systems.
007-4986-001
35
4: Rack Information
Altix ICE 8000 Rack (42U)
The tall rack (shown in Figure 4-1 on page 37) has the following features and components:
•
Front and rear door. The front door is opened by grasping the wide end of the
triangle-shaped door piece and pulling outward. It uses a key lock for security purposes that
should open all the front doors in a multi-rack system (see Figure 4-2 on page 38). Note that
the front door and rear door locks are keyed differently. The optional water-chilled rear door
does not use a lock.
The rear door has a push-button key lock to prevent unauthorized access to the system. The
rear doors have a master key that locks and unlocks all rear doors in a system made up of
multiple racks. You cannot use the rear door key to secure the front door lock.
36
•
Cable entry/exit area. Cable access openings are located in the front floor and top of the
rack. Cables are only attached to the front of the IRUs; therefore, most cable management
occurs in the front and top of the rack. Stand-alone administrative, leader and login server
modules are the exception to this rule and have cables that attach at the rear of the rack. Rear
cable connections will also be required for optional storage modules installed in the same
rack with the IRU(s). Optional inter-rack communication cables pass through the top of the
rack. I/O and power cables normally pass through the bottom of the rack.
•
Rack structural features. The rack is mounted on four casters; the two rear casters swivel.
There are four leveling pads available at the base of the rack. The base of the rack also has
attachment points to support an optional ground strap, and/or seismic tie-downs.
•
Power distribution units within the rack. A maximum of four single-phase power
distribution units (PDUs) are needed per rack, depending on the number of IRUs. Each IRU
requires one 5-outlet single-phase PDU. A three-phase power distribution unit has 18 outlet
connections, of which up to 16 may be needed to power two IRUs.
007-4986-001
Altix ICE 8000 Rack (42U)
Figure 4-1
007-4986-001
Altix ICE 8000 Rack Example
37
4: Rack Information
Figure 4-2
38
Tall (42U) Altix Rack and Front Lock
007-4986-001
Technical Specifications
Technical Specifications
Table 4-1 lists the technical specifications of the Altix ICE 8000 system tall rack.
Table 4-1
Tall Altix Rack Technical Specifications
Characteristic
Specification
Height
79.5 in. (201.9 cm)
Width
31.3 in. (79.5 cm)
Depth
45.8 in. (116.3 cm)
Weight (full)
2,200 lbs. (1,000 kg) approximate
Shipping weight (max)
2,450 lbs. (1,113.6 kg) approximate
Voltage range
North America/International
Nominal
200-240 VAC /230 VAC
Tolerance range
180-264 VAC /180-254 VAC
Frequency
North America/International
Nominal
50/60 Hz /50 Hz
Tolerance range
47-63 Hz /47-63 Hz
Phase required
Single-phase or 3-phase
Power requirements (max) 31.63 kVA (31 kW)
007-4986-001
Hold time
20 ms
Power cable
12 ft. (3.66 m) pluggable cords
39
Chapter 5
5. ICE Administration/Controller Modules
This chapter describes the function and physical components of the administrative/controller
server modules in the following sections:
•
“Overview” on page 41
•
“Administrative/Controller Servers” on page 42
For purposes of this chapter “administration/controller module” is used as a catch-all phrase to
describe the stand-alone servers that act as management infrastructure controllers. The specialized
functions these servers perform within the ICE system can include:
•
Administration
•
Rack leader controller (RLC)
•
Fabric management
•
Login
•
Batch
•
Storage
Overview
User interfaces consist of the Compute Cluster Administrator, the Compute Cluster Job Manager,
and a Command Line Interface (CLI). Management services include job scheduling, job and
resource management, Remote Installation Services (RIS), and a remote command environment.
The 1U administrative controller server is connected to the system via a Gigabit Ethernet link, (it
is not directly linked to the system’s InfiniBand communication fabric).
Note: The system management software runs on the administrative controller server, RLC and
service nodes as a distributed software function. The system management software performs all
of its tasks on the ICE system through an Ethernet network.
007-4986-001
41
5: ICE Administration/Controller Modules
The administrative controller server is at the top of the distributed management infrastructure
within the ICE system. The overall ICE 8000 system management is hierarchical (see Figure 5-1),
with the RLC(s) communicating with the compute nodes via CMC.
System management hierarchy
System admin
controller
Rack leader
controller
Rack leader
controller
Rack leader
controller
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
Compute blade
A450 storage
node
Service
node
A maximum of 64 compute blades per rack leader controller
Figure 5-1
ICE System Administration Hierarchy Example Block Diagram
Administrative/Controller Servers
The administrative conroller unit acts as the ICE system interface to the “outside world”, typically
a local area network (LAN). The administrative unit control panel features are shown in
Figure 5-2.
42
007-4986-001
Overview
RESET
Power
UID
Unit identifier
button
RESET
Universal information LED
NIC 2 activity LED
Power LED
HDD activity LED
NIC 1 activity LED
Figure 5-2
Administrative/Controller Server Control Panel Diagram
Table 5-1
System administrative server control panel functions
Functional feature
Functional description
Unit identifier button
Pressing this button lights an LED on both the front and rear of the
server for easy system location in large configurations. The LED will
remain on until the button is pushed a second time.
Universal information LED This multi-color LED blinks red quickly, to indicate a fan failure and blinks
red slowly for a power failure. A continuous solid red LED indicates a CPU
is overheating. This LED will be on solid blue or blinking blue when used for
UID (Unit Identifier).
007-4986-001
NIC 2 Activity LED
Indicates network activity on LAN 2 when flashing green.
NIC 1 Activity LED
Indicates network activity on LAN 1 when flashing green.
Disk activity LED
Indicates drive activity when flashing.
Power LED
Indicates power is being supplied to the server’s power supply units.
Reset button
Pressing this button reboots the server.
Power button
Pressing the button applies/ removes power from the power supply to
the server. Turning off power with this button removes main power
but keeps standby power supplied to the system.
43
5: ICE Administration/Controller Modules
Slim floppy drive
(optional)
Slim DVD-ROM drive
System
LEDs
System
reset
Main
power
UID
RESET
SATA drive bays
Power supply
modules
PCI expansion slots
(with riser cards)
Mouse
Keyboard
COM1
port
VGA
port
Ethernet
ports
USB
ports
Figure 5-3
1U Administration/Controller Server Front and Rear Panel
Rack Leader Controller Server
An MPI job is started from the rack leader controller server and the sub-processes are distributed
to the system blade compute nodes. The main process on the RLC server will wait for the
sub-processes to finish. For very large systems or systems that run many MPI jobs, multiple RLC
servers may be used to distribute the load (one per rack).
The 1U rack leader server may also run the software for login purposes as the system “login node”.
In other optional cases the RLC 1U server may be used to run the “batch node” function.
Batch or login functions may be run on individual 1U service nodes, especially when the system
is a large scale multi-rack installation or has a large number of users. See the section “Modularity
and Scalability” on page 25 for a list of administration and support server types and additional
functional descriptions.
44
007-4986-001
Chapter 6
6. Basic Troubleshooting
This chapter provides the following sections to help you troubleshoot your system:
007-4986-001
•
“Troubleshooting Chart” on page 46
•
“LED Status Indicators” on page 47
45
6: Basic Troubleshooting
Troubleshooting Chart
Table 6-1 lists recommended actions for problems that can occur. To solve problems that are not
listed in this table, contact your SGI system support engineer (SSE).
Table 6-1
Troubleshooting Chart
Problem Description
Recommended Action
The system will not power on.
Ensure that the power cords of the IRU are seated properly
in the power receptacles.
Ensure that the PDU circuit breakers are on and properly
connected to the wall source.
If the power cord is plugged in and the circuit breaker is on,
contact your SSE.
An individual IRU will not power on.
Ensure the power cables of the IRU are plugged in.
View the CMC output from your system administration
controller console.
If the CMC is not running, contact your SSE.
The system will not boot the operating system. Contact your SSE.
The Service Required LED illuminates on an View the CMC display of the failing IRU; contact your
IRU.
SSE for help as needed.
46
The Failure LED illuminates on an IRU.
View the CMC display of the failing IRU; contact your
SSE for help as needed.
The PWR LED of a populated PCI slot in a
support server is not illuminated.
Reseat the PCI card.
The Fault LED of a populated PCI slot in a
support server is illuminated (on).
Reseat the PCI card. If the fault LED remains on, replace
the PCI card.
The amber LED of a disk drive is on.
Replace the disk drive.
007-4986-001
LED Status Indicators
LED Status Indicators
There are a number of LEDs on the front of the IRUs that can help you detect, identify and
potentially correct functional interruptions in the system.
The following subsections describe these LEDs and ways to use them to understand potential
problem areas.
IRU Power Supply LEDs
Each power supply installed in an IRU has a single bi-color (green/amber) status LED.
The LED will either light green or amber (yellow), or flash green or yellow to indicate the status
of the individual supply. See Table 6-2 for a complete list.
Table 6-2
007-4986-001
Power Supply LED States
Power supply status
Green LED
Amber LED
No AC power to the supply
Off
Off
Power supply has failed
Off
On
Power supply problem warning
Off
Blinking
AC available to supply (standby)
but IRU is off
Blinking
Off
Power supply on (IRU on)
On
Off
47
6: Basic Troubleshooting
Compute/Memory Blade LEDs
Each compute/memory blade installed in an IRU has a total of ten LED indicators arranged in a
single row behind the perforated sheetmetal of the blade. The LEDs are located in the front lower
left section of the compute blade and are visible through the screen of the compute blade, see
Figure 6-1. The functions of the 10 LED status lights are as follows:
1.
UID - Unit identifier - this blue LED is used during troubleshooting to find a specific
compute node. The LED can be lit via software to aid in locating a specific compute node.
2. CPU Power OK - this green LED lights when the correct power levels are present on the
processor(s).
3. IB0 link - green LED lights when a link is established on the internal InfiniBand 0 port
4. IB0 active - this amber LED flashes when IB0 is active (transmitting data)
5. IB1 link - green LED lights when a link is established on the internal InfiniBand 1 port
6. IB1 active - this amber LED flashes when IB1 is active (transmitting data)
7. Eth1 link - this green LED is illuminated when a link as been established on the system
control Eth1 port
8. Eth1 active - this amber LED flashes when Eth1 is active (transmitting data)
9. Eth2 link - not used
10. Eth2 active - not used
1
Figure 6-1
48
2
3
4
6
5
7
8
9 10
Compute Blade Status LED Locations
007-4986-001
Appendix A
A. Technical Specifications and Pinouts
This appendix contains technical specification information about your system, as follows:
•
“System-level Specifications” on page 49
•
“Physical and Power Specifications” on page 50
•
“Environmental Specifications” on page 51
•
“I/O Port Specifications” on page 52
System-level Specifications
Table A-1 summarizes the Altix ICE 8000 system configuration ranges. Note that while each
compute/memory board can house either one or two processors; each processor socket houses four
processor “cores”.
Table A-1
Altix ICE 8000 System Configuration Ranges
Category
Minimum
Maximum
Processors
16 processor cores (4 blades)a 512 processor cores per rack
Individual Rack Units (IRUs)
1 per rack
4 per rack
Compute/memory blade DIMM
capacity
2 DIMMs per blade
8 DIMMs per blade
System main memory DIMMs
8 per IRU
128 per IRU (512 per rack)
Chassis management blades
One per IRU
4 per rack
InfiniBand switch blades
2 per IRU
4 per IRU (16 per rack)
a. Compute blades support one or two stuffed sockets each.
This is a total of four or eight cores per blade.
007-4986-001
49
A: Technical Specifications and Pinouts
Physical and Power Specifications
Table A-2 shows the physical specifications of the Altix ICE 8000 system.
Table A-2
Altix ICE 8000 Physical Specifications
System Features (single rack)
Specification
Height
79.5 in. (201.9 cm)
Width
31.3 in. (79.5 cm)
Depth
45.8 in. (116.3 cm)
Weight (full) maximum
2,200 lbs. (1,000 kg) approximate
Shipping weight maximum
2,450 lbs. (1,113.6 kg) approximate
Voltage range
North America/International
Nominal
200-240 VAC /230 VAC
Tolerance range
180-264 VAC /180-254 VAC
Frequency
North America/International
Nominal
50/60 Hz /50 Hz
Tolerance range
47-63 Hz /47-63 Hz
Phase required
Single-phase or 3-phase
Power requirements (max)
31.63 kVA (31 kW)
Hold time
20 ms
Power cable
12 ft. (3.66 m) pluggable cords
Shipping dimensions
Height: 81.375 in. (206.7 cm)
Width: 48 in. (121.9 cm)
Depth: 54 in. (137.1 cm)
Access requirements
50
Front
48 in. (121.9 cm)
Rear
48 in. (121.9 cm)
Side
None
007-4986-001
Environmental Specifications
Environmental Specifications
Table A-3 lists the environmental specifications of the system.
Table A-3
Environmental Specifications
Feature
Specification
Temperature tolerance
(operating)
+5 °C (41 °F) to +35 °C (95 °F) (up to 1500 m / 5000 ft.)
+5 °C (41 °F) to +30 °C (86 °F) (1500 m to 3000 m /5000 ft. to 10,000 ft.)
Temperature tolerance
(non-operating)
-40 °C (-40 °F) to +60 °C (140 °F)
Relative humidity
10% to 80% operating (no condensation)
8% to 95% non-operating (no condensation)
Heat dissipation
007-4986-001
Altix ICE 8000 (rack)
Approximately 105.8 kBTU/hr maximum (based on 31 kW)
Cooling requirement
Ambient air or optional water cooling
Air flow: intake (front),
exhaust (rear)
Approximately 6,000 CFM (normal operation) 4,000 CFM typical
Maximum altitude
10,000 ft. (3,049 m) operating
40,000 ft. (12,195 m) non-operating
Acoustical noise level
Less than 65 dBa maximum
51
A: Technical Specifications and Pinouts
I/O Port Specifications
This section contains specifications and port pinout information for the base I/O ports of your
system, as follows:
52
•
“Ethernet Port” on page 53
•
“Serial Ports” on page 54
007-4986-001
I/O Port Specifications
Ethernet Port
The system auto-selects the Ethernet port speed and type (duplex vs. half-duplex) when the server
is booted, based on what it is connected to. Figure A-1 shows the Ethernet port.
Pin 4
Pin 3
Pin 5
Pin 6
Pin 7
Pin 2
Pin 1
Pin 8
Figure A-1
Ethernet Port
Table A-4 shows the cable pinout assignments for the Ethernet port operating in 10/100-Base-T
mode and also operating in 1000Base-T mode.
Table A-4
Ethernet Pinouts
Ethernet 10/100Base-T Pinouts
Gigabit Ethernet Pinouts
Pins
Assignment
Pins
Assignment
1
Transmit +
1
Transmit/Receive 0 +
2
Transmit –
2
Transmit/Receive 0 –
3
Receive +
3
Transmit/Receive 1 +
4
NU
4
Transmit/Receive 2 +
5
NU
5
Transmit/Receive 2 –
6
Receive –
6
Transmit/Receive 1 –
7
NU
7
Transmit/Receive 3 +
8
NU
8
Transmit/Receive 3 –
NU = Not used
007-4986-001
53
A: Technical Specifications and Pinouts
Serial Ports
The IRU’s chassis management control boards have 9-pin serial interface connectors. These ports
are capable of transferring data at rates as high as 230 kbps. Other features of the ports include the
following:
•
Programmable data, parity, and stop bits
•
Programmable baud rate and modem control
Figure A-2 shows an example serial port.
Pin 3
Transmit
Data (TXD)
Pin 2
Receive Data
(RXD)
Pin 1
Data Carrier
Detect (DCD)
(not used)
Pin 4
Data Terminal
Ready (DTR)
(not used)
Pin 5
Ground
Pin 6
Data Set
Ready (DSR)
(not used)
Pin 7
Request to
Send (RTS)
Figure A-2
54
Pin 9
Ringing Indicator (RI)
(not used)
Pin 8
Clear to Send
(CTS)
Serial Port Connector
007-4986-001
I/O Port Specifications
Table A-5 shows pinout assignments for the 9-pin male DB-9 connector.
Table A-5
007-4986-001
Serial Port Pinout
Pin
Assignment
Description
1
DCD
Data carrier detect
2
RXD
Receive data
3
TXD
Transmit data
4
DTR
Data terminal ready
5
GND
Signal ground
6
DSR
Data set ready
7
RTS
Request to send
8
CTS
Clear to send
9
RI
Ring indicator
55
Appendix B
B. Safety Information and Regulatory Specifications
This appendix provides safety information and regulatory specifications for your system in the
following sections:
•
“Safety Information” on page 57
•
“Regulatory Specifications” on page 59
Safety Information
Read and follow these instructions carefully:
1.
Follow all warnings and instructions marked on the product and noted in the documentation
included with this product.
2. Unplug this product before cleaning. Do not use liquid cleaners or aerosol cleaners. Use a
damp cloth for cleaning.
3. Do not use this product near water.
4. Do not place this product or components of this product on an unstable cart, stand, or table.
The product may fall, causing serious damage to the product.
5. Slots and openings in the system are provided for ventilation. To ensure reliable operation of
the product and to protect it from overheating, these openings must not be blocked or
covered. This product should never be placed near or over a radiator or heat register, or in a
built-in installation, unless proper ventilation is provided.
6. This product should be operated from the type of power indicated on the marking label. If
you are not sure of the type of power available, consult your dealer or local power company.
7. Do not allow anything to rest on the power cord. Do not locate this product where people
will walk on the cord.
8. Never push objects of any kind into this product through cabinet slots as they may touch
dangerous voltage points or short out parts that could result in a fire or electric shock. Never
spill liquid of any kind on the product.
007-4986-001
57
B: Safety Information and Regulatory Specifications
9. Do not attempt to service this product yourself except as noted in this guide. Opening or
removing covers of node and switch internal components may expose you to dangerous
voltage points or other risks. Refer all servicing to qualified service personnel.
10. Unplug this product from the wall outlet and refer servicing to qualified service personnel
under the following conditions:
•
When the power cord or plug is damaged or frayed.
•
If liquid has been spilled into the product.
•
If the product has been exposed to rain or water.
•
If the product does not operate normally when the operating instructions are followed.
Adjust only those controls that are covered by the operating instructions since improper
adjustment of other controls may result in damage and will often require extensive work
by a qualified technician to restore the product to normal condition.
•
If the product has been dropped or the cabinet has been damaged.
•
If the product exhibits a distinct change in performance, indicating a need for service.
11. If a lithium battery is a soldered part, only qualified SGI service personnel should replace
this lithium battery. For other types, replace it only with the same type or an equivalent type
recommended by the battery manufacturer, or the battery could explode. Discard used
batteries according to the manufacturer’s instructions.
12. Use only the proper type of power supply cord set (provided with the system) for this unit.
13. Do not attempt to move the system alone. Moving a rack requires at least two people.
14. Keep all system cables neatly organized in the cable management system. Loose cables are a
tripping hazard that cause injury or damage the system.
58
007-4986-001
Regulatory Specifications
Regulatory Specifications
The following topics are covered in this section:
•
“CMN Number” on page 59
•
“CE Notice and Manufacturer’s Declaration of Conformity” on page 59
•
“Electromagnetic Emissions” on page 60
•
“Shielded Cables” on page 62
•
“Electrostatic Discharge” on page 62
•
“Laser Compliance Statements” on page 63
•
“Lithium Battery Statements” on page 64
This SGI system conforms to several national and international specifications and European
Directives listed on the “Manufacturer’s Declaration of Conformity.” The CE mark insignia
displayed on each device is an indication of conformity to the European requirements.
!
Caution: This product has several governmental and third-party approvals, licenses, and permits.
Do not modify this product in any way that is not expressly approved by SGI. If you do, you may
lose these approvals and your governmental agency authority to operate this device.
CMN Number
The model number, or CMN number, for the system is on the system label, which is mounted
inside the rear door on the base of the rack.
CE Notice and Manufacturer’s Declaration of Conformity
The “CE” symbol indicates compliance of the device to directives of the European Community.
A “Declaration of Conformity” in accordance with the standards has been made and is available
from SGI upon request.
007-4986-001
59
B: Safety Information and Regulatory Specifications
Electromagnetic Emissions
This section provides the contents of electromagnetic emissions notices from various countries.
FCC Notice (USA Only)
This equipment complies with Part 15 of the FCC Rules. Operation is subject to the following two
conditions:
•
This device may not cause harmful interference.
•
This device must accept any interference received, including interference that may cause
undesired operation.
Note: This equipment has been tested and found to comply with the limits for a Class A digital
device, pursuant to Part 15 of the FCC Rules. These limits are designed to provide reasonable
protection against harmful interference when the equipment is operated in a commercial
environment. This equipment generates, uses, and can radiate radio frequency energy and, if not
installed and used in accordance with the instruction manual, may cause harmful interference to
radio communications. Operation of this equipment in a residential area is likely to cause harmful
interference, in which case you will be required to correct the interference at your own expense.
If this equipment does cause harmful interference to radio or television reception, which can be
determined by turning the equipment off and on, you are encouraged to try to correct the
interference by using one or more of the following methods:
•
Reorient or relocate the receiving antenna.
•
Increase the separation between the equipment and receiver.
•
Connect the equipment to an outlet on a circuit different from that to which the receiver is
connected.
Consult the dealer or an experienced radio/TV technician for help.
!
60
Caution: Changes or modifications to the equipment not expressly approved by the party
responsible for compliance could void your authority to operate the equipment.
007-4986-001
Regulatory Specifications
Industry Canada Notice (Canada Only)
This Class A digital apparatus meets all requirements of the Canadian Interference-Causing
Equipment Regulations.
Cet appareil numérique német pas de perturbations radioélectriques dépassant les normes
applicables aux appareils numériques de Classe A préscrites dans le Règlement sur les
interferences radioélectriques établi par le Ministère des Communications du Canada.
VCCI Notice (Japan Only)
Figure B-1
VCCI Notice (Japan Only)
Chinese Class A Regulatory Notice
Figure B-2
Chinese Class A Regulatory Notice
Korean Class A Regulatory Notice
Figure B-3
007-4986-001
Korean Class A Regulatory Notice
61
B: Safety Information and Regulatory Specifications
Shielded Cables
This SGI system is FCC-compliant under test conditions that include the use of shielded cables
between the system and its peripherals. Your system and any peripherals you purchase from SGI
have shielded cables. Shielded cables reduce the possibility of interference with radio, television,
and other devices. If you use any cables that are not from SGI, ensure that they are shielded.
Telephone cables do not need to be shielded.
Optional monitor cables supplied with your system use additional filtering molded into the cable
jacket to reduce radio frequency interference. Always use the cable supplied with your system. If
your monitor cable becomes damaged, obtain a replacement cable from SGI.
Electrostatic Discharge
SGI designs and tests its products to be immune to the effects of electrostatic discharge (ESD).
ESD is a source of electromagnetic interference and can cause problems ranging from data errors
and lockups to permanent component damage.
It is important that you keep all the covers and doors, including the plastics, in place while you are
operating the system. The shielded cables that came with the unit and its peripherals should be
installed correctly, with all thumbscrews fastened securely.
An ESD wrist strap may be included with some products, such as memory or PCI upgrades. The
wrist strap is used during the installation of these upgrades to prevent the flow of static electricity,
and it should protect your system from ESD damage.
62
007-4986-001
Regulatory Specifications
Laser Compliance Statements
The DVD-ROM drive in this computer is a Class 1 laser product. The DVD-ROM drive’s
classification label is located on the drive.
Warning: Avoid exposure to the invisible laser radiation beam when the device is open.
Warning: Attention: Radiation du faisceau laser invisible en cas d’ouverture. Evitter
toute exposition aux rayons.
Warning: Vorsicht: Unsichtbare Laserstrahlung, Wenn Abdeckung geöffnet, nicht dem
Strahl aussetzen.
Warning: Advertencia: Radiación láser invisible al ser abierto. Evite exponerse a los
rayos.
Warning: Advarsel: Laserstråling vedåbning se ikke ind i strålen
Warning: Varo! Lavattaessa Olet Alttina Lasersåteilylle
Warning: Varning: Laserstrålning når denna del år öppnad ålå tuijota såteeseenstirra ej
in i strålen.
Warning: Varning: Laserstrålning nar denna del år öppnadstirra ej in i strålen.
Warning: Advarsel: Laserstråling nar deksel åpnesstirr ikke inn i strålen.
007-4986-001
63
B: Safety Information and Regulatory Specifications
Lithium Battery Statements
Warning: If a lithium battery is a soldered part, only qualified SGI service personnel
should replace this lithium battery. For other types, replace the battery only with the same
type or an equivalent type recommended by the battery manufacturer, or the battery could
explode. Discard used batteries according to the manufacturer’s instructions.
Warning: Advarsel!: Lithiumbatteri - Eksplosionsfare ved fejlagtig håndtering.
Udskiftning må kun ske med batteri af samme fabrikat og type. Léver det brugte batteri
tilbage til leverandøren.
Warning: Advarsel: Eksplosjonsfare ved feilaktig skifte av batteri. Benytt samme
batteritype eller en tilsvarende type anbefalt av apparatfabrikanten. Brukte batterier
kasseres i henhold til fabrikantens instruksjoner.
Warning: Varning: Explosionsfara vid felaktigt batteribyte. Anvãnd samma batterityp
eller en ekvivalent typ som rekommenderas av apparattillverkaren. Kassera anvãnt batteri
enligt fabrikantens instruktion.
Warning: Varoitus: Päristo voi räjähtää, jos se on virheellisesti asennettu. Vaihda paristo
ainoastaan laitevalmistajan suosittelemaan tyyppiin. Hävitä käytetty paristo valmistajan
ohjeiden mukaisesti.
Warning: Vorsicht!: Explosionsgefahr bei unsachgemäßen Austausch der Batterie.
Ersatz nur durch denselben oder einen vom Hersteller empfohlenem ähnlichen Typ.
Entsorgung gebrauchter Batterien nach Angaben des Herstellers.
64
007-4986-001
Index
Numbers
D
33361
Heading1
Led Status Indicators, 47
documentation
available via the World Wide Web, xv
conventions, xvii
A
E
Altix ICE servers
monitoring locations, 11
Altix ICE single-rack server
illustration, 20
environmental specifications, 51
F
B
front panel display
L1 controller, 17
battery statements, 64
bay slot numbering, 32
block diagram
system, 24
L
laser compliance statements, 63
LEDs on the front of the IRUs, 47
lithium battery warning statements, 2, 64
C
chassis management controller
front panel display, 17
CMC controller
functions, 17
CMN number, 59
Compute/Memory Blade LEDs, 48
customer service, xvii
007-4986-001
M
Message Passing Interface, 19
monitoring
server, 11
65
Index
N
S
numbering
bays in a rack, 32
racks, 33
server
monitoring locations, 11
system architecture, 22
system block diagram, 24
system components
Altix ICE IRU front, 32
list of, 31
system features, 25
system overview, 19
O
optional water chilled rack cooling, 20
P
physical specifications
Altix ICE 8000 System Physical Specifications, 50
pinouts
Ethernet connector, 53
serial connector, 54
Power Supply LEDs, 47
powering on
preparation, 5
product support, xvii
T
tall rack
features, 36
technical specifications
system level, 49
technical support, xvii
three-phase PDU, 21
troubleshooting
problems and recommended actions, 46
Troubleshooting Chart, 46
R
RAS features, 30
66
007-4986-001
Was this manual useful for you? yes no
Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Download PDF

advertising