SGI® Rackable™ C1104-GP1 System User Guide
007-6364-001
COPYRIGHT
© 2014 Silicon Graphics International Corp. All rights reserved; provided portions may be copyright in third parties, as indicated elsewhere herein. No permission is granted to copy, distribute, or create derivative works from the contents of this electronic documentation in any manner, in whole or in part, without the prior written permission of SGI.
LIMITED RIGHTS LEGEND
The software described in this document is “commercial computer software” provided with restricted rights (except as to included open/free source) as specified in the FAR 52.227-19 and/or the DFAR 227.7202, or successive sections. Use beyond license provisions is a violation of worldwide intellectual property laws, treaties and conventions. This document is provided with limited rights as defined in 52.227-14.
The electronic (software) version of this document was developed at private expense; if acquired under an agreement with the USA government or any contractor thereto, it is acquired as “commercial computer software” subject to the provisions of its applicable license agreement, as specified in (a) 48 CFR 12.212 of the FAR; or, if acquired for Department of Defense units, (b) 48 CFR 227-7202 of the DoD FAR Supplement; or sections succeeding thereto.
Contractor/manufacturer is SGI, 46600 Landing Parkway, Fremont, CA 94538.
TRADEMARKS AND ATTRIBUTIONS
Silicon Graphics, SGI, the SGI logo, Rackable, and Supportfolio are trademarks or registered trademarks of Silicon Graphics International Corp. in the United States and/or other countries worldwide.
Intel, Intel QuickPath Interconnect (QPI) and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries.
Fusion-MPT, Integrated RAID, MegaRAID, and LSI Logic are trademarks or registered trademarks of LSI Logic Corporation.
HyperTransport is a licensed trademark of the HyperTransport Technology Consortium.
InfiniBand is a registered trademark of the InfiniBand Trade Association.
Internet Explorer and MS-DOS are registered trademarks of Microsoft Corporation.
Java and Java Virtual Machine are trademarks or registered trademarks of Sun Microsystems, Inc.
Linux is a registered trademark of Linus Torvalds, used with permission by SGI.
Novell and Novell Netware are registered trademarks of Novell Inc.
PCIe and PCI-X are registered trademarks of PCI SIG.
Phoenix and PhoenixBIOS are registered trademarks of Phoenix Technologies Ltd.
Red Hat and all Red Hat-based trademarks are trademarks or registered trademarks of Red Hat, Inc. in the United States and other countries.
SUSE LINUX and the SUSE logo are registered trademarks of Novell, Inc.
UNIX is a registered trademark in the United States and other countries, licensed exclusively through X/Open Company, Ltd.
All other trademarks mentioned herein are the property of their respective owners.
Record of Revision
Version    Description
001        September, 2014. First release.
Contents
Record of Revision
About This Guide
    Audience
    Chapter Descriptions
    Related Publications
    Conventions
    Product Support
    Reader Comments
Introduction
    Server Board Features
    Processors
    QPI Interconnect
    Memory
    Serial ATA and Optional SAS
    PCI Express Expansion Slots
    Onboard Controllers/Ports
    Onboard Graphics Controller
    IPMI
    Other Features
    Server Chassis Features
    System Power
    Serial ATA Subsystems
    Front Control Panel
    Serverboard and GPU Subsystem
    GPU Features
    Cooling System
Server Installation
    Unpack the System
    Prepare for Setup
    Choose a Setup Location
    System Warnings and Precautions
    Server Precautions
    Rack Mounting Considerations
    Ambient Operating Temperature
    Reduced Airflow
    Mechanical Loading
    Circuit Overloading
    Reliable Ground
    Install the System into a Rack
    Separate the Sections of the Rack Rails
    Inner Rail Extensions
    Installing the Inner Rail Extensions
    Assembling the Outer Rails
    Assembling the Outer Rails
    Attaching the Outer Rack Rails
    Using the Rail Locking Tabs
    Install the Server in a Rack
    Supply Power to the System
System Interface
    Overview
    Control Panel Buttons
    Control Panel LEDs
    Power Fail LED
    Overheat/Fan Fail/UID LED
    NIC1
    NIC2
    HDD
    Power
    Drive Carrier LEDs
System Safety
    Electrical Safety Precautions
    Serverboard Battery
    ESD Precautions
    Mainboard Replaceable Soldered-in Fuses
    General Safety Precautions
System and Serverboard Information
    Handling Circuit Boards and Drives
    ESD Precautions
    Unpacking
    System Rear I/O Ports
    Serverboard Details
    CPUs
    Memory
    GPUs
    PCIe Expansion Slots
    System Health Monitoring
    ACPI Features
    Onboard I/O
    Serverboard Dimensions
    Hard Disk Drives (C1104-GP1 Chassis)
    Drive Configurations
    PCIe Expansion Cards
    Power Supply Functional Rating
Basic Troubleshooting and Chassis Service
    Basic Troubleshooting Procedures
    If the System Does Not Power Up
    System Powers Up But Will Not Boot
    No Video After System Power Up
    Memory Errors
    Chassis Service Information
    Static-Sensitive Devices
    Precautions
    Unpacking
    Control Panel
    Drive Bay Installation/Removal
    Accessing the Drive Bays
    Removing Hard Drives or Carriers from the Chassis
    The Hard Drive Backplane
    Disk Drive Installation
    Hard Drive Carrier Assembly Usage
    Power Supply
    Power Supply Failure
    Removing/Replacing a Power Supply
    Removing the Power Supply
    Installing a New Power Supply
    Accessing the Inside of the Chassis
    System Fans
    System Fan Failure
    Replacing System Fans
    Remove/Replace a Fan
    Install/Replace a PCIe Expansion Card
    Install/Replace a Low-profile or Full-height PCIe Card
BIOS Error Codes
System Operating and Regulatory Overview
    Environmental Specifications
    System Input Requirements
    Power Supply
    Regulatory Compliance
About This Guide
This guide provides an overview of the installation, architecture, general operation, and descriptions of the major components in the SGI® Rackable™ C1104-GP1 server. It also provides basic troubleshooting and maintenance information, BIOS error code information, and important safety and regulatory specifications.
Audience
This guide is written for users of SGI Rackable C1104-GP1 server systems. It is written with the assumption that the reader has a good working knowledge of computers and computer systems.
This guide may be useful to installers and system administrators looking for overview information on the server.
Chapter Descriptions
The following topics are covered in this guide:
• Chapter 1, “Introduction”
Provides an overview of the server’s components.
• Chapter 2, “Server Installation”
Provides a quick setup checklist to get the server operational.
• Chapter 3, “System Interface”
Describes several LEDs on the control panel as well as others on the SATA drive carriers that keep you constantly informed of the overall status of the system as well as the activity and health of specific components.
• Chapter 4, “System Safety”
Provides general system safety information.
• Chapter 5, “System and Serverboard Information”
Provides best-practice procedures for working with the node board in the C1104-GP1 chassis and for installing memory DIMMs, PCIe expansion cards, and hard disk drives.
• Chapter 6, “Basic Troubleshooting and Chassis Service”
Describes some basic steps required to troubleshoot your system. Additional sections in this chapter guide you through basic component removal and replacement procedures.
• Appendix A, “BIOS Error Codes”
Provides a brief listing of BIOS error code information.
• Appendix B, “System Specifications”
Describes system component, environmental, and compliance specifications.
Related Publications
The following SGI documents may be relevant to the use of your server:
• MegaRAID SAS Software User’s Guide, publication number 860-0488-xxx
• SGI Foundation Software release notes
• SGI Performance Suite release notes
• SGI InfiniteStorage series documentation
• Man pages
You can obtain SGI documentation, release notes, or man pages in the following ways:
• Refer to the SGI Technical Publications Library at http://docs.sgi.com. Various formats are available. This library contains the most recent books and man pages.
• Refer to the SGI Supportfolio™ webpage for release notes and other documents whose access requires a support contract. See “Product Support”.
Conventions
The following conventions are used throughout this document:
Convention      Meaning
Command         This fixed-space font denotes literal items such as commands, files, routines, path names, signals, messages, and programming language structures.
variable        The italic typeface denotes variable entries and words or concepts being defined. Italic typeface is also used for book titles.
user input      This bold fixed-space font denotes literal items that the user enters in interactive sessions. Output is shown in nonbold, fixed-space font.
[ ]             Brackets enclose optional portions of a command or directive line.
...             Ellipses indicate that a preceding element can be repeated.
man page(x)     Man page section identifiers appear in parentheses after man page names.
GUI element     This font denotes the names of graphical user interface (GUI) elements such as windows, screens, dialog boxes, menus, toolbars, icons, buttons, boxes, fields, and lists.
Product Support
SGI provides a comprehensive product support and maintenance program for its products. SGI also offers services to implement and integrate Linux applications in your environment.
• Refer to http://www.sgi.com/support/
• If you are in North America, contact the Technical Assistance Center at +1 800 800 4SGI or contact your authorized service provider.
• If you are outside North America, contact the SGI subsidiary or authorized distributor in your country.
Reader Comments
If you have comments about the technical accuracy, content, or organization of this document, contact SGI. Be sure to include the title and document number of the manual with your comments.
(Online, the document number is located in the front matter of the manual. In printed manuals, the document number is located at the bottom of each page.)
You can contact SGI in any of the following ways:
• Send e-mail to the following address: [email protected]
• Contact your customer service representative and ask that an incident be filed in the SGI incident tracking system: http://www.sgi.com/support/supportcenters.html
SGI values your comments and will respond to them promptly.
Chapter 1. Introduction
Important: SGI Rackable server systems may sometimes require driver versions that are not included in the original operating system release. When required, SGI provides these drivers on an SGI Driver CD, which may ship with the system, or on the system disk (pre-installed in the factory). For more information on this topic check with your sales or service representative.
The Rackable C1104-GP1 server is a 1U rackmount system (see Figure 1-1).
In addition to the serverboard and chassis, various hardware components may be included with the system, as listed:
• Ten 4-cm chassis fans
• One internal air shroud
• Two passive 1U CPU heatsinks
• Riser cards as follows:
– One riser for a single PCIe x16 card (left-front side internal GPU card)
– One riser for a single PCIe x16 card (left-back side internal GPU card)
– One riser for a single PCIe x16 card (right-front side internal GPU card)
– One riser for one low-profile PCIe 3.0 x8 card (external-facing rear card)
• Three power cables for GPU cards
• SATA accessories:
– One SAS/SATA backplane
– Four hot-swap disk drive carriers (RAID must be enabled for hot swap)
• Two power supplies
• One rackmount kit
Tip: The Rackable C1104-GP1 server does not use an internal CD/DVD drive. Check with your SGI sales or service representative for information on optional external CD/DVD drive units.
Figure 1-1    Rackable C1104-GP1 Server Front and Rear Views
(Callouts: system LEDs, four disk drive bays, system reset, main power, IPMI LAN, PCIe low-profile expansion slot, full-height PCIe slot, USB ports, LAN 1, LAN 2, VGA port)
Note: At time of publication, the rear x16 full-height PCIe slot (see
specific internal GPU option cards. Check with your SGI sales or service representative for the latest information on optional cards available for this slot.
Server Board Features
At the heart of the system is a dual-processor serverboard based on the Intel C610 platform controller hub (PCH) chipset and designed to provide maximum performance. The main features of the serverboard are described in the following subsections.
Processors
The serverboard supports two multi-core Intel® Xeon™ E5-2600 v3 Series processors. Each processor sits in an LGA 2011 socket, and the two processors are connected by dual Intel QuickPath Interconnect (QPI) links; see the next subsection for more information on the QPI interconnects. Four DDR4 memory channels are available per CPU socket, with two DIMM slots per channel. A direct media interface (DMI) connects the node’s PCH ASIC to processor 1, while 40-lane Gen-3 PCIe interconnect lines link both processors directly to the motherboard LAN ports.
QPI Interconnect
Double QPI link pairs connect the two processors together on the motherboard, providing processor-controlled transfer bandwidths of up to 51.2 GB/second between the sockets.
Each QPI comprises two 20-lane point-to-point data links, one in each direction (full duplex), with a separate clock pair in each direction, for a total of 42 signals. Each signal is a differential pair, so the total number of pins is 84. The 20 data lanes are divided into four “quadrants” of 5 lanes each. The basic unit of transfer is the 80-bit “flit”, which is transferred in two clock cycles (four 20-bit transfers, two per clock). The 80-bit flit has 8 bits for error detection, 8 bits for the link-layer header, and 64 bits for data. QPI bandwidths are advertised by computing the transfer of 64 bits (8 bytes) of data every two clock cycles in each direction.
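The flit arithmetic described above can be checked with a short calculation. The following Python sketch is illustrative only: the function name and its parameters are hypothetical, and the 9.6 GT/s transfer rate is an assumption taken from the "QPI 9.6G" label in the system block diagram (Figure 1-2), not a value stated in this subsection.

```python
def qpi_data_bandwidth_gb_s(transfer_rate_gt_s, links=1, directions=1):
    """Estimate QPI data bandwidth in GB/s from the flit layout.

    transfer_rate_gt_s: link transfer rate in GT/s (assumed input).
    links: number of QPI links counted.
    directions: 1 for one direction, 2 for both directions of a link.
    """
    flit_bits = 80                       # basic unit of transfer
    crc_bits = 8                         # error detection
    header_bits = 8                      # link-layer header
    data_bits = flit_bits - crc_bits - header_bits   # 64 bits of data
    lanes = 20                           # data lanes per direction
    transfers_per_flit = flit_bits // lanes          # four 20-bit transfers
    bytes_per_flit = data_bits // 8                  # 8 bytes of data
    # GT/s divided by transfers per flit gives flits per nanosecond.
    flits_per_ns = transfer_rate_gt_s / transfers_per_flit
    return flits_per_ns * bytes_per_flit * links * directions


# Assumed 9.6 GT/s rate; adjust to match the installed processors.
per_direction = qpi_data_bandwidth_gb_s(9.6)
per_link = qpi_data_bandwidth_gb_s(9.6, directions=2)
print(f"{per_direction:.1f} GB/s per direction, {per_link:.1f} GB/s per link")
```

Pass `links=2` to estimate the aggregate across both links between the sockets. Only the 64 data bits of each flit are counted, which matches the way QPI bandwidths are advertised.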
Memory
The serverboard has sixteen DIMM slots (eight per processor) that support DDR4 2133/1866/1600/1333 MHz RDIMMs. Up to two DIMMs per channel are supported.
Important: Use of two DIMMs per channel limits the DIMMs to a maximum speed of 1866 MHz. Note also that memory speed support depends on the type of CPUs used on the motherboard.
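The slot count and the two-DIMMs-per-channel speed limit can be expressed as a small planning helper. This is a minimal sketch under stated assumptions: the function names are hypothetical, and the `cpu_max_mhz` parameter simply stands in for the processor-dependent limit mentioned above, whose actual value depends on the CPU model installed.

```python
SUPPORTED_SPEEDS_MHZ = (2133, 1866, 1600, 1333)  # DDR4 RDIMM speeds listed above
SOCKETS = 2
CHANNELS_PER_SOCKET = 4
SLOTS_PER_CHANNEL = 2
TWO_DIMM_LIMIT_MHZ = 1866  # limit when two DIMMs share a channel


def total_dimm_slots():
    """Return the number of DIMM slots on the serverboard."""
    return SOCKETS * CHANNELS_PER_SOCKET * SLOTS_PER_CHANNEL


def effective_speed_mhz(dimm_speed_mhz, dimms_per_channel, cpu_max_mhz=2133):
    """Return the speed the DIMMs can be expected to run at.

    cpu_max_mhz is a hypothetical input for the CPU-dependent limit.
    """
    if dimm_speed_mhz not in SUPPORTED_SPEEDS_MHZ:
        raise ValueError(f"unsupported DIMM speed: {dimm_speed_mhz}")
    if dimms_per_channel not in (1, 2):
        raise ValueError("one or two DIMMs per channel are supported")
    limit = TWO_DIMM_LIMIT_MHZ if dimms_per_channel == 2 else max(SUPPORTED_SPEEDS_MHZ)
    return min(dimm_speed_mhz, limit, cpu_max_mhz)


print(total_dimm_slots())
print(effective_speed_mhz(2133, dimms_per_channel=2))
```

A fully populated board (two DIMMs in every channel) therefore trades peak memory speed for capacity.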
Serial ATA and Optional SAS
The Intel PCH C610 is integrated into the system serverboard to provide an internal four-port SATA disk subsystem. The four drive ports (0 through 3) are SATA 3.0 ports. The hot-swappable SATA drives are connected to a backplane that provides power, bus termination and configuration settings. Optional RAID 0, 1 and 10 are supported. Note that your operating system must have RAID support enabled to accommodate hot swapping of disk drives.
SAS RAID drive configurations require use of optional hardware ordered when the system is purchased. Check with your sales or service representative to obtain information on set-up procedures if your system did not come pre-configured with SATA or SAS RAID.
PCI Express Expansion Slots
The dual processor serverboard has three PCIe 3.0 x16 slots to support internal double-width GPU cards. An additional slot at the rear of the server supports one PCIe 3.0 x8 low-profile card.
See the section “PCIe Expansion Cards” in Chapter 5 for more information on these topics.
Onboard Controllers/Ports
The color-coded I/O ports include a VGA (monitor) port, two external USB 3.0 ports and two RJ45 LAN Ethernet ports. A dedicated external IPMI LAN port is also included, and an internal COM header is located on the serverboard.
Onboard Graphics Controller
The dual-processor serverboard features an integrated Aspeed AST2400 video controller providing a DDR3 2D video graphics interface through the system VGA connector. The AST video controller in the 1U server features PCIe x1 support, advanced BMC features, low power consumption, high reliability and 1920 x 1200 display capability at 60 Hz.
IPMI
IPMI (Intelligent Platform Management Interface) is a hardware-level interface specification that provides remote access, monitoring and administration for your SGI Rackable C1104-GP1 server platforms. IPMI allows server administrators to view a server’s hardware status remotely, receive an alarm automatically if a failure occurs, and power cycle a system that is non-responsive.
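As an illustration of the remote operations described above, the following Python sketch builds command lines for `ipmitool`, a widely used open-source IPMI client. This guide does not document `ipmitool`, so treat everything here as an assumption: the helper names are hypothetical, the host name and user name are placeholders, and the use of the `lanplus` (IPMI v2.0) interface assumes the BMC is configured for it. The `-E` option tells `ipmitool` to read the password from the `IPMI_PASSWORD` environment variable.

```python
import subprocess

# ipmitool subcommands matching the tasks described above.
HARDWARE_STATUS = ("chassis", "status")        # view hardware status
EVENT_LOG = ("sel", "list")                    # list logged system events
POWER_CYCLE = ("chassis", "power", "cycle")    # power cycle a hung system


def ipmi_command(host, user, *args):
    """Build an ipmitool argument list for a remote BMC (placeholder host/user)."""
    return ["ipmitool", "-I", "lanplus", "-H", host, "-U", user, "-E", *args]


def run_ipmi(host, user, *args):
    """Run ipmitool and return its output; requires ipmitool to be installed."""
    result = subprocess.run(
        ipmi_command(host, user, *args),
        capture_output=True, text=True, check=True,
    )
    return result.stdout


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch is safe to try.
    print(" ".join(ipmi_command("bmc.example.com", "admin", *HARDWARE_STATUS)))
```

Building the argument list separately from running it makes it easy to review a command, especially a power cycle, before it is sent to the dedicated IPMI LAN port.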
Other Features
Other on-board features that promote system health include on-board voltage monitors, a chassis intrusion header, auto-switching voltage regulators, chassis and CPU overheat sensors, virus protection and BIOS rescue.
Server Chassis Features
The following subsections provide a general outline of the main features of the SGI Rackable C1104-GP1 server chassis.
System Power
The Rackable C1104-GP1 1U server chassis features a redundant power supply composed of two separate power modules. This power redundancy feature allows you to replace a failed power supply without shutting down the system. Note that each power supply provides up to 1600 Watts of power to the system.
Serial ATA Subsystems
The server chassis supports up to four 2.5-in SATA drives. The chassis drive bays are SATA 3.0 (6 Gb/second) slots. Drives configured as RAID 0, 1 or 10 are hot-swappable units connected to a backplane that provides power and control. Note that the operating system you have installed must support RAID to enable the hot-swap capability of RAID drives. Certain RAID levels may require optional hardware or software to support RAID hard disk drives in the server.
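When choosing among RAID 0, 1 and 10 for the four drive bays, it can help to compare usable capacity. The sketch below uses the standard definitions of these RAID levels (striping, a two-drive mirror, and striped mirrors); the function name and the 1000 GB drive size are hypothetical examples, not values from this guide.

```python
MAX_DRIVES = 4  # the chassis supports up to four 2.5-in drives


def usable_capacity_gb(level, drive_count, drive_size_gb):
    """Return usable capacity in GB for equal-size drives at a RAID level."""
    if not 1 <= drive_count <= MAX_DRIVES:
        raise ValueError("the chassis holds one to four drives")
    if level == 0:
        # Striping: all capacity is usable, with no redundancy.
        if drive_count < 2:
            raise ValueError("RAID 0 needs at least two drives")
        return drive_count * drive_size_gb
    if level == 1:
        # Mirroring: two drives hold identical copies.
        if drive_count != 2:
            raise ValueError("this sketch models RAID 1 as a two-drive mirror")
        return drive_size_gb
    if level == 10:
        # Striped mirrors: half of the drives hold mirror copies.
        if drive_count != 4:
            raise ValueError("RAID 10 in this chassis uses all four drives")
        return (drive_count // 2) * drive_size_gb
    raise ValueError("supported levels are 0, 1 and 10")


for level, count in ((0, 4), (1, 2), (10, 4)):
    print(f"RAID {level} with {count} drives: {usable_capacity_gb(level, count, 1000)} GB")
```

RAID 0 gives the most capacity but no protection against a drive failure, so the hot-swap capability is mainly of use with RAID 1 and RAID 10.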
Front Control Panel
The control panel on the C1104-GP1 server provides you with system monitoring and control. LEDs indicate system power, HDD activity, and network activity, and a combined LED indicates system overheat, fan failure, or UID status. A main power button and a system reset button are also included.
Serverboard and GPU Subsystem
The C1104-GP1 server chassis is an ATX form factor chassis designed to be used in a 1U rackmount configuration. The serverboard’s I/O backplane supports up to three standard size (double-width) GPUs to enable high-quality GPU computing solutions. A 15-pin VGA port, two USB 3.0 ports, two Gigabit LAN ports and a dedicated RJ-45 IPMI LAN port are also supported.
The GPUs process complex image calculations and then route the data out through the VGA port on the serverboard. The GPUs, which come with a passive heatsink attached, have been tested for use with this system.
Important: Check with your SGI sales or service representative prior to using any GPU not sourced from the SGI factory.
Any combination of these cards (up to a total of three) may have come bundled with the system.
Power for the GPU cards is provided via a GPU power cable from each of the GPUs to JPW3, 4, 6 and 7 on the serverboard (one cable for each card).
Figure 1-2 shows a general block diagram of the C1104-GP1 server’s processor and I/O chipset.
GPU Features
Each of the GPUs provides some or all of the following features:
• Hundreds of GPU cores in each card that can deliver up to 1.4 teraflops of double-precision and up to 4.2 teraflops of single-precision calculations.
• ECC-protected internal register files, L1/L2 caches, shared memory, and external DRAM.
• Up to 12 GB of GDDR5 memory per GPU, which enhances performance and reduces data transfers by keeping larger data sets in local memory attached directly to the GPU.
• Integration of the GPU subsystem with the C1104-GP1 server’s monitoring and management capabilities such as IPMI.
• Onboard L1 and L2 caches that accelerate algorithms and sparse-matrix multiplication.
• Faster context switching, concurrent kernel execution and improved thread block scheduling.
• Data transfer over the PCIe bus while the computing cores are processing other data, which enhances overall system performance.
• A flexible programming environment with broad support for various programming languages and APIs.
Cooling System
The 1U server chassis has a cooling design that includes ten internal 4-cm counter-rotating Pulse Width Modulated (PWM) system cooling fans located in the chassis. An air shroud channels the airflow from the system fans to efficiently cool the processor and GPU areas of the system. Each power supply module also includes an internal cooling fan. All chassis and power supply fans operate continuously.
Figure 1-2    Processor, Memory and I/O Chipset System Block Diagram
(Labeled blocks include: Processor 1, Processor 2, DDR4 memory, QPI 9.6G links, DMI2, PCIe x16 G3 and x8 G3 slots, I350/X540 LAN, PCH, AST2400 BMC, BIOS SPI flash, VGA connector, COM header, USB 2.0 and USB 3.0, TPM header, temperature sensors, front panel, and fan speed control.)
Chapter 2. Server Installation
This chapter provides a quick setup checklist to get the SGI Rackable C1104-GP1 operational. If your system came already mounted in a rack, you can skip the rack installation procedures.
Unpack the System
Inspect the shipping container that the C1104-GP1 was shipped in and note if it was damaged in any way. If the server shows damage, file a damage claim with the carrier who delivered it.
Decide on a suitable location for the rack that supports the weight, power requirements, and environmental requirements of the C1104-GP1 server. It should be situated in a clean, dust-free environment that is well ventilated. Avoid areas where heat, electrical noise, and electromagnetic fields are generated. Place the server rack near a grounded power outlet. Refer also to “System Warnings and Precautions”.
Prepare for Setup
The shipping container should include two sets of rail assemblies, two rail mounting brackets and the mounting screws that you will use to install the system into a rack. Note that the inner rails should already be attached to the server. Read this section in its entirety before you begin the installation procedure.
Choose a Setup Location
Leave enough clearance in front of the rack to open the front door completely (approximately 25 inches), and leave approximately 30 inches of clearance behind the rack to allow for sufficient airflow and ease of servicing. This clearance may vary depending on the type of rack and installation site chosen. This product is for installation only in a Restricted Access Location as defined by IEC 60950, such as a dedicated equipment room, service closet, or lab. See also “Regulatory Compliance”.
System Warnings and Precautions
Warning: The SGI Rackable C1104-GP1 server weighs up to 37 lbs (16.8 kg). Always use proper lifting techniques when you move the server. Always get the assistance of another qualified person when you install the server in a location above your shoulders. Failure to do so may result in serious personal injury or damage to the equipment.
Warning: Extend the leveling jacks on the bottom of the rack to the floor with the full weight of the rack resting on them. Failure to do so can result in serious injury or death.
Warning: Attach stabilizers to the rack in single rack installations. Failure to do so can result in serious injury or death. Couple racks together in multiple rack installations. Failure to do so can result in serious injury or death.
Warning: Be sure the rack is stable before extending a component from the rack. Failure to do so can result in serious injury or death.
Warning: Extend only one rack component at a time. Extending two or more components simultaneously may cause the rack to tip over and result in serious injury or death.
Figure 2-1    Slide/Rail Equipment Usage Caution
Server Precautions
• Review the electrical and general safety precautions.
• Determine the placement of each component in the rack before you install the rails.
• Install the heaviest server components in the bottom of the rack first, and then work up.
• Add a regulating uninterruptible power supply (UPS) to protect the server from power surges and voltage spikes and to keep your system operating in case of a power failure.
• Allow the hot-pluggable disk drives and power supply modules to cool before touching.
• Always keep the rack’s front door and all panels and components on the servers closed when not servicing to maintain proper cooling.
• The server is not considered suitable for use at visual display workplaces under the German government ordinance for work with visual display units.
Rack Mounting Considerations
Use the guidelines provided in the following subsections to properly install the server in a rack.
Ambient Operating Temperature
If installed in a closed or multi-unit rack assembly, the ambient operating temperature of the rack environment may be greater than the ambient temperature of the room. Therefore, consideration should be given to installing the equipment in an environment compatible with the manufacturer’s maximum rated ambient temperature (30 °C or 86 °F).
Important: Certain system configurations using three NVIDIA GPUs and higher-wattage processors may require that the maximum ambient temperature of the operational environment be lower than 30 °C (86 °F).
Altitude of operation can also affect required ambient air temperatures. Check with your SGI sales or service representative for additional information on this topic.
Reduced Airflow
Equipment should be mounted into a rack so that the amount of airflow required for safe operation is not compromised.
Mechanical Loading
Equipment should be mounted into a rack so that a hazardous condition does not arise due to uneven mechanical loading. Racks should generally be filled with equipment from the bottom up.
Circuit Overloading
Consideration should be given to the connection of the equipment to the power supply circuitry and the effect that any possible overloading of circuits might have on over-current protection and power supply wiring. Appropriate consideration of equipment nameplate ratings should be used when addressing this concern.
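A rough way to apply nameplate ratings when planning a branch circuit is sketched below. All inputs are assumptions chosen for illustration: the 1600 W figure is the per-supply maximum mentioned in Chapter 1, while the 208 V supply voltage, the 30 A breaker and the 80% continuous-load derating are hypothetical planning values. Always use the actual equipment nameplate ratings and your local electrical code.

```python
def servers_per_circuit(nameplate_watts, circuit_volts, breaker_amps, derating=0.8):
    """Estimate how many servers fit on one branch circuit.

    derating is the assumed fraction of the breaker rating that may be
    used for continuous load (a planning assumption, not a guide value).
    """
    if min(nameplate_watts, circuit_volts, breaker_amps) <= 0:
        raise ValueError("ratings must be positive")
    amps_per_server = nameplate_watts / circuit_volts   # worst-case draw
    usable_amps = breaker_amps * derating               # continuous-load budget
    return int(usable_amps // amps_per_server)


# Hypothetical example: 1600 W nameplate load on a 208 V, 30 A circuit.
print(servers_per_circuit(1600, 208, 30))
```

Because the calculation uses the nameplate maximum and not measured draw, it is deliberately conservative, which is appropriate when the goal is to avoid tripping over-current protection.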
Reliable Ground
A reliable ground must be maintained at all times. To ensure this, the rack itself should be grounded. Particular attention should be given to power supply connections other than the direct connections to the branch circuit (for example, the use of power strips, and so on).
Install the System into a Rack
This section provides information on installing the C1104-GP1 into a rack. If the system has already been mounted into a rack, refer to “Supply Power to the System” on page 19. There are a variety of rack units on the market, which may mean the assembly procedure will differ slightly.
You should also refer to the installation instructions that came with the rack unit you are using.
Note that this system’s rail kit is designed to fit a rack between 26-in and 33.5-in deep.
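As a quick pre-installation check, the depth range stated above can be compared against a measured rack depth. The following is a minimal sketch; the function and constant names are hypothetical, and treating the limits as inclusive is an assumption rather than something this guide specifies.

```python
# Rail-kit depth limits quoted in this section, in inches.
RAIL_KIT_MIN_DEPTH_IN = 26.0
RAIL_KIT_MAX_DEPTH_IN = 33.5


def mm_to_inches(depth_mm: float) -> float:
    """Convert a depth measured in millimeters to inches (1 in = 25.4 mm)."""
    return depth_mm / 25.4


def rack_depth_fits(depth_in: float) -> bool:
    """Return True if the measured rack depth lies within the stated range.

    The limits are treated as inclusive, which is an assumption.
    """
    return RAIL_KIT_MIN_DEPTH_IN <= depth_in <= RAIL_KIT_MAX_DEPTH_IN


if __name__ == "__main__":
    for depth in (24.0, 29.0, 33.5, 36.0):
        verdict = "fits" if rack_depth_fits(depth) else "outside the rail kit range"
        print(f"{depth:.1f} in: {verdict}")
```

Measure the distance from the front rail to the rear rail of the rack, convert to inches if needed, and confirm the result against the installation instructions supplied with your rack.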
Separate the Sections of the Rack Rails
The chassis package includes two rail assemblies in the rack mounting kit.
Each assembly consists of two sections: an inner fixed chassis rail that secures directly to the server chassis and an outer fixed rack rail that secures directly to the rack itself. Note that the inner rail may be pre-installed on the server by the SGI factory, making the separation steps unnecessary.
To separate the inner and outer rails, perform the following steps:
1. Locate the rail assembly in the chassis packaging as shown in Figure 2-2.
2. Extend the rail assembly by pulling it outward.
3. Press the quick-release tab.
4. Separate the inner rail from the outer rail assembly.
Figure 2-2
Separating the System Rack Rail Components
Inner Rail Extensions
The chassis includes a set of inner rack rails in two sections:
• inner rails
• inner rail extensions
The inner rails are preattached to the server chassis and do not interfere with normal use of the system if you decide not to install it into a server rack. Attaching the inner rail extensions to the inner rails stabilizes the chassis within the rack; the procedure is described in the following subsection.
Installing the Inner Rail Extensions
1. Place the inner rail extensions over the preattached inner rails on the side of the chassis. Align the hooks of the inner rail with the rail extension holes. Make sure the extension faces “outward” just like the inner rail.
2. Slide the extension toward the front of the chassis.
3. Secure the chassis with screws as illustrated in Figure 2-3.
4. Repeat steps 1-3 for the other inner rail extension.
Figure 2-3
Rail to Server Chassis Attachment Example
Assembling the Outer Rails
Each outer rail is in two sections that must be assembled before mounting on to the rack.
Assembling the Outer Rails
1. Identify the left and right outer rails by examining the ends, which bend outward.
2. Slide the front section of the outer rail into the rear section of the outer rail.
3. The assembly should look similar to the example in Figure 2-4 on page 16.
Figure 2-4
Outer Rail Assembly Example
Attaching the Outer Rack Rails
Outer rails attach to the rack and hold the chassis in place. They extend between 26.5 and 36.4 inches.
1. Measure the depth of the rack (distance from the front rail to the rear rail) to ensure it complies with the limitations listed.
2. Adjust the outer rails to the proper length to fit within the rack. See the placement example in Figure 2-5.
3. Hang the hooks of the front of the outer rail onto the slots on the front of the rack. Use screws to secure the outer rails to the rack.
4. Pull out and adjust both the short and long brackets to the proper distance so that the rail fits snugly into the rack; see Figure 2-6 on page 18.
5. Hang the hooks of the rear portion of the outer rail into the slots on the rear of the rack. Secure the long bracket to the rear side of the outer rail with the screws provided.
6. Repeat the previous steps to properly install the left outer rail.
Figure 2-5
Outer Rack Rail Assembly/Placement Example
Using the Rail Locking Tabs
Both chassis rails have a locking tab, which serves two functions:
• The tabs lock the server into place when it is installed and pushed fully into the rack (its normal operating position).
• The tabs also lock the server in place when fully extended from the rack. This prevents the server from coming completely out of the rack when pulled out for servicing. Depress both tabs at the same time to fully remove the server from its rail mounting and extract it from the rack.
Install the Server in a Rack
Warning: The SGI Rackable C1104-GP1 server weighs up to 37 lbs (16.8 kg). Always use proper lifting techniques when you move the server. Always get the assistance of another qualified person when you install the server in a location above your shoulders. Failure to do so may result in serious personal injury or damage to the equipment.
1. Extend the outer rails on either side of the rack rail assembly.
2. Align the inner rails of the chassis with the outer rails on the rack (see Figure 2-6).
3. Slide the inner rails into the outer rails, keeping the pressure even on both sides. When the chassis has been pushed completely into the rack, it should click into the locked position.
4. Optional screws are recommended to secure and hold the front of the chassis to the rack.
Figure 2-6
Installing the Server in the Rack
Supply Power to the System
Connect the power cords from the power supply modules into a power strip or power distribution unit (PDU) within the rack. An optionally available uninterruptible power supply (UPS) can ensure continued operation in case of a failure of the regular power source.
After all power connections are verified, push the power-on button on the front of the server when you wish to power on the unit.
Chapter 3
System Interface
Overview
There are a number of LEDs on the front control panel, as well as others on the drive carriers and power supplies, that keep you informed of the overall status of the system and the health of its components. See Figure 3-1 for an example of the front control panel.
Figure 3-1
System Front Control Panel Indicator Components
Control Panel Buttons
In addition to monitoring the activity and health of specific components using LEDs, the system uses two buttons located on the front of the chassis: a reset button and a power on/off button. Use the reset button to reboot the system, as shown in Figure 3-2.
Figure 3-2
System Reset Button
Figure 3-3 shows the main power button, which is used to apply or turn off the main system power.
Turning off system power with this button removes the main power but keeps standby power supplied to the system.
Figure 3-3
System Power On Button
Control Panel LEDs
The control panel located on the front of the chassis has several LEDs. These LEDs provide you with critical information related to different parts of the system. This section explains what each
LED indicates when illuminated or flashing and any corrective action you may need to take.
Power Fail LED
The power fail LED indicates that a power supply module has failed, as shown in Figure 3-4. The second power supply module will take the load and keep the system running, but the failed module will need to be replaced. Refer to Chapter 6 for details on replacing the power supply. This LED should be off when the system is operating normally.
Figure 3-4
Power Fail LED
Overheat/Fan Fail/UID LED
When the red overheat/fan/UID LED flashes (shown in Figure 3-5), it indicates a fan failure. When on continuously, it indicates an overheat condition, which may be caused by cables obstructing the airflow in the system or the ambient room temperature being too warm.
Check the routing of the cables and make sure all fans are present and operating normally. You should also check to make sure that the chassis covers and fan shrouds are installed properly. This
LED will remain flashing or on as long as the indicated condition exists.
The “blue light” function (UID) of this LED is used to identify a specific server in large racks filled with equipment. When activated through the system software the “blue light” will remain on until shut down by the administrator.
Figure 3-5
Overheat/Fan Fail/UID LED
NIC1
When flashing, the NIC1 LED indicates network activity on the LAN1 port (see Figure 3-6).
Figure 3-6
LAN1 Network Activity NIC1 LED
NIC2
When flashing, the NIC2 LED indicates network activity on the LAN2 port (see Figure 3-7).
Figure 3-7
LAN2 Network Activity NIC2 LED
HDD
When flashing, the HDD LED indicates hard drive activity (see Figure 3-8).
Figure 3-8
Hard Drive Activity LED
Power
The power LED indicates that power is being supplied to the system's power supply unit(s). An example LED is shown in Figure 3-9. This LED should normally be illuminated when the system is operating.
Figure 3-9
Power On LED
Drive Carrier LEDs
Each hard disk drive carrier has two LEDs, which function as follows:
• Green: When illuminated, the green LED on the drive carrier indicates drive activity. A connection to the drive backplane enables this LED to blink on and off when that particular drive is being accessed. Please refer to Chapter 6 for instructions on replacing failed drives.
• Red: When this LED is flashing, it indicates that a RAID drive is rebuilding. A solidly lit red LED indicates a drive failure. If a drive fails, you should be notified by your system management software. Refer to Chapter 6 for instructions on replacing failed drives.
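The drive carrier LED behavior described above can be summarized as a small lookup table, which may be handy in a site runbook or monitoring script. This is an illustrative sketch only; the names `DRIVE_LED_MEANING` and `describe_drive_led` are hypothetical, and only the three states described in this section are included.

```python
# (color, pattern) pairs described in this section; anything else is unknown.
DRIVE_LED_MEANING = {
    ("green", "blinking"): "drive is being accessed (activity)",
    ("red", "flashing"): "RAID drive is rebuilding",
    ("red", "solid"): "drive failure; see Chapter 6 for replacement",
}


def describe_drive_led(color: str, pattern: str) -> str:
    """Return the meaning of an observed drive carrier LED state."""
    key = (color.strip().lower(), pattern.strip().lower())
    return DRIVE_LED_MEANING.get(key, "not described in this guide")


if __name__ == "__main__":
    print(describe_drive_led("Red", "solid"))
    print(describe_drive_led("green", "blinking"))
```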
Chapter 4
System Safety
This chapter describes basic safety precautions when using the server.
Electrical Safety Precautions
Basic electrical safety precautions should be followed to protect yourself from harm and the
Rackable C1104-GP1 system from damage, as follows:
• Be aware of the locations of the power on/off switch on the chassis as well as the room's emergency power-off switch, disconnection switch or electrical outlet. If an electrical accident occurs, you can then quickly remove power from the system.
• Do not work alone when working with high voltage components.
• Power should always be disconnected from the system when removing or installing main system components, such as the memory modules and disk drives. When disconnecting power, you should first power down the operating system and then unplug the power cords.
The unit can have more than one power supply cord. Disconnect two power supply cords before servicing to avoid electrical shock.
• When working around exposed electrical circuits, another person who is familiar with the power-off controls should be nearby to switch off the power if necessary.
• Use only one hand when working with powered-on electrical equipment. This is to avoid making a complete circuit, which will cause electrical shock. Use extreme caution when using metal tools, which can easily damage any electrical components or circuit boards they come into contact with.
• Do not use mats designed to decrease static electrical discharge as protection from electrical shock. Instead, use rubber mats that have been specifically designed as electrical insulators.
• The power supply power cords must include a grounding plug and must be plugged into grounded electrical outlets or power distribution units (PDUs).
Serverboard Battery
Caution: There is a danger of explosion if an onboard battery is installed upside down, which will reverse its polarities (see Figure 4-1). This battery must be replaced only with the same or an equivalent type recommended by the manufacturer. Check with your service representative if you have any questions.
Figure 4-1
Installing the Onboard Battery
[Figure callouts: lithium battery, battery holder]
Important: Handle used batteries carefully and do not damage the battery in any way; a damaged battery may release hazardous materials into the environment. Do not discard a used battery in the garbage or a public landfill. Dispose of used batteries according to the manufacturer's instructions and in compliance with the regulations set up by your local hazardous waste management agency.
ESD Precautions
Caution: This server contains electronic components and printed circuit boards which are susceptible to electrostatic discharge (ESD) damage. ESD is generated by two objects with different electrical charges coming into contact with each other. An electrical discharge is created to neutralize this difference, which can damage electronic components and printed circuit boards.
The following measures are generally sufficient to neutralize this difference before contact is made to protect your equipment from ESD:
• Use a grounded wrist strap designed to prevent static discharge.
• Keep all components and printed circuit boards (PCBs) in their antistatic bags until ready for use.
• Touch a grounded metal object before removing the board from the antistatic bag.
• Do not let components or PCBs come into contact with your clothing, which may retain a charge even if you are wearing a wrist strap.
• Handle a board by its edges only; do not touch its components, peripheral chips, memory modules or contacts.
• When handling chips or modules, avoid touching their pins.
• Put the serverboard and peripherals back into their antistatic bags when not in use.
• For grounding purposes, make sure your computer chassis provides excellent conductivity between the power supply, the case, the mounting fasteners and the serverboard.
Mainboard Replaceable Soldered-in Fuses
Important: If your system comes with self-resetting PTC (Positive Temperature Coefficient) fuses on the serverboard, they must be replaced by trained service technicians only. The new fuse must be the same as, or equivalent to, the one replaced. Contact your technical support organization for details and support.
General Safety Precautions
Follow these rules to ensure general safety:
• Keep the area around the Rackable C1104-GP1 system clean and free of clutter.
• The Rackable C1104-GP1 system weighs approximately 37 lbs (16.8 kg) when fully loaded. When lifting the system, two people, one at each end, should lift slowly with their feet spread out to distribute the weight. Always keep your back straight and lift with your legs.
• Place the chassis top cover and any system components that have been removed away from the system or on a table so that they won't accidentally be stepped on.
• While working on the system, do not wear loose clothing such as neckties and unbuttoned shirt sleeves, which can come into contact with electrical circuits or be pulled into a cooling fan.
• Remove any jewelry or metal objects from your body. They are excellent electrical conductors that can create short circuits and harm you if they come into contact with printed circuit boards or areas where power is present.
• After accessing the inside of the system, close the system back up and secure it to the rack unit with the retention screws after ensuring that all connections have been made.
Chapter 5
System and Serverboard Information
This chapter includes best practice procedures to work with a node board in the C1104-GP1 chassis and understand the system PCIe expansion cards and hard disk drives. Use the information in Chapter 6 to troubleshoot your server and add, remove, or replace system components.
A layout and quick reference chart is included in this chapter for your reference.
Some software products are protected with software license keys derived from the Media Access
Control (MAC) Ethernet address. If your system requires the replacement of a node board, the
MAC Ethernet address changes. If you are using such a product, you or your service representative must request a new license key after replacement of a node board. Contact your local customer support office: http://www.sgi.com/support/supportcenters.html
Caution: Install the chassis cover after you have completed accessing the components inside the server to maintain proper airflow and cooling for the system.
Handling Circuit Boards and Drives
Caution: Electrostatic discharge (ESD) can damage electrostatic-sensitive devices inside the
C1104-GP1 server. Use the ESD precautions described below when you handle printed circuit boards or other components in the system. The following measures are generally sufficient to protect your equipment from electro-static discharge.
ESD Precautions
• Use a grounded wrist strap designed to prevent electrostatic discharge.
• Touch a grounded metal object before removing any board from its antistatic bag.
• Handle each printed circuit board (PCB) by the edges; do not touch the components, peripheral chips, memory modules, or gold contacts on the PCB.
• When handling chips or modules, avoid touching the pins.
• Store PCIe cards, or other boards and components in antistatic bags when not in use.
• Make sure your computer chassis provides a conductive path between the power supply, the case, the mounting fasteners, and the node board to chassis ground.
Unpacking
Caution: System options are shipped in anti-static packaging to avoid electrostatic discharge damage. Be sure to use ESD precautions when you unpack upgrade or replacement components for the C1104-GP1 server. Failure to do so can result in damage to the equipment.
System Rear I/O Ports
The rear external system I/O ports are color coded in conformance with the PC 99 specification. See Figure 5-1 below for the colors and locations of the various I/O ports. Table 5-1 identifies the functions of each of the I/O ports on the backpanel.
Figure 5-1
I/O Port Locations
[Figure callouts 1 through 7 correspond to the entries in Table 5-1.]
Table 5-1
System Backpanel I/O Port Functions
1. USB port 0
2. USB port 1
3. Dedicated IPMI LAN port
4. LAN port 1
5. LAN port 2
6. VGA port
7. UID switch
Serverboard Details
The 1U C1104-GP1 system chassis has one node board. The C1104-GP1 serverboard is configured with two processors. When configured with two processors, the following rules apply:
• Both processor sockets must have identical revisions, core voltage, and bus/core speed.
• The stepping between the processors on the board must be identical.
• See Figure 5-2 for CPU locations on the serverboard. Note that the drawing is not to scale.
CPUs
• The C610 chipset is used on the system serverboard.
Memory
• Eight DIMM slots supporting 2133/1866/1600 MHz registered DDR4 ECC SDRAM.
Note: Check with your authorized sales/service representative for installation of approved
DIMM types.
GPUs
• A total of three GPUs are supported (true PCI-E 3.0 x16 signal). GPU types are limited; check with your sales or support representative.
PCIe Expansion Slots
• An external PCI-Express (PCIe) slot with the following features:
– One PCIe Gen 3.0 x8 low-profile card (in x16 slot)
System Health Monitoring
• Onboard voltage monitors
• Fan status monitor with firmware/software on/off and speed control
• Watch Dog
• Environmental temperature monitoring via BIOS
• Power-up mode control for recovery from AC power loss
• System resource alert (via included utility program)
• Auto-switching voltage regulator for each CPU core
• CPU thermal trip support
• I2C temperature sensing logic
• Chassis intrusion detection
ACPI Features
• Slow blinking LED for suspend state indicator
• BIOS support for USB keyboard
• Wake-On-LAN (WOL)
• Internal/external modem ring-on
• Hardware BIOS Virus protection
Onboard I/O
• Four disk drive bays supported by an on-chip SATA controller (RAID 0, 1 and 10 are supported in this system)
• Two (2) USB (Universal Serial Bus 3.0) ports (rear [external] type A)
• Two (2) RJ-45 LAN ports supported by an on-board Intel® Ethernet controller
• One (1) dedicated (RJ-45) IPMI LAN port
• One (1) VGA port supported by an Aspeed 2400 graphics controller (with DDR3 memory)
Serverboard Dimensions
Proprietary board format is: 19.8" x 9.2" (503 mm x 234 mm).
Figure 5-2
Node Board Layout Example
[Board diagram marked X10DRG-H, Rev. 1.01. Labels visible in the diagram include CPU1, CPU2, PCH, BMC, BIOS, BT1, JF1, JBT1, JOH1, JWD1, JITP1, LAN1, LAN2, IPMI_LAN, USB4/5 (3.0), VGA, P1 DIMMG1, P1 DIMMG2, and fan headers FAN3, FAN4, FANA, FANB, FANF, FANG.]
Hard Disk Drives (C1104-GP1 Chassis)
The 1U chassis supports a maximum of four 2.5-inch hard disk drives (see Figure 5-3). Install the drives from left to right, starting in the lower-left bay. Disk drive bays must be populated with either a drive or a “drive blank” to maintain system thermals. Failure to follow this guideline may cause system overheating and thermal shutdown of the unit.
Important: The operating system you use must have RAID support to enable the hot-swap capability and RAID functions of the SATA drives.
Figure 5-3
C1104-GP1 System Disk Drive Locations
[Figure callouts: system LEDs, four disk drive bays, system reset, main power]
Drive Configurations
The disk drive configurations supported in the Rackable C1104-GP1 server are outlined in the paragraphs that follow. Note that some configurations are dependent on use of optional hardware to support RAID configurations.
The supported disk drive configurations are as follows:
• JBOD
This non-RAID disk array supports any number of drives between one and four. The operating system is placed on the disk drive in location 0 (system disk). All other drives are data drives.
• RAID 0
Disk striping without parity. Supports any number of drives between two and four. All drives must be of the same type, speed and capacity. The operating system will be striped across all drives in the system. This configuration is not recommended.
• RAID 1
Disk “mirroring”. Supports exactly two drives, which together represent one RAID 1 logical drive. The operating system will be installed on the drives located in drive positions 0 and 1. Both drives must be of the same type, speed and capacity.
• RAID 10
Mirrored disk striping. The data is striped across one set of drives and then mirrored on another set of drives. Four drives are required in this system (a 2+2 configuration), and all drives must be of the same type, speed and capacity. The operating system will be striped across the drives in the primary set and then mirrored on the secondary set of drives.
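The drive-count and matching rules listed above can be expressed as a simple pre-flight check before ordering or populating drives. The sketch below is an illustration under stated assumptions: the `Drive` fields and the `validate_layout` function are hypothetical names, and "same type, speed and capacity" is modeled as exact equality of three descriptive fields.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Drive:
    kind: str         # for example "SATA SSD" (descriptive label, assumed)
    speed: str        # for example "6Gb/s" (descriptive label, assumed)
    capacity_gb: int


# Allowed drive counts per configuration, taken from the list above.
ALLOWED_COUNTS = {
    "JBOD": range(1, 5),    # one to four drives
    "RAID0": range(2, 5),   # two to four drives
    "RAID1": range(2, 3),   # exactly two drives
    "RAID10": range(4, 5),  # four drives (2+2) in this chassis
}


def validate_layout(mode: str, drives: Sequence[Drive]) -> List[str]:
    """Return a list of rule violations; an empty list means none were found."""
    key = mode.upper().replace(" ", "")
    if key not in ALLOWED_COUNTS:
        return [f"unsupported configuration: {mode}"]
    problems = []
    if len(drives) not in ALLOWED_COUNTS[key]:
        problems.append(f"{mode} does not support {len(drives)} drive(s)")
    # RAID sets require matching drives; JBOD has no such rule in this guide.
    if key != "JBOD" and len(set(drives)) > 1:
        problems.append("all drives must be the same type, speed and capacity")
    return problems


if __name__ == "__main__":
    ssd = Drive("SATA SSD", "6Gb/s", 480)
    print(validate_layout("RAID 1", [ssd, ssd]))
    print(validate_layout("RAID 10", [ssd, ssd, ssd]))
```

The check only mirrors the rules stated in this section; approved drive models and any optional RAID hardware still need to be confirmed with your service representative.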
PCIe Expansion Cards
There are three internal double-width (GPU) PCIe 3.0 x16 expansion slots and one external PCIe slot available with the C1104-GP1 server. The external option slot functions as listed:
• External PCI-Express 3.0 x8 low-profile card (x16 slot)
Note: Only specific GPU cards will fit and function in the internal PCIe GPU slots. Contact your SGI sales or service representative for information on approved GPU cards.
Power Supply Functional Rating
The C1104-GP1 server default configuration is two rear-installed 1600-Watt power supplies. The second power supply acts as a redundant power unit for the server. The supplies are “auto-ranging” and can operate from either 100-140V or 180-240V levels at 50 or 60Hz.
Each power supply module has its own cooling fan.
The supplies used have a Platinum Certification rating.
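For site planning, the input ranges quoted above can be checked against a facility's nominal supply. This is a minimal sketch using only the figures from this section; the names are hypothetical, and inclusive range limits are an assumption.

```python
# Input ranges quoted in this section (volts AC) and supported line frequencies.
VOLTAGE_RANGES = ((100.0, 140.0), (180.0, 240.0))
FREQUENCIES_HZ = (50, 60)


def input_supported(volts: float, hz: int) -> bool:
    """Return True if the nominal supply falls within the quoted ranges."""
    in_range = any(low <= volts <= high for low, high in VOLTAGE_RANGES)
    return in_range and hz in FREQUENCIES_HZ


if __name__ == "__main__":
    for volts, hz in ((120, 60), (230, 50), (160, 60)):
        print(volts, hz, input_supported(volts, hz))
```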
Chapter 6
Basic Troubleshooting and Chassis Service
Use the procedures in the first half of this chapter to troubleshoot your system. If you follow all of the procedures and still need assistance, check with your authorized support organization.
Basic Troubleshooting Procedures
Use the information in the following subsections to remedy basic problems you might encounter when working with the SGI Rackable C1104-GP1 server.
If the System Does Not Power Up
If the system will not power up when the front power button is pushed, use the following checklist to identify common sources for the problem:
• Make sure that both ends of each system power cable are firmly connected to the power supplies and the corresponding power source(s) or power distribution unit (PDU).
• Check to see if the power fail LED is lit on the front of the unit. This LED should be off if the system is operating normally.
• Check that the LED on each power supply is properly lit. The power supply has one status LED located on the left side of the front of the power supply. The LED has four states:
– Dark or off - indicates no AC power present
– Solid Amber - AC power is present, the server is not turned on (no DC power)
– Blinking Amber - supply temperature exceeds 63 °C (auto-shutdown occurs at 73 °C)
– Solid Green - AC power is present and the server is turned on (DC power present)
• Open the system cover, remove the air shroud and check to make sure that no obvious short circuits exist between the serverboard and chassis.
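The power supply LED states in the checklist above can be captured in a small table for use in a troubleshooting script or runbook. This is a sketch only; `PsuLed` and `explain_psu_led` are hypothetical names, and the accepted input strings are an assumption about how an operator might describe what they see.

```python
from enum import Enum


class PsuLed(Enum):
    OFF = "off"
    SOLID_AMBER = "solid amber"
    BLINKING_AMBER = "blinking amber"
    SOLID_GREEN = "solid green"


# Meanings paraphrased from the checklist above.
PSU_LED_MEANING = {
    PsuLed.OFF: "no AC power present",
    PsuLed.SOLID_AMBER: "AC power present, server not turned on (no DC power)",
    PsuLed.BLINKING_AMBER: "supply temperature exceeds 63 C (auto-shutdown at 73 C)",
    PsuLed.SOLID_GREEN: "AC power present and server turned on (DC power present)",
}


def explain_psu_led(observed: str) -> str:
    """Map an observed LED description to its meaning.

    Raises ValueError if the description is not one of the listed states.
    """
    normalized = " ".join(observed.lower().split())
    if normalized == "dark":
        normalized = "off"
    return PSU_LED_MEANING[PsuLed(normalized)]


if __name__ == "__main__":
    print(explain_psu_led("Solid Amber"))
```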
System Powers Up But Will Not Boot
If the system powers up but will not boot the Operating System, check the following:
• Check the system order document(s) - the C1104-GP1 server may have been ordered with no operating system. If so, check with your system administrator for OS loading information.
• Check the system disk (drive 0) for drive activity and confirm that it is firmly seated in the disk bay. A red light on the front of the disk indicates a functional error. Check with your service provider or local system administrator.
No Video After System Power Up
If the system powers up and appears to be booting normally but no video is present, try the following basic solutions:
• Confirm your monitor is plugged in and switched on.
• Check all video cables and ensure they are properly connected.
• Listen for a BIOS “beep code” error message - one long beep plus 8 short beeps indicates a video error. This beep code message could indicate a video memory error or other video malfunction; contact your service provider.
• If using an optional PCIe video card, check the back of the card for LED activity or a fault indicator. Try opening the system, reseating the PCIe card and rebooting; see the section “Install/Replace a PCIe Expansion Card” on page 55.
If you cannot get a video signal after trying these basic solutions, contact your support provider.
Memory Errors
If your system experiences memory related errors, try these basic troubleshooting steps to resolve or better identify the problem:
• Confirm that the power supply LED is not indicating an error.
• Listen for memory error beep codes. Five short beeps followed by one long beep is a BIOS signal that no system memory has been detected; see Appendix A, “BIOS Error Codes”.
• Shut the system down, remove the covers over the serverboard and make sure that all the DIMM modules are properly and fully installed.
• You should be using registered ECC DDR4 memory. Also, it is recommended that you use the same memory type and speed for all DIMMs in the system.
• Contact your administrator or support provider if the memory errors continue.
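The two BIOS beep codes mentioned in this chapter (the video error and the missing-memory signal) can be recorded as a lookup keyed on the beep sequence. The sketch below is illustrative; the names are hypothetical, and it covers only the codes quoted in this chapter, not the full list in Appendix A.

```python
from typing import Sequence

LONG, SHORT = "long", "short"

# Only the two codes quoted in this chapter; Appendix A has the complete list.
BEEP_CODES = {
    (LONG,) + (SHORT,) * 8: "video error (video memory or other video malfunction)",
    (SHORT,) * 5 + (LONG,): "no system memory detected",
}


def decode_beeps(beeps: Sequence[str]) -> str:
    """Return the meaning of a beep sequence given as 'long'/'short' strings."""
    pattern = tuple(b.strip().lower() for b in beeps)
    return BEEP_CODES.get(pattern, "not covered here; see Appendix A")


if __name__ == "__main__":
    print(decode_beeps(["short"] * 5 + ["long"]))
```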
Chassis Service Information
The following sections cover the steps required to install components and perform maintenance on the C1104-GP1 chassis. For component installation, follow the steps in the order given to eliminate the most common problems encountered. If some steps are unnecessary, skip ahead to the step that follows.
Important: Always disconnect the AC power cord(s) before adding, changing or installing any internal hardware components.
Tools Required: The only tool you will need to install components and perform maintenance is a
Phillips screwdriver.
Static-Sensitive Devices
Electrostatic discharge (ESD) can damage electronic components. To prevent damage to any printed circuit boards (PCBs), it is important to handle them very carefully. The following measures are generally sufficient to protect your equipment from ESD damage.
Precautions
• Use a grounded wrist strap designed to prevent static discharge.
• Touch a grounded metal object before removing any board from its antistatic bag.
• Handle a board by its edges only; do not touch its components, peripheral chips, memory modules or gold contacts.
• When handling chips or modules, avoid touching their pins.
• Put the serverboard, add-on cards and peripherals back into their anti-static bags when not in use.
• For grounding purposes, make sure your computer chassis provides excellent conductivity between the power supply, the case, the mounting fasteners and the serverboard.
Unpacking
Replacement components are usually shipped in anti-static packaging to avoid static damage.
When unpacking an upgrade or replacement component, make sure the person handling it is static protected.
Control Panel
The control panel (located on the front of the chassis) must be connected to the JF1 connector on the serverboard to provide you with system status indications. A ribbon cable has bundled these wires together to simplify the connection. Connect the cable from JF1 on the serverboard to the
Control Panel PCB (printed circuit board). Make sure the red wire plugs into pin 1 on both connectors. Pull all excess cabling out of the airflow path. The LEDs inform you of system status.
See Chapter 3 for details on the LEDs and the control panel buttons.
Drive Bay Installation/Removal
This section describes hard drive installation and removal.
Accessing the Drive Bays
Drives: You do not need to access the inside of the chassis or remove power to replace or swap a
RAIDed hard disk drive. Data may be lost or corrupted if you “hot swap” a JBOD disk drive. Shut down system power before removing or replacing a JBOD disk. Removing either a RAID or
JBOD drive without replacing it may cause system errors. Proceed to the next section for further hard drive instructions.
Note: You must use approved 2.5" disk drives in the system.
Removing Hard Drives or Carriers from the Chassis
1. Press the release button on the drive carrier. This extends the drive carrier handle.
2. Use the handle to pull the drive carrier out of the chassis.
Important: Empty carriers without drives must stay in the chassis during operation for proper airflow/cooling purposes except during remove/replace operations. Do not operate the server with carriers removed.
The Hard Drive Backplane
The hard drives plug into a backplane that provides power, drive ID and bus termination. A RAID controller and/or optional RAID software can be used with the backplane to provide data security.
The operating system you use must have RAID support to enable the hot-swap capability of the hard drives. The backplane is preconfigured, so no jumper/switch configuration is required.
Caution: Be careful when working around the drive backplane. Do not touch the backplane with your fingers or any metal objects and make sure no ribbon cables touch the backplane or obstruct the holes, which aid in proper airflow.
Disk Drive Installation
The drives are mounted in drive carriers (Figure 6-1) to simplify their installation and removal from the chassis; see Figure 6-2 on page 44 for a disk removal example. These carriers also help promote proper airflow for the drives. For this reason, even empty carriers without hard drives installed must remain in the chassis during operation. See Figure 6-3 for an example drive carrier and the “dummy” drive blank used when a working disk is not installed in a drive slot.
Figure 6-1 Drive and Carrier Assembly Example
Figure 6-2 Remove Drive and Carrier from Front of Server
Hard Drive Carrier Assembly Usage
1. Remove the four screws securing the dummy/bad drive to the hard drive carrier.
2. Insert a new/replacement hard drive into the carrier with the PCB side facing down and the connector end toward the rear of the carrier.
3. Align the hard drive in the disk drive carrier so that the mounting holes of the carrier are aligned with the mounting holes of the drive. Note that there are holes in the carrier which are marked “SATA” to aid in correct installation.
4. Secure the drive to the carrier with four screws. Use the M3 flat-head screws included in the HDD bag of your accessory box. Note: The screws used to secure a dummy drive to the carrier should not be used to secure the hard drive.
5. Insert the hard drive carrier assembly into its bay vertically, keeping the carrier oriented so that the release button is on the bottom. When the carrier reaches the rear of the drive bay, the handle will retract.
6. Using your thumb, push against the upper part of the hard drive handle until the assembly clicks into the locked (fully seated) position; see Figure 6-4 on page 47 for an example.
Note: Your operating system must have RAID support to enable the hot-plug capability of the drives.
Caution: Regardless of how many hard drives are installed, all drive carriers must remain in the drive bays to maintain proper airflow and system cooling.
Figure 6-3 Drive Carrier Attachment to Dummy Drive Blank Example
Figure 6-4 Hard Disk Drive Installation Example
Power Supply
The system offers a redundant power supply assembly consisting of two 1600-Watt power modules. Each power supply module has an auto-switching capability, which enables it to automatically sense and operate at a 100V - 240V input voltage at 50 or 60Hz.
Power Supply Failure
If either of the two power supply modules fails, the other module will take the full load and allow the system to continue operation without interruption. The PWR Fail LED will illuminate and remain on until the failed unit has been replaced. The power supply units are hot-swappable, meaning you can replace the failed unit without powering down the system; see Figure 6-5 on page 49 for an example.
Removing/Replacing a Power Supply
You do not need to shut down the system to replace a failed power supply unit. The backup power supply module will keep the system up and running while you replace the failed unit. Replace with the same model.
Removing the Power Supply
1. Unplug the AC power cord from the failed power supply module.
2. Depress the locking tab on the power supply module.
3. Pull the module straight out using the rounded handle.
Installing a New Power Supply
1. Replace the failed hot-swap unit with another identical power supply unit.
2. Push the new power supply unit into the power bay until you hear a click.
3. Secure the locking tab on the unit.
4. Finish by plugging the AC power cord back into the unit.
Figure 6-5 Power Supply Remove/Replace Example
Accessing the Inside of the Chassis
1. Grasp the two handles on either side and pull the unit straight out until it locks (you will hear a “click”).
2. Depress the two buttons on the top of the chassis to release the top cover and, at the same time, push the cover away from you until it stops. You can then lift the top cover from the chassis to gain full access to the inside of the server.
Note: Normally you would power down the system before installing or removing internal components - but it may be necessary to leave system power on to determine which fan has failed.
System Fans
Ten 4-cm counter-rotating fans provide the cooling for the system. Each fan unit is actually made up of two fans joined back-to-back, which rotate in opposite directions. This counter-rotating action generates exceptional airflow and works to dampen vibration levels. It is very important that the chassis top cover is properly installed and making a good seal in order for the cooling air to circulate properly through the chassis and cool the components.
System Fan Failure
Fan speed is controlled by system temperature via a BIOS setting. If a fan fails, the remaining fans will ramp up to full speed and the overheat/fan fail LED on the control panel will flash. Replace any failed fan as soon as possible with the same type and model (the system can continue to run with a failed fan).
Your system administrator may be able to identify which fan has failed using the system BIOS.
If an administrator or service representative is not using the BIOS to determine which fan has failed, you can remove the top chassis cover while the system is still running to determine which of the fans has failed. After determining which is the failed fan, shut down and remove power from the system by unplugging the server’s cords. Never run the server for an extended period of time with the top cover open.
Replacing System Fans
This section describes how to remove or install a system fan.
Remove/Replace a Fan
1. If you have not already done so, remove the chassis cover to access the fans; see the “Cooling Fans Access Example” figure.
2. Turn off the power to the system and unplug the AC power cord.
3. Disconnect the failed fan's wiring connectors from the serverboard.
4. Remove and retain the four pins securing the fan assembly to the fan tray.
5. Lift the assembly housing the failed fan from the fan tray and out of the chassis; see the “Remove/Replace Fan Assembly Example” figure.
6. Place the new fan into the vacant space in the fan tray, making sure the arrows on the top of the fan (indicating air direction) point in the same direction as the arrows on the other fans in the same fan tray. See Figure 6-8 on page 54 for a fan assembly example.
7. Reconnect the fan wires to the exact same chassis fan headers as the previous fan.
8. Reconnect the AC power cord, power up the system and check that the fan is working properly before replacing the chassis cover.
Figure 6-6 Cooling Fans Access Example
Figure 6-7 Remove/Replace Fan Assembly Example
Figure 6-8 Individual Fan Remove/Replace Example
Install/Replace a PCIe Expansion Card
Confirm that you have the correct PCIe card for your chassis and that the card includes a standard bracket. The following card type is supported in the server chassis:
• One low-profile PCIe 3.0 x8 card
Note: At time of publication, the rear x16 full-height PCIe slot (see
specific internal GPU option cards. Check with your SGI sales or service representative for the latest information on optional cards available for this slot.
Figure 6-9 Rear PCIe Low-profile and Optional Full-height Slot Locations (callouts: low-profile PCIe slot, full-height PCIe slot)
Install/Replace a Low-profile PCIe Card
Use the following steps and illustration to install or replace a PCIe card at the rear of the system:
1. Remove the chassis cover and disconnect both power cables from the server.
2. Confirm that you have the correct size and type of PCIe expansion card (low-profile).
3. Remove the screw securing the low-profile PCIe slot cover at the rear of the chassis and slide the cover sideways to remove it from the chassis.
4. Select the appropriate riser connector for your low-profile card. Note that the low-profile riser card uses a x16 connector.
5. Align the PCIe card with the rear slot opening and the riser connector, then slide the rear bracket into place as you insert the PCIe connector into the riser.
6. Secure the rear bracket in the slot with the screw removed in step 3 and connect cables to the add-on card as necessary. See Figure 6-10 on page 56 for an example.
7. Replace the system cover and plug in the power cords prior to rebooting the server.
Figure 6-10 Low-profile PCIe Card Remove/Replace Example
Appendix A
BIOS Error Codes
During Power-On Self-Test (POST) routines, which are performed each time the system is powered on, errors may occur.
Non-fatal errors are those which, in most cases, allow the system to continue the boot-up process. The error messages normally appear on the screen.
Fatal errors are those which will not allow the system to continue the boot-up procedure. If a fatal error occurs, you should consult with your system manufacturer for possible repairs.
These fatal errors are usually communicated through a series of audible beeps. The numbers on the fatal error list (see Table A-1) correspond to the number of beeps for the corresponding error.
Table A-1 BIOS Error Codes

Beep Code                     Error Message                            Description
1 beep                        Refresh                                  Circuits have been reset (ready to power up)
5 short beeps + 1 long beep   Memory error                             No memory detected in the system
5 short beeps                 Console input or output device missing   Console-In: USB or PS/2 keyboard, PCI or Serial Console Redirection, IPMI KVM or SOL
                                                                       Console-Out: Video Controller, PCI or Serial Console Redirection, IPMI SOL
1 continuous long beep        System overheat condition                System thermal limits exceeded
1 long beep + 8 short beeps   Video display error or video memory      Video error - adapter missing or with faulty memory
                              read/write error
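The beep-code table lends itself to a simple lookup. The following is a minimal sketch, not SGI-supplied software: the function name `lookup_beep_code` and the data layout are hypothetical, and the plain "1 beep" entry is assumed to be a single short beep.

```python
from typing import Optional, Tuple

# (short_beeps, long_beeps) -> (error message, description), transcribed from Table A-1.
# Assumption: the table's plain "1 beep" entry is stored here as one short beep.
_PATTERNS = {
    (1, 0): ("Refresh", "Circuits have been reset (ready to power up)"),
    (5, 1): ("Memory error", "No memory detected in the system"),
    (5, 0): ("Console input or output device missing",
             "See the Console-In and Console-Out device lists in Table A-1"),
    (8, 1): ("Video display error or video memory read/write error",
             "Video error - adapter missing or with faulty memory"),
}

# The continuous long beep has no beep count, so it is handled separately.
_CONTINUOUS = ("System overheat condition", "System thermal limits exceeded")


def lookup_beep_code(short: int = 0, long: int = 0,
                     continuous: bool = False) -> Optional[Tuple[str, str]]:
    """Return (error message, description) for a beep pattern, or None if unlisted."""
    if continuous:
        return _CONTINUOUS
    return _PATTERNS.get((short, long))


if __name__ == "__main__":
    # Example: five short beeps followed by one long beep.
    print(lookup_beep_code(short=5, long=1))
```

A pattern that is not in the table returns None, which is a prompt to contact your service representative rather than to guess at a cause.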
Appendix B
System Operating and Regulatory Overview
This appendix provides basic environmental operating requirements and regulatory information for the server.
Environmental Specifications
Table B-1 lists allowable ranges for temperature, humidity, and altitude for the server.
Table B-1 Temperature, Humidity, and Altitude Specifications

Attribute                     Specification

While Product Operating
Temperature                   Up to 1500m (5000ft): +5°C (41°F) to +30°C (86°F)
                              1525m (5000ft) to 3050m (10,000ft): Reduce max temperature (30°C) by 1°C per 305m (1000ft) of altitude above 1525m (5000ft).
Humidity                      20% to 80% Non-condensing
Rate of Change Constraints    Maximum: 10°C/hour (18°F/hour)
                              Maximum: 10% relative humidity/hour
Altitude                      3050m (10,000ft)

While Product Power Off
Temperature                   +5°C (41°F) to +45°C (113°F)
Humidity                      8% to 80% Non-condensing
Rate of Change Constraints    Maximum: 20°C/hour (36°F/hour)
Altitude                      3050m (10,000ft)

While Product Packaged for Shipping
Temperature                   -40°C (-40°F) to +60°C (140°F)
Humidity                      8% to 80% Non-condensing
Rate of Change Constraints    Maximum: 20°C/hour (36°F/hour)
Altitude                      12,200m (40,000ft)
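The altitude derating rule in Table B-1 can be worked through numerically. The sketch below is illustrative only: the function names are hypothetical, and it assumes the derating starts at 1525m (the table lists both 1500m and 1525m for the 5000ft boundary) and applies linearly between the listed altitudes.

```python
MIN_OPERATING_TEMP_C = 5.0      # lower operating limit from Table B-1
MAX_SEA_LEVEL_TEMP_C = 30.0     # upper operating limit before derating
DERATING_START_M = 1525.0       # assumed breakpoint (5000ft)
MAX_OPERATING_ALTITUDE_M = 3050.0
DERATING_STEP_M = 305.0         # 1 degree C is removed per 305m (1000ft)


def max_operating_temp_c(altitude_m: float) -> float:
    """Return the maximum allowed operating temperature at a given altitude."""
    if altitude_m > MAX_OPERATING_ALTITUDE_M:
        raise ValueError("altitude exceeds the 3050m operating limit")
    if altitude_m <= DERATING_START_M:
        return MAX_SEA_LEVEL_TEMP_C
    # Linear derating above 1525m (assumed to be continuous, not stepped).
    return MAX_SEA_LEVEL_TEMP_C - (altitude_m - DERATING_START_M) / DERATING_STEP_M


def is_operating_temp_ok(temp_c: float, altitude_m: float) -> bool:
    """Check an ambient temperature against the derated operating range."""
    return MIN_OPERATING_TEMP_C <= temp_c <= max_operating_temp_c(altitude_m)


if __name__ == "__main__":
    for altitude in (0.0, 1525.0, 1830.0, 3050.0):
        print(altitude, max_operating_temp_c(altitude))
```

Under these assumptions, a server installed at the 3050m limit has its maximum operating temperature reduced by 5°C, to 25°C.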
System Input Requirements
AC Input Voltage: 180-240 VAC
Rated Input Current: 1000W: 100-120V/12.9A, 1600W: 200-240V/9.5A
Rated Input Frequency: 50-60 Hz
Power Supply
Rated Output Power: 1600W (Redundant) Platinum rated
Rated Output Voltages: 1000W: +12V (82A), +12Vsb (2A)
1600W: +12V (132A), +12Vsb (2A)
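As a quick arithmetic check, the rated output power can be compared with the sum of volts times amps over the listed rails. This is a sketch only; the dictionary layout and function name are hypothetical, and the rail figures are copied from the ratings above.

```python
# Rated output rails as (volts, amps), copied from the Power Supply ratings above.
RATED_RAILS = {
    "1000W": [(12.0, 82.0), (12.0, 2.0)],    # +12V main rail, +12V standby rail
    "1600W": [(12.0, 132.0), (12.0, 2.0)],
}


def total_output_watts(rails):
    """Sum volts x amps over all rails of one rating."""
    return sum(volts * amps for volts, amps in rails)


if __name__ == "__main__":
    for rating, rails in RATED_RAILS.items():
        print(rating, total_output_watts(rails))
```

The sums come to 1008W and 1608W, each within 1% of the corresponding nominal rating.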
Regulatory Compliance
This product is for installation in a Restricted Access Location only, per clause 1.7.14 of IEC document 60950.
The SGI compliance number for this product is CMN1104-118-1
6
Electromagnetic Emissions: FCC Class A, EN 55022 Class A, EN 61000-3-2/-3-3, CISPR 22 Class A
Electromagnetic Immunity: EN 55024/CISPR 24, (EN 61000-4-2, EN 61000-4-3, EN 61000-4-4, EN 61000-4-5, EN 61000-4-6, EN 61000-4-8, EN 61000-4-11)
Safety: CSA/EN/IEC/UL 60950-1 Compliant, UL or CSA Listed (USA and Canada), CE Marking (Europe)
California Best Management Practices Regulations for Perchlorate Materials: This Perchlorate warning applies only to products containing CR (Manganese Dioxide) Lithium coin cells.
“Perchlorate Material-special handling may apply. See: www.dtsc.ca.gov/hazardouswaste/perchlorate”