SGI® Rackable™ C1104-GP1 System User Guide


Add to my manuals
74 Pages

advertisement

SGI® Rackable™ C1104-GP1 System User Guide | Manualzz

SGI

®

Rackable

C1104-GP1

System User Guide

007-6364-001

COPYRIGHT

© 2014 Silicon Graphics International Corp. All rights reserved; provided portions may be copyright in third parties, as indicated elsewhere herein. No permission is granted to copy, distribute, or create derivative works from the contents of this electronic documentation in any manner, in whole or in part, without the prior written permission of SGI.

LIMITED RIGHTS LEGEND

The software described in this document is “commercial computer software” provided with restricted rights (except as to included open/free source) as specified in the FAR 52.227-19 and/or the DFAR 227.7202, or successive sections. Use beyond license provisions is a violation of worldwide intellectual property laws, treaties and conventions. This document is provided with limited rights as defined in 52.227-14.

The electronic (software) version of this document was developed at private expense; if acquired under an agreement with the USA government or any contractor thereto, it is acquired as “commercial computer software” subject to the provisions of its applicable license agreement, as specified in (a) 48 CFR

12.212 of the FAR; or, if acquired for Department of Defense units, (b) 48 CFR 227-7202 of the DoD FAR Supplement; or sections succeeding thereto.

Contractor/manufacturer is SGI, 46600 Landing Parkway, Fremont, CA 94538.

TRADEMARKS AND ATTRIBUTIONS

Silicon Graphics, SGI, the SGI logo, Rackable, and Supportfolio are trademarks or registered trademarks of Silicon Graphics International Corp. in the United

States and/or other countries worldwide.

Intel, Intel QuickPath Interconnect (QPI) and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries.

Fusion-MPT, Integrated RAID, MegaRAID, and LSI Logic are trademarks or registered trademarks of LSI Logic Corporation.

HyperTransport is a licensed trademark of the HyperTransport Technology Consortium.

InfiniBand is a registered trademark of the InfiniBand Trade Association.

Internet Explorer and MS-DOS are registered trademarks of Microsoft Corporation.

Java and Java Virtual Machine are trademarks or registered trademarks of Sun Microsystems, Inc.

Linux is a registered trademark of Linus Torvalds, used with permission by SGI.

Novell and Novell Netware are registered trademarks of Novell Inc.

PCIe and PCI-X are registered trademarks of PCI SIG.

Phoenix and PhoenixBIOS are registered trademarks of Phoenix Technologies Ltd.

Red Hat and all Red Hat-based trademarks are trademarks or registered trademarks of Red Hat, Inc. in the United States and other countries.

SUSE LINUX and the SUSE logo are registered trademarks of Novell, Inc.

UNIX is a registered trademark in the United States and other countries, licensed exclusively through X/Open Company, Ltd.

All other trademarks mentioned herein are the property of their respective owners.

Record of Revision

Version

001

Description

September, 2014

First release

007-6364-001 iii

Contents

1

Record of Revision . . . . . . . . . . . . . . . . . . . . . . . iii

About This Guide . . . . . . . . . . . . . . . . . . . . . . . ix

Audience. . . . . . . . . . . . . . . . . . . . . . . . . . ix

Chapter Descriptions . . . . . . . . . . . . . . . . . . . . . . ix

Related Publications . . . . . . . . . . . . . . . . . . . . . . . x

Conventions . . . . . . . . . . . . . . . . . . . . . . . . . xi

Product Support . . . . . . . . . . . . . . . . . . . . . . . . xi

Reader Comments . . . . . . . . . . . . . . . . . . . . . . . xii

Introduction . . . . . . . . . . . . . . . . . . . . . . . . 1

Server Board Features . . . . . . . . . . . . . . . . . . . . . . 3

Processors . . . . . . . . . . . . . . . . . . . . . . . . 3

QPI Interconnect . . . . . . . . . . . . . . . . . . . . . . 3

Memory . . . . . . . . . . . . . . . . . . . . . . . . . 3

Serial ATA and Optional SAS . . . . . . . . . . . . . . . . . . . 4

PCI Express Expansion Slots . . . . . . . . . . . . . . . . . . . 4

Onboard Controllers/Ports . . . . . . . . . . . . . . . . . . . . 4

Onboard Graphics Controller . . . . . . . . . . . . . . . . . . . 4

IPMI . . . . . . . . . . . . . . . . . . . . . . . . . 4

Other Features . . . . . . . . . . . . . . . . . . . . . . . 5

Server Chassis Features . . . . . . . . . . . . . . . . . . . . . . 5

System Power . . . . . . . . . . . . . . . . . . . . . . . 5

Serial ATA Subsystems . . . . . . . . . . . . . . . . . . . . 5

Front Control Panel . . . . . . . . . . . . . . . . . . . . . . 5

Serverboard and GPU Subsystem . . . . . . . . . . . . . . . . . . 6

GPU Features . . . . . . . . . . . . . . . . . . . . . . 6

Cooling System . . . . . . . . . . . . . . . . . . . . . . . 7

007-6364-001 v

Contents

vi

2

3

Server Installation . . . . . . . . . . . . . . . . . . . . . . . 9

Unpack the System . . . . . . . . . . . . . . . . . . . . . . . 9

Prepare for Setup . . . . . . . . . . . . . . . . . . . . . . 9

Choose a Setup Location . . . . . . . . . . . . . . . . . . . . 9

System Warnings and Precautions . . . . . . . . . . . . . . . . . . . 10

Server Precautions . . . . . . . . . . . . . . . . . . . . . . 11

Rack Mounting Considerations . . . . . . . . . . . . . . . . . . . . 11

Ambient Operating Temperature . . . . . . . . . . . . . . . . . . 11

Reduced Airflow . . . . . . . . . . . . . . . . . . . . . . 12

Mechanical Loading . . . . . . . . . . . . . . . . . . . . . . 12

Circuit Overloading . . . . . . . . . . . . . . . . . . . . . . 12

Reliable Ground . . . . . . . . . . . . . . . . . . . . . . . 12

Install the System into a Rack . . . . . . . . . . . . . . . . . . . . 12

Separate the Sections of the Rack Rails . . . . . . . . . . . . . . . . . 12

Inner Rail Extensions . . . . . . . . . . . . . . . . . . . . . 14

Installing the Inner Rail Extensions . . . . . . . . . . . . . . . . 14

Assembling the Outer Rails . . . . . . . . . . . . . . . . . . . . 15

Assembling the Outer Rails. . . . . . . . . . . . . . . . . . . 15

Attaching the Outer Rack Rails . . . . . . . . . . . . . . . . . . . 16

Using the Rail Locking Tabs . . . . . . . . . . . . . . . . . . 17

Install the Server in a Rack . . . . . . . . . . . . . . . . . . . . 18

Supply Power to the System . . . . . . . . . . . . . . . . . . 19

System Interface. . . . . . . . . . . . . . . . . . . . . . . . 21

Overview . . . . . . . . . . . . . . . . . . . . . . . . . . 21

Control Panel Buttons . . . . . . . . . . . . . . . . . . . . . . 21

Control Panel LEDs . . . . . . . . . . . . . . . . . . . . . . . 22

Power Fail LED . . . . . . . . . . . . . . . . . . . . . . . 23

Overheat/Fan Fail/UID LED . . . . . . . . . . . . . . . . . . . 23

NIC1 . . . . . . . . . . . . . . . . . . . . . . . . . . 23

NIC2 . . . . . . . . . . . . . . . . . . . . . . . . . . 24

HDD . . . . . . . . . . . . . . . . . . . . . . . . . . 24

Power . . . . . . . . . . . . . . . . . . . . . . . . . 24

Drive Carrier LEDs . . . . . . . . . . . . . . . . . . . . . . . 25

007-6364-001

Contents

4

5

6

System Safety . . . . . . . . . . . . . . . . . . . . . . . . 27

Electrical Safety Precautions . . . . . . . . . . . . . . . . . . . . 27

Serverboard Battery. . . . . . . . . . . . . . . . . . . . . . 28

ESD Precautions . . . . . . . . . . . . . . . . . . . . . . 28

Mainboard Replaceable Soldered-in Fuses . . . . . . . . . . . . . . . . 29

General Safety Precautions . . . . . . . . . . . . . . . . . . . . . 29

System and Serverboard Information . . . . . . . . . . . . . . . . . 31

Handling Circuit Boards and Drives . . . . . . . . . . . . . . . . . . 31

ESD Precautions . . . . . . . . . . . . . . . . . . . . . . 32

Unpacking . . . . . . . . . . . . . . . . . . . . . . . . 32

System Rear I/O Ports . . . . . . . . . . . . . . . . . . . . . . 33

Serverboard Details . . . . . . . . . . . . . . . . . . . . . . 33

CPUs . . . . . . . . . . . . . . . . . . . . . . . . 34

Memory . . . . . . . . . . . . . . . . . . . . . . . 34

GPUs . . . . . . . . . . . . . . . . . . . . . . . . 34

PCIe Expansion Slots . . . . . . . . . . . . . . . . . . . . 34

System Health Monitoring . . . . . . . . . . . . . . . . . . . 34

ACPI Features . . . . . . . . . . . . . . . . . . . . . . 35

Onboard I/O . . . . . . . . . . . . . . . . . . . . . . 35

Serverboard Dimensions . . . . . . . . . . . . . . . . . . . 35

Hard Disk Drives (C1104-GP1 Chassis) . . . . . . . . . . . . . . . . . 37

Drive Configurations . . . . . . . . . . . . . . . . . . . . . 37

PCIe Expansion Cards . . . . . . . . . . . . . . . . . . . . . . 38

Power Supply Functional Rating . . . . . . . . . . . . . . . . . . 38

Basic Troubleshooting and Chassis Service . . . . . . . . . . . . . . . . 39

Basic Troubleshooting Procedures . . . . . . . . . . . . . . . . . . . 39

If the System Does Not Power Up . . . . . . . . . . . . . . . . . . 39

System Powers Up But Will Not Boot . . . . . . . . . . . . . . . . . 40

No Video After System Power Up . . . . . . . . . . . . . . . . . . 40

Memory Errors . . . . . . . . . . . . . . . . . . . . . . . 40

Chassis Service Information. . . . . . . . . . . . . . . . . . . . . 41

Static-Sensitive Devices . . . . . . . . . . . . . . . . . . . . 41

007-6364-001 vii

Contents

A

B

Precautions . . . . . . . . . . . . . . . . . . . . . . . . 41

Unpacking . . . . . . . . . . . . . . . . . . . . . . . . 42

Control Panel . . . . . . . . . . . . . . . . . . . . . . . . . 42

Drive Bay Installation/Removal . . . . . . . . . . . . . . . . . . . . 42

Accessing the Drive Bays . . . . . . . . . . . . . . . . . . . . 42

Removing Hard Drives or Carriers from the Chassis . . . . . . . . . . . . . 43

The Hard Drive Backplane . . . . . . . . . . . . . . . . . . . 43

Disk Drive Installation . . . . . . . . . . . . . . . . . . . . . 43

Hard Drive Carrier Assembly Usage . . . . . . . . . . . . . . . . . 44

Power Supply. . . . . . . . . . . . . . . . . . . . . . . . . 47

Power Supply Failure . . . . . . . . . . . . . . . . . . . . . 47

Removing/Replacing a Power Supply . . . . . . . . . . . . . . . . . 48

Removing the Power Supply . . . . . . . . . . . . . . . . . . 48

Installing a New Power Supply . . . . . . . . . . . . . . . . . . 48

Accessing the Inside of the Chassis . . . . . . . . . . . . . . . . . . . 50

System Fans . . . . . . . . . . . . . . . . . . . . . . . . . 50

System Fan Failure . . . . . . . . . . . . . . . . . . . . . . 50

Replacing System Fans . . . . . . . . . . . . . . . . . . . . . 51

Remove/Replace a Fan . . . . . . . . . . . . . . . . . . . . 51

Install/Replace a PCIe Expansion Card . . . . . . . . . . . . . . . . . . 55

Install/Replace a Low-profile or Full-height PCIe Card . . . . . . . . . . . . 55

BIOS Error Codes . . . . . . . . . . . . . . . . . . . . . . . 57

System Operating and Regulatory Overview . . . . . . . . . . . . . . . . 59

Environmental Specifications . . . . . . . . . . . . . . . . . . . . 59

System Input Requirements . . . . . . . . . . . . . . . . . . . . . 60

Power Supply. . . . . . . . . . . . . . . . . . . . . . . . . 60

Regulatory Compliance . . . . . . . . . . . . . . . . . . . . . . 61

viii 007-6364-001

About This Guide

This guide provides an overview of the installation, architecture, general operation, and descriptions of the major components in the SGI ® Rackable ™ C1104-GP1 server. It also provides basic troubleshooting and maintenance information, BIOS error code information, and important safety and regulatory specifications.

Audience

This guide is written for users of SGI Rackable C1104-GP1 server systems. It is written with the assumption that the reader has a good working knowledge of computers and computer systems.

This guide may be useful to installers and system administrators looking for overview information on the server.

Chapter Descriptions

The following topics are covered in this guide:

• Chapter 1, “Introduction”

Provides an overview of the server’s components.

• Chapter 2, “Server Installation”

Provides a quick setup checklist to get the server operational.

• Chapter 3, “System Interface”

Describes several LEDs on the control panel as well as others on the SATA drive carriers that keep you constantly informed of the overall status of the system as well as the activity and health of specific components.

• Chapter 4, “System Safety”

Provides general system safety information.

• Chapter 5, “System Severboard Information”

007-6364-001 ix

Provides best practice procedures to work with a node board in the C1104-GP1 chassis, install memory DIMMs, PCIe expansion cards and hard disk drives.

• Chapter 6, “Basic Troubleshooting and Chassis Service”

Describes some basic steps required to troubleshoot your system. Additional sections in this chapter are intended to guide you through basic component remove and replace procedures.

• Appendix A, “BIOS Error Codes,”

Provides a brief listing of BIOS error code information.

• Appendix B, “System Specifications,”

Describes system component, environmental, and compliance specifications.

Related Publications

The following SGI documents may be relevant to the use of your server:

MegaRAID SAS Software User’s Guide, publication number, 860-0488-xxx

• SGI Foundation Software release notes

• SGI Performance Suite release notes

• SGI InfiniteStorage series documentation

• Man pages

You can obtain SGI documentation, release notes, or man pages in the following ways:

• Refer to the SGI Technical Publications Library at http://docs.sgi.com. Various formats are available. This library contains the most recent books and man pages.

• Refer to the SGI Supportfolio™ webpage for release notes and other documents whose

access require a support contract. See “Product Support” on page xi

.

x 007-6364-001

Conventions

The following conventions are used throughout this document:

Convention Meaning

Command

This fixed-space font denotes literal items such as commands, files, routines, path names, signals, messages, and programming language structures.

variable

user input

[ ]

...

man page

(x)

GUI element

The italic typeface denotes variable entries and words or concepts being defined. Italic typeface is also used for book titles.

This bold fixed-space font denotes literal items that the user enters in interactive sessions. Output is shown in nonbold, fixed-space font.

Brackets enclose optional portions of a command or directive line.

Ellipses indicate that a preceding element can be repeated.

Man page section identifiers appear in parentheses after man page names.

This font denotes the names of graphical user interface (GUI) elements such as windows, screens, dialog boxes, menus, toolbars, icons, buttons, boxes, fields, and lists.

Product Support

SGI provides a comprehensive product support and maintenance program for its products. SGI also offers services to implement and integrate Linux applications in your environment.

• Refer to http://www.sgi.com/support/

• If you are in North America, contact the Technical Assistance Center at

+1 800 800 4SGI or contact your authorized service provider.

• If you are outside North America, contact the SGI subsidiary or authorized distributor in your country.

:

007-6364-001 xi

Reader Comments

If you have comments about the technical accuracy, content, or organization of this document, contact SGI. Be sure to include the title and document number of the manual with your comments.

(Online, the document number is located in the front matter of the manual. In printed manuals, the document number is located at the bottom of each page.)

You can contact SGI in any of the following ways:

• Send e-mail to the following address: [email protected]

• Contact your customer service representative and ask that an incident be filed in the SGI incident tracking system: http://www.sgi.com/support/supportcenters.html

SGI values your comments and will respond to them promptly.

xii 007-6364-001

Chapter 1

1.

Introduction

007-6364-001

Important: SGI Rackable server systems may sometimes require driver versions that are not included in the original operating system release. When required, SGI provides these drivers on an SGI Driver CD, which may ship with the system, or on the system disk (pre-installed in the factory). For more information on this topic check with your sales or service representative.

The Rackable C1104-GP1 server is a 1U rackmount system (see

Figure 1-1 on page 2 )

.

In addition to the serverboard and chassis, various hardware components may be included with the system, as listed:

• Ten 4-cm chassis fans

• One internal air shroud

• Two passive 1U CPU heatsinks

• Riser cards as follows:

– One riser for a single PCIe x16 card, (left-front side internal GPU card)

– One riser for a single PCIe x16 card, (left-back side internal GPU card)

– One riser for a single PCIe x16 card, (right-front side internal GPU card)

– One riser for one low-profile PCIe 3.0 x8 card (external-facing rear card)

• Three power cables for GPU cards

• SATA accessories:

– One SAS/SATA backplane

– Four hot-swap disk drive carriers (RAID must be enabled for hot swap)

• Two power supplies

• One rackmount kit

1

1: Introduction

Tip: The Rackable C1104-GP1 server does not use an internal CD/DVD drive. Check with your

SGI sales or service representative for information on optional external CD/DVD drive units.

System

LEDs

Four disk drive bays

System reset

Main power

IPMI

LAN

PCIe low-profile expansion slot

Full-height PCIe slot

Figure 1-1

USB ports

LAN 1

LAN 2

VGA port

Rackable C1104-GP1 Server Front and Rear Views

Note: At time of publication, the rear x16 full-height PCIe slot (see

Figure 1-1 ) is used only for

specific internal GPU option cards. Check with your SGI sales or service representative for the latest information on optional cards available for this slot.

2 007-6364-001

Server Board Features

Server Board Features

At the heart of the system is a dual-processor serverboard based on the Intel C610 platform controller hub (PCH) chipset and designed to provide maximum performance. The main features of the serverboard are described in the following subsections.

Processors

The serverboard supports two multi-core Intel ® Xeon™ E5-2600(v3) Series processors. Each processor sits in an LGA 2011 socket and is interconnected via double Intel QuickPath

Interconnect (QPI) link support; see the next subsection for more information on the QPI interconnects. Four DDR4 memory channels are available per CPU socket with two DIMM slots per channel. A direct media interface connects the node’s PCH ASIC to processor 1; while 40-lane

Gen-3 PCIe interconnect lines link both processors directly to the motherboard LAN ports.

QPI Interconnect

Double QPI link pairs connect the two processors together on the motherboard, providing processor-controlled transfer bandwidths of up to 51.2 GB/second between the sockets.

Each QPI comprises two 20-lane point-to-point data links, one in each direction (full duplex), with a separate clock pair in each direction, for a total of 42 signals. Each signal is a differential pair, so the total number of pins is 84. The 20 data lanes are divided onto four “quadrants” of 5 lanes each. The basic unit of transfer is the 80-bit “flit”, which is transferred in two clock cycles (four

20 bit transfers, two per clock.) The 80-bit “flit” has 8 bits for error detection, 8 bits for “link-layer header” and 64 bits for “data”. QPI bandwidths are advertised by computing the transfer of 64 bits

(8 bytes) of data every two clock cycles in each direction

Memory

The serverboard has sixteen DIMM slots (eight per processor) that support DDR4

2133/1866/1600/1333 MHz RDIMMs. Up to two DIMMs per channel are supported.

Important: Use of two DIMMs per channel will limit the DIMMs to run at 1866 MHz maximum.

Note also that memory speed support is dependent on the type of CPUs used on the mother board.

007-6364-001 3

1: Introduction

Serial ATA and Optional SAS

The Intel PCH C610 is integrated into the system serverboard to provide an internal four-port

SATA disk subsystem. The four drive ports (0 through 3) are SATA 3.0 ports. The hot-swappable

SATA drives are connected to a backplane that provides power, bus termination and configuration settings. Optional RAID 0, 1 and 10 are supported. Note that your operating system must have

RAID support enabled to accommodate hot swapping of disk drives.

SAS RAID drive configurations require use of optional hardware ordered when the system is purchased. Check with your sales or service representative to obtain information on set-up procedures if your system did not come pre-configured with SATA or SAS RAID.

PCI Express Expansion Slots

The dual processor serverboard has three PCIe 3.0 x16 slots to support internal double-width GPU cards. An additional slot at the rear of the server supports one PCIe 3.0 x8 low-profile card.

See the section “PCIe Expansion Cards” in Chapter 5

for more information on these topics.

Onboard Controllers/Ports

The color-coded I/O ports include (an internal COM header located on the serverboard), VGA

(monitor) port, two external USB 3.0 ports and two RJ45 LAN Ethernet ports. A dedicated external IPMI LAN port is also included.

Onboard Graphics Controller

The dual-processor serverboard features an integrated Aspeed AST2400 video controller providing a DDR3 2D video graphics interface through the system VGA connector. The AST video controller in the 1U server features PCIe 1x support, advanced BMC features, low power consumption, high reliability and 1920 x 1200 60Hz display capability.

IPMI

IPMI (Intelligent Platform Management Interface) is a hardware-level interface specification that provides remote access, monitoring and administration for your SGI Rackable C1104-GP1 server

4 007-6364-001

Server Chassis Features platforms. IPMI allows server administrators to view a server’s hardware status remotely, receive an alarm automatically if a failure occurs, and power cycle a system that is non-responsive.

Other Features

Other on-board features that promote system health include on-board voltage monitors, a chassis intrusion header, auto-switching voltage regulators, chassis and CPU overheat sensors, virus protection and BIOS rescue.

Server Chassis Features

The following subsections provide a general outline of the main features of the SGI Rackable

C1104-GP1 server chassis.

System Power

The Rackable C1104-GP1 1U server chassis features a redundant power supply composed of two separate power modules. This power redundancy feature allows you to replace a failed power supply without shutting down the system. Note that each power supply provides up to 1600 Watts of power to the system.

Serial ATA Subsystems

The server chassis supports up to four 2.5-in SATA drives. Chassis drives are SATA 3.0

6-Gb/second slots. RAID 0, 1 and 10 drives are hot-swappable units and are connected to a backplane that provides power and control. Note that the operating system you have installed must support RAID to enable the hot-swap capability of RAID drives. Certain RAID levels may require use of optional hardware or software to support RAIDed hard disk drives in the server.

Front Control Panel

The control panel on the C1104-GP1 server provides you with system monitoring and control.

LEDs that indicate system power, HDD activity, network activity, system overheat and a system overheat/fan-fail/ UID LED. A main power button and a system reset button are also included.

007-6364-001 5

1: Introduction

Serverboard and GPU Subsystem

The C1104-GP1 server chassis is an ATX form factor chassis designed to be used in a 1U rackmount configuration. The serverboard’s I/O backplane supports up to three standard size

(double-width) GPUs to enable high-quality GPU computing solutions. A 15-pin VGA port, two

USB 3.0 ports, two Gigabit LAN ports and a dedicated RJ-45 IPMI LAN port are also supported.

The GPUs process complex image calculations and then route the data out through the VGA port on the serverboard. The GPUs, which come with a passive heatsink attached, have been tested for use with this system.

Important: Check with your SGI sales or service representative prior to using any GPU not sourced from the SGI factory.

Any combination of these cards (up to a total of three) may have come bundled with the system.

Power for the GPU cards is provided via a GPU power cable from each of the GPUs to JPW3, 4,

6 and 7 on the serverboard (one cable for each card).

Figure 1-2 on page 8

shows a general block diagram of the C1104-GP1 server’s processor and I/O chipset.

GPU Features

Each of the GPUs will feature some or all of the following:

• Hundreds of GPU cores in each card that can deliver up to 1.4 Teraflops of double-precision and up to 4.2 Teraflops of single-precision calculations.

• ECC protected internal register files, L1/L2 caches, shared memory, and external DRAM.

• Up to 12 GB of GDDR5 memory per GPU enhances performance and reduces data transfers by keeping larger data sets in local memory attached directly to the GPU.

• Integrates the GPU subsystem with the C1104-GP1 server’s monitoring and management capabilities such as IPMI.

• Onboard L1 and L2 caches that accelerate algorithms and sparse-matrix multiplication.

• Provides faster context switching, concurrent kernel execution and improved thread block scheduling.

• Enhances overall system performance by transferring data over the PCIe bus while the computing cores are processing other data.

• Provides a flexible programming environment with broad support for various programming languages and APIs.

6 007-6364-001

Server Chassis Features

Cooling System

The 1U server chassis has a cooling design that includes ten internal 4-cm counter-rotating Pulse

Width Modulated (PWM) system cooling fans located in the chassis. An air shroud channels the airflow from the system fans to efficiently cool the processor and GPU areas of the system. Each power supply module also includes an internal cooling fan. All chassis and power supply fans operate continuously.

007-6364-001 7

1: Introduction

8

#1-3

#1-2

#1-1

#1-5

#1-4

#1-7

#1-6

#1-8

VCCP0 12v

VR12.5

5 PHASE

145W

VCCP1 12v

VR12.5

5 PHASE

145W

Processor 1

DDR4

#3

#2

P0

#1

DMI2

P1

QPI

9.6G

P1

QPI

9.6G

Processor 2

P0

DDR4

#1 #2 #3 DMI2

PCI-E X16 G3

PCI-E X16 G3 (LAN REVERSE)

PVCCIO

(1.05/0.95) from 3.3v

PCI-EX16 G3

PCI-EX16 G3(LAN REVERSE)

PCI-EX8 G3(w/ Re-driver)

P5V_AUX

PX2V5_I3V3

PX1V2_I1V8

PX0V8_I1V0

PX0V67_I1V0

LAN

I350/X540

MAX:12.5W

STBY:2.5W

Sagevill: 9W

PCI-E X8

DMI2

4GB/s

P3V3_PCH

P1V5_PCH 3.3v

P1V05_PCH 5v

P1V05_STBY 3.3v STBY

#6/7/8

1.05 PCH

1.05 ASW

1.5 PCH

PVCCIO 1.0/0.95

#3 PCH

6.0 Gb/S

#2-2

#2-1

#2-5

#2-4

#2-3

#2-8

#2-7

#2-6

#2

RJ45

DDR3

BMC Boot Flash

LAN3

RTL8211E-VB-CG

SPI

RGRMII

P3V3_STBY

P1V5_AUX_BMC

P1V2_AUX_BMC

PCI-E X1 G2 3.3STBY:0.5A

BMC

AST2400

USB 2.0

<=1.758W (average)

2.3W (Peak)

SPI

BIOS

SPI

LPC

VGA CONN

COM

Header

#5

5V:1.2A

3.3V:0.1A

3.3 STBY:0.2A

#12 USB2.0

Idle:0.45W

TDP:6.5W (WORKSTATION)

5W (SERVER)

USB & SATA useage different

TPM HEADER

Debug Card

USB 3.0

SPI

BIOS

HEADER

SYSTEM POWER

Temp Sensor

EMC1402-1 *2 at diff SMBUS

FRONT PANEL

FAN SPEED

CTRL

Figure 1-2

Processor, Memory and I/O Chipset System Block Diagram

007-6364-001

Chapter 2

2.

Server Installation

This chapter provides a quick setup checklist to get the SGI Rackable C1104-GP1 operational. If your system came already mounted in a rack, you can skip the rack installation procedures.

Unpack the System

Inspect the shipping container that the C1104-GP1 was shipped in and note if it was damaged in any way. If the server shows damage, file a damage claim with the carrier who delivered it.

Decide on a suitable location for the rack that supports the weight, power requirements, and environmental requirements of the C1104-GP1 server. It should be situated in a clean, dust-free environment that is well ventilated. Avoid areas where heat, electrical noise, and electromagnetic

fields are generated. Place the server rack near a grounded power outlet. Refer also to “System

Warnings and Precautions” on page 10 .

Prepare for Setup

The shipping container should include two sets of rail assemblies, two rail mounting brackets and the mounting screws that you will use to install the system into a rack. Note that the inner rails should already be attached to the server. Read this section in its entirety before you begin the installation procedure.

Choose a Setup Location

Leave enough clearance in front of the rack to enable you to open the front door completely (~25 inches) and approximately 30 inches of clearance in the back of the rack to allow for sufficient airflow and ease in servicing. This clearance may vary depending on the type of rack and installation site chosen. This product is for installation only in an (IEC 60950) Restricted Access

Location - dedicated equipment rooms, service closets and labs, etc. See also, “Regulatory

Compliance” in Appendix B

.

007-6364-001 9

2: Server Installation

System Warnings and Precautions

!

Warning:

The SGI Rackable C1104-GP1 server weighs up to 37 lbs (16.8 kg). Always use proper lifting techniques when you move the server. Always get the assistance of another qualified person when you install the sever in a location above your shoulders. Failure to do so may result in serious personal injury or damage to the equipment.

!

Warning:

Extend the leveling jacks on the bottom of the rack to the floor with the full weight of the rack resting on them. Failure to do so can result in serious injury or death.

!

Warning:

Attach stabilizers to the rack in single rack installations. Failure to do so can result in serious injury or death. Couple racks together in multiple rack installations.

Failure to do so can result in serious injury or death.

!

Warning:

Be sure the rack is stable before extending a component from the rack. Failure to do so can result in serious injury or death.

!

Warning:

Extend only one rack component at a time. Extending two or more components simultaneously may cause the rack to tip over and result in serious injury or death.

Figure 2-1

Slide/Rail Equipment Usage Caution

10 007-6364-001

Rack Mounting Considerations

Server Precautions

• Review the electrical and general safety precautions.

• Determine the placement of each component in the rack before you install the rails.

• Install the heaviest server components in the bottom of the rack first, and then work up.

• Add a regulating uninterruptible power supply (UPS) to protect the server from power surges and voltage spikes and to keep your system operating in case of a power failure.

• Allow the hot-pluggable disk drives and power supply modules to cool before touching.

• Always keep the rack’s front door and all panels and components on the servers closed when not servicing to maintain proper cooling.

• The server is not considered suitable for visual display work place devices under the German government ordinance for work with visual display units.

Rack Mounting Considerations

Use the guidelines provided in the following subsections to properly install the server in a rack.

Ambient Operating Temperature

If installed in a closed or multi-unit rack assembly, the ambient operating temperature of the rack environment may be greater than the ambient temperature of the room. Therefore, consideration should be given to installing the equipment in an environment compatible with the manufacturer’s maximum rated ambient temperature (

30º C or 86º F)

.

Important: Certain system configurations using three NVIDIA GPUs and higher-wattage processors may require the maximum rated ambient temperature of the operational environment be lower than 30

º C (86º F).

Altitude of operation can also affect required ambient air temperatures,

see Table B-1 on page 59

. Check with your SGI sales or service representative for additional information on this topic.

007-6364-001 11

2: Server Installation

Reduced Airflow

Equipment should be mounted into a rack so that the amount of airflow required for safe operation is not compromised.

Mechanical Loading

Equipment should be mounted into a rack so that a hazardous condition does not arise due to uneven mechanical loading. Racks should generally be filled with equipment from the bottom up.

Circuit Overloading

Consideration should be given to the connection of the equipment to the power supply circuitry and the effect that any possible overloading of circuits might have on over-current protection and power supply wiring. Appropriate consideration of equipment nameplate ratings should be used when addressing this concern.

Reliable Ground

A reliable ground must be maintained at all times. To ensure this, the rack itself should be grounded. Particular attention should be given to power supply connections other than the direct connections to the branch circuit (for example, the use of power strips, and so on).

Install the System into a Rack

This section provides information on installing the C1104-GP1 into a rack. If the system has already been mounted into a rack, refer to

“Supply Power to the System” on page 19 . There are a

variety of rack units on the market, which may mean the assembly procedure will differ slightly.

You should also refer to the installation instructions that came with the rack unit you are using.

Note that this system’s rail kit is designed to fit a rack between 26-in and 33.5-in deep.

Separate the Sections of the Rack Rails

The chassis package includes two rail assemblies in the rack mounting kit.

12 007-6364-001

Install the System into a Rack

Each assembly consists of two sections: an inner fixed chassis rail that secures directly to the server chassis and an outer fixed rack rail that secures directly to the rack itself. Note that the inner rail may be pre-installed on the server by the SGI factory making separation steps unnecessary.

To separate the inner and outer rails, perform the following steps:

1.

Locate the rail assembly in the chassis packaging as shown in Figure 2-2 .

2.

Extend the rail assembly by pulling it outward.

3.

Press the quick-release tab

4.

Separate the inner rail from the outer rail assembly.

007-6364-001

Figure 2-2

Separating the System Rack Rail Components

13

2: Server Installation

Inner Rail Extensions

The chassis includes a set of inner rack rails in two sections:

• inner rails

• inner rail extensions

The inner rails are preattached to the server chassis and do not interfere with normal use of the system if you decide not to install it into a server rack. Attaching the inner rail extensions to the inner rails stabilizes the chassis within the rack is described in the following subsection.

Installing the Inner Rail Extensions

1.

Place the inner rail extensions over the preattached inner rails which are attached to the side of the chassis. Align the hooks of the inner rail with the rail extension holes. Make sure the extension faces “outward” just like the inner rail.

2.

Slide the extension toward the front of the chassis.

3.

Secure the chassis with screws as illustrated in

Figure 2-3 on page 15

.

4.

Repeat steps 1-3 for the other inner rail extension.

14 007-6364-001

Install the System into a Rack

Figure 2-3

Rail to Server Chassis Attachment Example

Assembling the Outer Rails

Each outer rail is in two sections that must be assembled before mounting on to the rack.

Assembling the Outer Rails

1.

Identify the left and right outer rails by examining the ends, which bend outward.

2.

Slide the front section of the outer rail into the rear section of the outer rail.

3.

The assembly should look similar to the example in Figure 2-4 on page 16

.

007-6364-001 15

2: Server Installation

Figure 2-4

Outer Rail Assembly Example

Attaching the Outer Rack Rails

Outer rails attach to the rack and hold the chassis in place. They extend between 26.5 and 36.4 inches.

1.

Measure the depth of the rack (distance from the front rail to the rear rail) to ensure it complies with the limitations listed.

2.

Adjust the outer rails to the proper length to fit within the rack. See the placement example in

Figure 2-5 on page 17 .

3.

Hang the hooks of the front of the outer rail onto the slots on the front of the rack. Use screws to secure the outer rails to the rack.

4.

Pull out and adjust both the short and long brackets to the proper distance so that the rail can

fit snugly into the rack, reference Figure 2-6 on page 18 .

5.

Hang the hooks of rear portion of the outer rail into the slots on the rear of the rack. Secure the long bracket to the rear side of the outer rail with the screws provided.

6.

Repeat the previous steps to properly install the left outer rail.

16 007-6364-001

Install the System into a Rack

Figure 2-5

Outer Rack Rail Assembly/Placement Example

Using the Rail Locking Tabs

Both chassis rails have a locking tab, which serves two functions:

• The tabs can lock the server into place when installed and pushed fully into the rack, (its normal operating position).

• The tabs also lock the server in place when fully extended from the rack. This prevents the server from coming completely out of the rack when pulled out for servicing. Depress both tabs at the same time to fully remove the server from its rail mounting and extract it from the rack.

007-6364-001 17

2: Server Installation

Install the Server in a Rack

!

Warning:

The SGI Rackable C1104-GP1 server weighs up to 37 lbs (16.8 kg) Always use proper lifting techniques when your move the server. Always get the assistance of another qualified person when you install the sever in a location above your shoulders. Failure to do so may result in serious personal injury or damage to the equipment.

1.

Extend the outer rails on either side of the rack rail assembly.

2.

Align the inner rails of the chassis with the outer rails on the rack, see Figure 2-6.

18

Figure 2-6

Installing the Server in the Rack

007-6364-001

Install the System into a Rack

3.

Slide the inner rails into the outer rails, keeping the pressure even on both sides. When the chassis has been pushed completely into the rack, it should click into the locked position.

4.

Optional screws are recommended to secure and hold the front of the chassis to the rack.

Supply Power to the System

Connect the power cords from the power supply modules into a power strip or power distribution unit (PDU) within the rack. An optionally available uninterruptible power supply (UPS) can ensure continued operation in case of a failure of the regular power source.

After all power connections are verified, push the power-on button on the front of the server when you wish to power on the unit.

007-6364-001 19

Chapter 3

3.

System Interface

Overview

There are a number of LEDs on the front control panel as well as others on the drive carriers and

power supplies to keep you constantly informed of the overall status of the system. See Figure 3-1

for an example of the front control panel. These LEDs provide constant information on the system and on the overall health of system components.

1 2

Figure 3-1

System Front Control Panel Indicator Components

Control Panel Buttons

In addition to monitoring the activity and health of specific components using LEDs, the system uses two buttons located on the front of the chassis: a reset button and a power on/off button. Use

the reset button to reboot the system as shown in Figure 3-2

.

007-6364-001 21

3: System Interface

Figure 3-2

System Reset Button

Figure 3-3 shows the main power button, which is used to apply or turn off the main system power.

Turning off system power with this button removes the main power but keeps standby power supplied to the system.

Figure 3-3

System Power On Button

Control Panel LEDs

The control panel located on the front of the chassis has several LEDs. These LEDs provide you with critical information related to different parts of the system. This section explains what each

LED indicates when illuminated or flashing and any corrective action you may need to take.

22 007-6364-001

Control Panel LEDs

Power Fail LED

The power fail LED indicates a power supply module has failed as shown in

Figure 3-4 . The

second power supply module will take the load and keep the system running but the failed module will need to be replaced. Refer to Chapter 6 for details on replacing the power supply. This LED should be off when the system is operating normally.

Figure 3-4

Power Fail LED

Overheat/Fan Fail/UID LED

When the red overheat/fan/UID LED flashes (shown in Figure 3-5

), it indicates a fan failure.

When on continuously it indicates an overheat condition, which may be caused by cables obstructing the airflow in the system or the ambient room temperature being too warm.

Check the routing of the cables and make sure all fans are present and operating normally. You should also check to make sure that the chassis covers and fan shrouds are installed properly. This

LED will remain flashing or on as long as the indicated condition exists.

The “blue light” function (UID) of this LED is used to identify a specific server in large racks filled with equipment. When activated through the system software the “blue light” will remain on until shut down by the administrator.

NIC1

Figure 3-5

Overheat/Fan Fail/UID LED

When flashing, the NIC1 LED indicates network activity on the LAN1 port (see Figure 3-6

).

007-6364-001 23

3: System Interface

NIC2

HDD

Power

24

1

Figure 3-6

LAN1 Network Activity NIC1 LED

When flashing, the NIC2 LED indicates network activity on the LAN2 port (see Figure 3-7

).

2

Figure 3-7

LAN2 Network Activity NIC2 LED

The HDD LED indicates hard drive activity when flashing (see

Figure 3-8 ).

Figure 3-8

Hard Drive Activity LED

The power LED indicates power is being supplied to the system's power supply unit(s). An

example LED is shown in Figure 3-9

. This LED should normally be illuminated when the system is operating.

007-6364-001

Drive Carrier LEDs

Figure 3-9

Power On LED

Drive Carrier LEDs

The system hard disk drives each have two LEDs, that function as listed in the following two paragraphs:

• Green: When illuminated, the green LED on the drive carrier indicates drive activity. A connection to the drive backplane enables this LED to blink on and off when that particular drive is being accessed. Please refer to Chapter 6 for instructions on replacing failed drives.

• Red: When this LED is flashing it indicates that a RAID drive is rebuilding. A solidly lit red

LED indicates a drive failure. If the drives fails, you should be notified by your system management software. Refer to Chapter 6 for instructions on replacing failed drives.

007-6364-001 25

Chapter 4

4.

System Safety

This chapter describes basic safety precautions when using the server.

Electrical Safety Precautions

Basic electrical safety precautions should be followed to protect yourself from harm and the

Rackable C1104-GP1 system from damage, as follows:

• Be aware of the locations of the power on/off switch on the chassis as well as the room's emergency power-off switch, disconnection switch or electrical outlet. If an electrical accident occurs, you can then quickly remove power from the system.

• Do not work alone when working with high voltage components.

• Power should always be disconnected from the system when removing or installing main system components, such as the memory modules and disk drives. When disconnecting power, you should first power down the operating system and then unplug the power cords.

The unit can have more than one power supply cord. Disconnect two power supply cords before servicing to avoid electrical shock.

• When working around exposed electrical circuits, another person who is familiar with the power-off controls should be nearby to switch off the power if necessary.

• Use only one hand when working with powered-on electrical equipment. This is to avoid making a complete circuit, which will cause electrical shock. Use extreme caution when using metal tools, which can easily damage any electrical components or circuit boards they come into contact with.

• Do not use mats designed to decrease static electrical discharge as protection from electrical shock. Instead, use rubber mats that have been specifically designed as electrical insulators.

• The power supply power cords must include a grounding plug and must be plugged into grounded electrical outlets or power distribution unit (PDUs).

007-6364-001 27

4: System Safety

Serverboard Battery

!

Caution: There is a danger of explosion if an onboard battery is installed upside down, which

will reverse its polarities (see Figure 4-1 ). This battery must be replaced only with the same

or an equivalent type recommended by the manufacturer. Check with your service representative if you have any questions.

Lithium battery

Battery holder

Figure 4-1

Installing the Onboard Battery

Important: Handle used batteries carefully and do not damage the battery in any way; a damaged battery may release hazardous materials into the environment. Do not discard a used battery in the garbage or a public landfill. Dispose of used batteries according to the manufacturer's instructions and in compliance with the regulations set up by your local hazardous waste management agency.

ESD Precautions

!

Caution: This server contains electronic components and printed circuit boards which are susceptible to electrostatic discharge (ESD) damage. ESD is generated by two objects with different electrical charges coming into contact with each other. An electrical discharge is created to neutralize this difference, which can damage electronic components and printed circuit boards.

The following measures are generally sufficient to neutralize this difference before contact is made to protect your equipment from ESD:

• Use a grounded wrist strap designed to prevent static discharge.

28 007-6364-001

General Safety Precautions

• Keep all components and printed circuit boards (PCBs) in their antistatic bags until ready for use.

• Touch a grounded metal object before removing the board from the antistatic bag.

• Do not let components or PCBs come into contact with your clothing, which may retain a charge even if you are wearing a wrist strap.

• Handle a board by its edges only; do not touch its components, peripheral chips, memory modules or contacts.

• When handling chips or modules, avoid touching their pins.

• Put the serverboard and peripherals back into their antistatic bags when not in use.

• For grounding purposes, make sure your computer chassis provides excellent conductivity between the power supply, the case, the mounting fasteners and the serverboard.

Mainboard Replaceable Soldered-in Fuses

Important: If your system comes with self-resetting PTC (Positive Temperature Coefficient) fuses on the serverboard, they must be replaced by trained service technicians only. The new fuse must be the same or equivalent as the one replaced. Contact your technical support organization for details and support.

General Safety Precautions

Follow these rules to ensure general safety:

• Keep the area around the Rackable C1104-GP1 system clean and free of clutter.

• The Rackable C1104-GP1 system weighs approximately 37 lbs (16.8 kg.) when fully loaded.

When lifting the system, two people at either end should lift slowly with their feet spread out to distribute the weight. Always keep your back straight and lift with your legs.

• Place the chassis top cover and any system components that have been removed away from the system or on a table so that they won't accidentally be stepped on.

• While working on the system, do not wear loose clothing such as neckties and unbuttoned shirt sleeves, which can come into contact with electrical circuits or be pulled into a cooling fan.

007-6364-001 29

4: System Safety

• Remove any jewelry or metal objects from your body, which are excellent metal conductors that can create short circuits and harm you if they come into contact with printed circuit boards or areas where power is present.

• After accessing the inside of the system, close the system back up and secure it to the rack unit with the retention screws after ensuring that all connections have been made.

30 007-6364-001

Chapter 5

5.

System and Serverboard Information

!

This chapter includes best practice procedures to work with a node board in the C1104-GP1 chassis and understand the system PCIe expansion cards and hard disk drives. Use the information in Chapter 6 to troubleshoot your server and add, remove, or replace system components.

A layout and quick reference chart is included in this chapter for your reference.

Some software products are protected with software license keys derived from the Media Access

Control (MAC) Ethernet address. If your system requires the replacement of a node board, the

MAC Ethernet address changes. If you are using such a product, you or your service representative must request a new license key after replacement of a node board. Contact your local customer support office: http://www.sgi.com/support/supportcenters.html

Caution: Install the chassis cover after you have completed accessing the components inside the server to maintain proper airflow and cooling for the system.

Handling Circuit Boards and Drives

!

Caution: Electrostatic discharge (ESD) can damage electrostatic-sensitive devices inside the

C1104-GP1 server. Use the ESD precautions described below when you handle printed circuit boards or other components in the system. The following measures are generally sufficient to protect your equipment from electro-static discharge.

007-6364-001 31

5: System and Serverboard Information

ESD Precautions

• Use a grounded wrist strap designed to prevent electrostatic discharge.

• Touch a grounded metal object before removing any board from its antistatic bag.

• Handle each printed circuit board (PCB) by the edges; do not touch the components, peripheral chips, memory modules, or gold contacts on the PCB.

• When handling chips or modules, avoid touching the pins.

• Store PCIe cards, or other boards and components in antistatic bags when not in use.

• Make sure your computer chassis provides a conductive path between the power supply, the case, the mounting fasteners, and the node board to chassis ground.

Unpacking

!

Caution: System options are shipped in anti-static packaging to avoid electrostatic discharge damage. Be sure to use ESD precautions when you unpack upgrade or replacement components for the C1104-GP1 server. Failure to do so can result in damage to the equipment.

32 007-6364-001

System Rear I/O Ports

System Rear I/O Ports

The rear external system I/O ports are color coded in conformance with the PC 99 specification.

See Figure 5-1

below for the colors and locations of the various I/O ports.

Table 5-1

identifies the functions of each of the I/O ports on the backpanel.

3

2

1

Figure 5-1

4 5

I/O Port Locations

6 7

Table 5-1

System Backpanel I/O Port Functions

1. USB port 0

2. USB port 1

3. Dedicated IPMI LAN port

4. LAN port 1

5. LAN port 2

6. VGA port

7. UID switch

Serverboard Details

The 1U C1104-GP1 system chassis has one node board. The C1104-GP1 serverboard is configured with two processors. When configured with two processors, the following rules apply:

• Both processor sockets must have identical revisions, core voltage, and bus/core speed.

• The stepping between the processors on the board must be identical.

• See

Figure 5-2 on page 36

for CPU locations on the serverboard - note that the drawing is not to scale.

007-6364-001 33

5: System and Serverboard Information

CPUs

• The C610 chipset is used on the system serverboard.

Memory

• Eight DIMM slots supporting 2133/1866/1600 MHz registered DDR4 ECC SDRAM.

Note: Check with your authorized sales/service representative for installation of approved

DIMM types.

GPUs

• A total of three GPUs are supported (true PCI-E 3.0 x16 signal) - GPU types are limited, check with your sales or support representative.

PCIe Expansion Slots

• An external PCI-Express (PCIe) slot with the following features:

– One PCIe Gen 3.0 x8 low-profile card (in x16 slot)

System Health Monitoring

• Onboard voltage monitors

• Fan status monitor with firmware/software on/off and speed control

• Watch Dog

• Environmental temperature monitoring via BIOS

• Power-up mode control for recovery from AC power loss

• System resource alert (via included utility program)

• Auto-switching voltage regulator for each CPU core

• CPU thermal trip support

• I2C temperature sensing logic

• Chassis intrusion detection

34 007-6364-001

System Rear I/O Ports

ACPI Features

• Slow blinking LED for suspend state indicator

• BIOS support for USB keyboard

• Wake-On-LAN (WOL)

• Internal/external modem ring-on

• Hardware BIOS Virus protection

Onboard I/O

• Four disk drive bays supported by an on-chip SATA controller (RAID 0, 1 and 10 are supported in this system)

• Two (2) USB (Universal Serial Bus 3.0) ports (rear [external] type A)

• Two (2) RJ-45 LAN ports supported by an on-board Intel® Ethernet controller

• One (1) dedicated (RJ-45) IPMI LAN port

• One (1) VGA port supported by an Aspeed 2400 graphics controller (with DDR3 memory)

Serverboard Dimensions

Proprietary board format is: 19.8" x 9.2" (503 mm x 234 mm).

007-6364-001 35

5: System and Serverboard Information

3I-SA

2S-SA

LE4

VGA

JWD1

BMC

LAN2 LAN1

USB4/5 (3.0)

IPMI_LAN

BIOS

LAN

CTRL

JOH1

JBT1

1

1

1

PCH

BT1

CLOSE 1st

CPU1

OPEN 1st

MAC CODE

BAR CODE

X10DRG-H

Rev. 1.01

JITP1

BIOS

LICENSE

CLOSE 1st

CPU2

OPEN 1st

1

JF1

P1 DIMMG1 P1 DIMMG2

Figure 5-2

FANB FANA FAN4 FAN3

Node Board Layout Example

FANG

FANF

36 007-6364-001

Hard Disk Drives (C1104-GP1 Chassis)

Hard Disk Drives (C1104-GP1 Chassis)

The 1U chassis supports a maximum of four 2.5-inch hard disk drives, see . Install the drives from

left to right starting in the lower-left bay. Disk drive bays must be populated with either a drive or a “drive blank” to maintain system thermals. Failure to follow this guideline may cause system overheating and thermal shutdown of the unit.

Important: The operating system you use must have RAID support to enable the hot-swap capability and RAID functions of the SATA drives.

System

LEDs

Four disk drive bays

System reset

Figure 5-3

Main power

C1104-GP1 System Disk Drive Locations

Drive Configurations

The disk drive configurations supported in the Rackable C1104-GP1 server are outlined in the paragraphs that follow. Note that some configurations are dependent on use of optional hardware to support RAID configurations.

The supported disk drive configurations are as follows:

• JBOD

This non-RAID disk array supports any number of drives between one and four. The operating system is placed on the disk drive in location 0 (system disk). All other drives are data drives.

• RAID 0

Disk striping without parity, supports any number of drives between two and four. Note that all drives must be the same type, speed and capacity. The operating system will be striped across all drives in the system. This configuration is not recommended.

007-6364-001 37

5: System and Serverboard Information

• RAID 1

Disk “mirroring”, supports exactly two drives. The two drives represent one RAID 1 logical drive. The operating system will be installed on the drives located in Drive positions 0 and 1.

Note that both drives 0 and 1 must be of the same type, speed and capacity.

• RAID 10

Mirrored disk striping, the data is striped across one set of drives and then mirrored on another set of drives. A minimum of four drives of the same type are required. The total number of drives must be an even number (4 in this case). A total of four drives is a 2+2 configuration. The operating system will be striped across the drives in the primary set and then mirrored on the secondary set of drives.

Note that all drives must be the same type, speed and capacity.

PCIe Expansion Cards

There are three internal double-width (GPU) PCIe 3.0 x16 expansion slots and one external PCIe slot available with the C1104-GP1 server. The external option slot functions as listed:

• External PCI-Express 3.0 x8 low-profile card (x16 slot)

Note: Only specific GPU cards will fit and function in the internal PCIe GPU slots, contact your

SGI sales or service representative for information on approved GPU cards.

Power Supply Functional Rating

The C1104-GP1 server default configuration is two rear-installed 1600-Watt power supplies. The second power supply acts as a redundant power unit for the server. The supplies are “auto-ranging” and can operate from either 100-140V or 180-240V levels at 50 or 60Hz.

Each power supply module has its own cooling fan.

The supplies used have a Platinum Certification rating.

38 007-6364-001

Chapter 6

6.

Basic Troubleshooting and Chassis Service

Use the procedures in the first half of this chapter to troubleshoot your system. If you follow all of the procedures and still need assistance, check with your authorized support organization.

The subsections in the second half of this chapter starting with “Chassis Service Information” on page 41 are intended to guide you through basic component remove and replace procedures.

Basic Troubleshooting Procedures

Use the information in the following subsections to remedy basic problems you might encounter when working with the SGI Rackable C1104-GP1 server.

If the System Does Not Power Up

If the system will not power up when the front power button is pushed, use the following checklist to identify common sources for the problem:

• Make sure that both ends of each system power cable are firmly connected to the power supplies and the corresponding power source(s) or power distribution unit (PDU).

• Check to see if the power fail LED is lit on the front of the unit. This LED should be off if the system is operating normally.

• Check that the LED on each power supply is properly lit. The power supply has one status

LED located on the left side of the front of the power supply. The LED has three states:

– Dark or off - indicates no AC power present

– Solid Amber - AC power is present, the server is not turned on (no DC power)

– Blinking Amber - supply temperature exceeds 63 o C (auto-shutdown occurs at 73 o C)

– Solid Green - AC power is present and the server is turned on (DC power present)

007-6364-001 39

6: Basic Troubleshooting and Chassis Service

• Open the system cover, remove the air shroud and check to make sure that no obvious short circuits exist between the serverboard and chassis.

System Powers Up But Will Not Boot

If the system powers up but will not boot the Operating System, check the following:

• Check the system order document(s) - the C1104-GP1 server may have been ordered with no operating system. If so, check with your system administrator for OS loading information.

• Check the system disk (drive 0) for drive activity and confirm that it is firmly seated in the disk bay. A red light on the front of the disk indicates a functional error. Check with your service provider or local system administrator.

No Video After System Power Up

If the system powers up and appears to be booting normally but no video is present, try the following basic solutions:

• Confirm your monitor is plugged in and switched on.

• Check all video cables and ensure they are properly connected.

• Listen for a BIOS “beep code” error message - one long beep plus 8 short beeps indicates a video error. This beep code message could indicate a video memory error or other video malfunction; contact your service provider.

• If using an optional PCIe video card check the back of the card for LED activity or a fault indicator. Try opening the system, reseating the PCIe card and rebooting; see the section

“Install/Replace a PCIe Expansion Card” on page 55

.

If you cannot get a video signal after trying basic solutions contact your support provider.

Memory Errors

If your system experiences memory related errors, try these basic troubleshooting steps to resolve or better identify the problem:

• Confirm that the power supply LED is not indicating an error.

• Listen for memory error beep codes - five short beeps followed by one long beep is a BIOS signal that no system memory has been detected - See

Appendix A, “BIOS Error Codes” .

40 007-6364-001

Chassis Service Information

• Shut the system down, remove the covers over the serverboard and make sure that all the

DIMM modules are properly and fully installed.

• You should be using registered ECC DDR4 memory. Also, it is recommended that you use the same memory type and speed for all DIMMs in the system.

• Contact your administrator or support provider if the memory errors continue.

Chassis Service Information

The following sections cover the steps required to install components and perform maintenance on the C1104-GP1 chassis. For component installation, follow the steps in the order given to eliminate the most common problems encountered. If some steps are unnecessary, skip ahead to the step that follows.

Important: Always disconnect the AC power cord(s) before adding, changing or installing any internal hardware components.

Tools Required: The only tool you will need to install components and perform maintenance is a

Phillips screwdriver.

Static-Sensitive Devices

Electrostatic discharge (ESD) can damage electronic components. To prevent damage to any printed circuit boards (PCBs), it is important to handle them very carefully. The following measures are generally sufficient to protect your equipment from ESD damage.

Precautions

• Use a grounded wrist strap designed to prevent static discharge.

• Touch a grounded metal object before removing any board from its antistatic bag.

• Handle a board by its edges only; do not touch its components, peripheral chips, memory modules or gold contacts.

• When handling chips or modules, avoid touching their pins.

007-6364-001 41

6: Basic Troubleshooting and Chassis Service

• Put the serverboard, add-on cards and peripherals back into their anti-static bags when not in use.

• For grounding purposes, make sure your computer chassis provides excellent conductivity between the power supply, the case, the mounting fasteners and the serverboard.

Unpacking

Replacement components are usually shipped in anti-static packaging to avoid static damage.

When unpacking an upgrade or replacement component, make sure the person handling it is static protected.

Control Panel

The control panel (located on the front of the chassis) must be connected to the JF1 connector on the serverboard to provide you with system status indications. A ribbon cable has bundled these wires together to simplify the connection. Connect the cable from JF1 on the serverboard to the

Control Panel PCB (printed circuit board). Make sure the red wire plugs into pin 1 on both connectors. Pull all excess cabling out of the airflow path. The LEDs inform you of system status.

See Chapter 3 for details on the LEDs and the control panel buttons.

Drive Bay Installation/Removal

This section describes hard drive installation and removal.

Accessing the Drive Bays

Drives: You do not need to access the inside of the chassis or remove power to replace or swap a

RAIDed hard disk drive. Data may be lost or corrupted if you “hot swap” a JBOD disk drive. Shut down system power before removing or replacing a JBOD disk. Removing either a RAID or

JBOD drive without replacing it may cause system errors. Proceed to the next section for further hard drive instructions.

Note: You must use approved 2.5" disk drives in the system.

42 007-6364-001

Drive Bay Installation/Removal

Removing Hard Drives or Carriers from the Chassis

1.

Press the release button on the drive carrier. This extends the drive carrier handle.

2.

Use the handle to pull the drive carrier out of the chassis.

Important: Empty carriers without drives must stay in the chassis during operation for proper airflow/cooling purposes except during remove/replace operations. Do not operate the server with carriers removed.

The Hard Drive Backplane

The hard drives plug into a backplane that provides power, drive ID and bus termination. A RAID controller and/or optional RAID software can be used with the backplane to provide data security.

The operating system you use must have RAID support to enable the hot-swap capability of the hard drives. The backplane is preconfigured, so no jumper/switch configuration is required.

!

Caution: Be careful when working around the drive backplane. Do not touch the backplane with your fingers or any metal objects and make sure no ribbon cables touch the backplane or obstruct the holes, which aid in proper airflow.

Disk Drive Installation

The drives are mounted in drive carriers (

Figure 6-1 ) to simplify their installation and removal

from the chassis, see Figure 6-2 on page 44 for a disk removal example. These carriers also help

promote proper airflow for the drives. For this reason, even empty carriers without hard drives installed must remain in the chassis during operation. See

Figure 6-3 on page 46

for an example drive carrier and the “dummy” drive blank used when a working disk is not installed in a drive slot.

Figure 6-1

Drive and Carrier Assembly Example

007-6364-001 43

6: Basic Troubleshooting and Chassis Service

Figure 6-2

Remove Drive and Carrier from Front of Server

Hard Drive Carrier Assembly Usage

1.

Remove the four screws securing the dummy/bad drive to the hard drive carrier.

2.

Insert a new/replacement hard drive into the carrier with the PCB side facing down and the connector end toward the rear of the carrier.

3.

Align the hard drive in the disk drive carrier so that the mounting holes of the carrier are aligned with the mounting holes of the drive. Note that there are holes in the carrier which are marked “SATA” to aid in correct installation.

4.

Secure the drive to the carrier with four screws. Use the M3 flat-head screws included in the

HDD bag of your accessory box. Note: the screws used to secure a dummy drive to the carrier should not be used to secure the hard drive.

44 007-6364-001

007-6364-001

Drive Bay Installation/Removal

5.

Insert the hard drive carrier assembly into its bay vertically, keeping the carrier oriented so that the release button is on the bottom. When the carrier reaches the rear of the drive bay, the handle will retract.

6.

Using your thumb, push against the upper part of the hard drive handle until the assembly

clicks into the locked (fully seated) position, see Figure 6-4 on page 47 for an example.

Note: Your operating system must have RAID support to enable the hot-plug capability of the drives.

!

Caution: Regardless of how many hard drives are installed, all drive carriers must remain in the drive bays to maintain proper airflow and system cooling.

45

6: Basic Troubleshooting and Chassis Service

46

Figure 6-3

Drive Carrier Attachment to Dummy Drive Blank Example

007-6364-001

Power Supply

Figure 6-4

Hard Disk Drive Installation Example

Power Supply

The system offers a redundant power supply assembly consisting of two 1600-Watt power modules. Each power supply module has an auto-switching capability, which enables it to automatically sense and operate at a 100V - 240V input voltage at 50 or 60Hz.

Power Supply Failure

If either of the two power supply modules fail, the other module will take the full load and allow the system to continue operation without interruption. The PWR Fail LED will illuminate and remain on until the failed unit has been replaced. The power supply units have a hot-swap capability, meaning you can replace the failed unit without powering down the system, see

Figure 6-5 on page 49 for an example.

007-6364-001 47

6: Basic Troubleshooting and Chassis Service

Removing/Replacing a Power Supply

You do not need to shut down the system to replace a failed power supply unit. The backup power supply module will keep the system up and running while you replace the failed unit. Replace with the same model.

Removing the Power Supply

1.

First unplug the AC power cord from the failed power supply module.

2.

Depress the locking tab on the power supply module.

3.

Pull it straight out using the rounded handle.

Installing a New Power Supply

1.

Replace the failed hot-swap unit with another identical power supply unit.

2.

Push the new power supply unit into the power bay until you hear a click.

3.

Secure the locking tab on the unit.

4.

Finish by plugging the AC power cord back into the unit.

48 007-6364-001

007-6364-001

Figure 6-5

Power Supply Remove/Replace Example

Power Supplies

Power Supply

49

6: Basic Troubleshooting and Chassis Service

Accessing the Inside of the Chassis

1.

Grasp the two handles on either side and pull the unit straight out until it locks (you will hear a “click”).

2.

Next, depress the two buttons on the top of the chassis to release the top cover and at the same time, push the cover away from you until it stops. You can then lift the top cover from the chassis to gain full access to the inside of the server.

Note: Normally you would power down the system before installing or removing internal components - but it may be necessary to leave system power on to determine which fan has failed.

System Fans

Ten 4-cm counter-rotating fans provide the cooling for the system. Each fan unit is actually made up of two fans joined back-to-back, which rotate in opposite directions. This counter-rotating action generates exceptional airflow and works to dampen vibration levels. It is very important that the chassis top cover is properly installed and making a good seal in order for the cooling air to circulate properly through the chassis and cool the components.

System Fan Failure

Fan speed is controlled by system temperature via a BIOS setting. If a fan fails, the remaining fans will ramp up to full speed and the overheat/fan fail LED on the control panel will flash. Replace any failed fan as soon as possible with the same type and model (the system can continue to run with a failed fan).

Your system administrator may be able to identify which fan has failed using the system BIOS.

If an administrator or service representative is not using the BIOS to determine which fan has failed, you can remove the top chassis cover while the system is still running to determine which of the fans has failed. After determining which is the failed fan, shut down and remove power from the system by unplugging the server’s cords. Never run the server for an extended period of time with the top cover open.

50 007-6364-001

System Fans

Replacing System Fans

This section describes how to remove or install a system fan.

Remove/Replace a Fan

1.

If you have not already done so, remove the chassis cover to access the fans, see the example in

Figure 6-6 on page 52

.

2.

Turn off the power to the system and unplug the AC power cord.

3.

Remove the failed fan's wiring connectors from the serverboard.

4.

Remove and retain the four pins securing the fan assembly to the fan tray.

5.

Lift the assembly housing the failed fan from the fan tray and out of the chassis, see the example in

Figure 6-7 on page 53 .

6.

Place the new fan into the vacant space in the fan tray, while making sure the arrows on the top of the fan (indicating air direction) point in the same direction as the arrows on the other fans in the same fan tray. See

Figure 6-8 on page 54 for a fan assembly example.

7.

Reconnect the fan wires to the exact same chassis fan headers as the previous fan.

8.

Reconnect the AC power cord, power up the system and check that the fan is working properly before replacing the chassis cover.

007-6364-001 51

6: Basic Troubleshooting and Chassis Service

52

Figure 6-6

Cooling Fans Access Example

007-6364-001

System Fans

007-6364-001

Figure 6-7

Remove/Replace Fan Assembly Example

53

6: Basic Troubleshooting and Chassis Service

54

Figure 6-8

Individual Fan Remove/Replace Example

007-6364-001

Install/Replace a PCIe Expansion Card

Install/Replace a PCIe Expansion Card

Confirm that you have the correct PCIe card for your chassis and the card includes a standard bracket. The following type cards are supported in the server chassis:

• One low-profile PCIe 3.0 x8 card

Note: At time of publication, the rear x16 full-height PCIe slot (see

Figure 6-9 ) is used only for

specific internal GPU option cards. Check with your SGI sales or service representative for the latest information on optional cards available for this slot.

Low-profile PCIe slot Full-height PCIe slot

Figure 6-9

Rear PCIe Low-profile and Optional Full-height Slot Locations

Install/Replace a Low-profile PCIe Card

Use the following steps and illustration to install or replace a PCIe card at the rear of the system:

1.

Remove the chassis cover and disconnect both the power cables from the server.

2.

Confirm that you have the correct size and type of PCIe expansion card (low-profile).

3.

Remove the screw securing the low-profile PCIe slot cover at the rear of the chassis and slide it sideways to remove from the chassis.

4.

Select the appropriate riser connector for your low-profile card. Note that the low-profile riser card uses a x16 connector.

5.

Align the PCIe card with the rear slot opening and the riser connector, then simultaneously slide the rear bracket into place as you insert the PCIe connector into the riser.

6.

Secure the rear bracket in the slot with the screw removed in step 3 and connect cables to the add-on card as necessary. See

Figure 6-10 on page 56 for an example.

7.

Replace the system cover and plug in the power cords prior to rebooting the server.

007-6364-001 55

6: Basic Troubleshooting and Chassis Service

56

Figure 6-10

Low-profile PCIe Card Remove/Replace Example

007-6364-001

Appendix A

A.

BIOS Error Codes

007-6364-001

During Power-On Self-Test (POST) routines, which are performed each time the system is powered on, errors may occur.

Non-fatal errors are those which, in most cases, allow the system to continue the boot-up process.

The error messages normally appear on the screen.

Fatal errors are those which will not allow the system to continue the boot-up procedure. If a fatal error occurs, you should consult with your system manufacturer for possible repairs.

These fatal errors are usually communicated through a series of audible beeps. The numbers on the fatal error list (see

Table A-1 ) correspond to the number of beeps for the corresponding error.

Table A-1

BIOS Error Codes

Beep Code Error Message

1 beep Refresh

5 short beeps + 1 long beep Memory error

Description

Circuits have been reset (Ready to power up)

No memory detected in the system

5 short beeps Console input or output device missing

Console-In: USB or PS/2 keyboard, PCI or

Serial Console Redirection, IPMI KVM or SOL

Console-Out: Video Controller, PCI or

Serial console Redirection, IPMI SOL

System overheat condition System thermal limits exceeded 1 continuous long beep

1 long beep +8 short beeps Video display error or video memory read/write error

Video error - adapter missing or with faulty memory

57

Appendix B

B.

System Operating and Regulatory Overview

This appendix provides basic environmental operating requirements and regulatory information for the server.

Environmental Specifications

Table B-1 lists allowable ranges for temperature, humidity, and altitude for the server.

Table B-1

Temperature, Humidity, and Altitude Specifications

Attribute Specification

While Product Operating

Temperature – Up to 1500m (5000ft)

+5

º

C (41

º

F) to +30

º

C (86

º

F)

– 1525m (5000ft) to 3050m (10,000ft)

Reduce max temperature (30

º

C) by 1

º

C per

305m (1000ft) of altitude above 1525m

(5000ft).

Humidity 20% to 80% Non-condensing

Rate of Change Constraints

Maximum: 10

º

C/hour (18

º

F/hour)

Maximum: 10% relative humidity/hour

Altitude 3050m (10,000ft)

While Product Power Off

Temperature

Humidity

+5

º

C (41

º

F) to +45

º

C (113

º

F)

8% to 80% Non-condensing

Altitude 3050m (10,000ft)

Maximum: 20

º

C/hour (36

º

F/hour)

007-6364-001 59

B: System Operating and Regulatory Overview

Table B-1

Temperature, Humidity, and Altitude Specifications (continued)

Rate of Change Constraints Attribute Specification

While Product Packaged for Shipping

Temperature

Humidity

-40

º

C (-40

º

F) to +60

º

C (140

º

F)

8% to 80% Non-condensing

Altitude 12,200m (40,000ft)

Maximum: 20

º

C/hour (36

º

F/hour)

System Input Requirements

AC Input Voltage: 180-240 VAC

Rated Input Current: 1000W: 100-120V/12.9A, 1600W: 200-240V/9.5A

Rated Input Frequency: 50-60 Hz

Power Supply

Rated Output Power: 1600W (Redundant) Platinum rated

Rated Output Voltages: 1000W: +12V (82A), +12Vsb (2A)

1600W: +12V (132A), +12Vsb (2A)

60 007-6364-001

Regulatory Compliance

Regulatory Compliance

This product is for installation in a Restricted Access Location only per clause 1.7.14 of IEC document 60950

The SGI compliance number for this product is CMN1104-118-1

6

Electromagnetic Emissions: FCC Class A, EN 55022 Class A, EN 61000-3-2/-3-3, CISPR 22

Class A

Electromagnetic Immunity: EN 55024/CISPR 24, (EN 61000-4-2, EN 61000-4-3, EN 61000-4-4,

EN 61000-4-5, EN 61000-4-6, EN 61000-4-8, EN 61000-4-11)

Safety: CSA/EN/IEC/UL 60950-1 Compliant, UL or CSA Listed (USA and Canada), CE Marking

(Europe)

California Best Management Practices Regulations for Perchlorate Materials: This Perchlorate warning applies only to products containing CR (Manganese Dioxide) Lithium coin cells.

“Perchlorate Material-special handling may apply. See: www.dtsc.ca.gov/hazardouswaste/perchlorate”

007-6364-001 61

advertisement

Was this manual useful for you? Yes No
Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Related manuals

advertisement

Table of contents