emc greenplum site preparation guide
TRANSCRIPT
EMC® Greenplum® Data Computing ApplianceSite Preparation Guide
P/N: 300-012-149Rev: A01
The Data Computing Division of EMC
Copyright © 2011 EMC Corporation. All rights reserved.
EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice.
THE INFORMATION IN THIS PUBLICATION IS PROVIDED “AS IS.” EMC CORPORATION MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.
For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com
All other trademarks used herein are the property of their respective owners.
Table of Contents 1
EMC Greenplum DCA Site Preparation Guide – Contents
EMC Greenplum DCA Site Preparation Guide - ContentsPreface ............................................................................................... 2
Chapter 1: About EMC Greenplum DCA...................................... 3Available DCA configurations............................................................ 3
GP10 (Quarter-Rack Configuration) ............................................ 5GP100 (Half-Rack Configuration) ................................................ 6GP1000 (One-Rack Configuration) .............................................. 7GP1000 +1 Scale-out Module (Two-Rack Configuration)............. 8
Component Specifications ................................................................ 8
Chapter 2: Preparing the Data Center Environment .............12Confirming Site Requirements.........................................................12
Floor Space Requirements .........................................................12Power and Cooling Requirements...............................................12Power Cord Specifications..........................................................13Enviromental Requirements.......................................................14Air Quality Requirements ...........................................................14
Optional Securing Brackets .............................................................15Anti-Tip Bracket.........................................................................16Anti-Move Bracket .....................................................................16Seismic Restraint Bracket ..........................................................17
Cabinet Positioning..........................................................................18Package Dimensions and Clearance.................................................19
Chapter 3: Gathering Site-Specific Information ....................20
Chapter 4: Next Steps ...................................................................22
2
EMC Greenplum DCA Site Preparation Guide – Preface
Preface
This guide is intended for EMC personnel, partners and customers to plan for requirements before an installation of a new EMC Greenplum Data Computing Appliance (DCA) into a data center. This guide provides an overview of the system, information on data center requirements, a checklist of items to gather for software configuration and links to relevant documentation for use in the next steps of deployment. The requirements listed in this document must be met prior to performing a DCA installation.
This guide contains the following chapters and appendices:
• Chapter 1, “About EMC Greenplum DCA”
• Chapter 2, “Preparing the Data Center Environment”
• Chapter 3, “Gathering Site-Specific Information”
• Chapter 4, “Next Steps”
Greenplum DCA Site Preparation Guide – Chapter 1: About EMC Greenplum DCA
1. About EMC Greenplum DCA
Greenplum Data Computing Appliance (DCA) is a self-contained data warehouse solution that integrates all of the database software, servers and switches necessary to perform big data analytics.
EMC Greenplum Data Computing Appliance (DCA) is a turn-key, easy to install data warehouse solution that provides extreme query and loading performance for analyzing large data sets. The EMC Greenplum DCA integrates Greenplum Database software with compute, storage and network components; delivered racked and ready for immediate data loading and query execution.
EMC Greenplum Data Computing Appliance runs the Greenplum Database relational database management system (RDBMS) software. Greenplum Database utilizes the DCA components to perform its database operations and processing.
See the following sections for a description of the DCA components and configurations.
• Available DCA configurations
• Component Specifications
Available DCA configurations
This section details the rack configurations currently available for the DCA. Note that in the Greenplum Database product and documentation, physical servers are referred to as hosts.
The GP10, GP100 and GP1000 have the same basic rack configuration, except for the number of segment hosts.
Table 1.1 DCA Components
DCA Component Quantity
Master Hosts 2 (one primary and one standby)
Segment Hosts GP10 (quarter-rack) = 4
GP100 (half-rack) = 8
GP1000 (one-rack) = 16
Interconnect Switches 2
Administration Switch 1
Available DCA configurations 3
Greenplum DCA Site Preparation Guide – Chapter 1: About EMC Greenplum DCA
A GP1000 scale-out module (two-rack configuration) contains all of the components of the GP1000 minus the master hosts.
Table 1.2 DCA GP1000 Scale-Out Module Components
DCA Component Quantity
Segment Hosts 16 per rack
Interconnect Switches 2 per rack
Administration Switch 1 per rack
Available DCA configurations 4
Greenplum DCA Site Preparation Guide – Chapter 1: About EMC Greenplum DCA
GP10 (Quarter-Rack Configuration)
Figure 1.1 GP10 quarter-rack configuration
Available DCA configurations 5
Greenplum DCA Site Preparation Guide – Chapter 1: About EMC Greenplum DCA
GP100 (Half-Rack Configuration)
Figure 1.2 GP100 half-rack configuration
Available DCA configurations 6
Greenplum DCA Site Preparation Guide – Chapter 1: About EMC Greenplum DCA
GP1000 (One-Rack Configuration)
Figure 1.3 GP1000 one-rack configuration
Available DCA configurations 7
Greenplum DCA Site Preparation Guide – Chapter 1: About EMC Greenplum DCA
GP1000 +1 Scale-out Module (Two-Rack Configuration)
Figure 1.4 GP1000 plus 1 scale-out module (two-rack configuration)
Component SpecificationsThis section explains the specifications of the various server and networking components of the DCA. Note that in the Greenplum Database product and
Component Specifications 8
Greenplum DCA Site Preparation Guide – Chapter 1: About EMC Greenplum DCA
documentation, physical servers are referred to as hosts.
Table 1.3 DCA Components
DCA Component Quantity
Master Hosts All Configurations = 2 (one primary and one standby)
Segment Hosts GP10 = 4
GP100 = 8
GP1000 = 16
GP1000+1 = 32
Interconnect Switches GP100/GP1000 = 2
GP1000+1 = 4
Administration Switch GP100/GP1000 = 1
GP1000+1 = 2
Master Host Specifications
The following diagram shows an example of how a Greenplum Database master host is configured in the DCA. DCA has two master hosts (the primary master and a standby master).
Figure 1.5 Greenplum Database Master Host Configuration on the DCA
Component Specifications 9
Greenplum DCA Site Preparation Guide – Chapter 1: About EMC Greenplum DCA
Table 1.4 Master Host Server Specifications
Hardware Specifications Quantity
Processor Intel X5680 3.33 GHz (6 core) 2
Memory DDR3 1333 MHz 48 GB
Dual-port Converged Network Adapter
2 x 10 Gbps 1
Quad-port Network Adapter 4 x 1 Gbps 1
RAID controller Dual channel 6 Gb/s SAS 1
Hard Disks 600 GB 10 K RPM SAS
(one RAID5 volume of 4+1 with 1 hot spare)
6
Segment Host Specifications
The following diagram shows an example of how a Greenplum Database segment host is configured in the DCA. Greenplum GP100 (half-rack) has 8 segment hosts. Greenplum GP1000 (full-rack) has 16 segment hosts. Each segment host serves 6 Greenplum Database primary segment instances and 6 mirror segment instances.
Figure 1.6 Greenplum Database Segment Host Configuration on the DCA
Component Specifications 10
Greenplum DCA Site Preparation Guide – Chapter 1: About EMC Greenplum DCA
Table 1.5 Segment Host Server Specifications
Hardware Specifications Quantity
Processor Intel X5670 2.93 GHz (6 core) 2
Memory DDR3 1333 MHz 48 GB
Dual-port Converged Network Adapter
2 x 10 Gbps 1
Dual-port Network Adapter 2 x 1 Gbps 1
RAID controller Dual channel 6 Gb/s SAS 1
Hard Disks 600 GB 15 K RPM SAS
(two RAID5 volumes of 5+1 disks)
12
Network Component Specifications
Hardware Specifications Quantity
Interconnect Switch 24-port Converged Enhanced Ethernet (CEE), Fibre Channel over Ethernet (FCoE)
8 Fibre Channel Ports (future use)
2
Admin Switch 24-port 1 Gb Ethernet Layer 3 1
11
Greenplum DCA Site Preparation Guide – Chapter 2: Preparing the Data Center Environment
2. Preparing the Data Center Environment
• Confirming Site Requirements
• Optional Securing Brackets
• Cabinet Positioning
• Package Dimensions and Clearance
Confirming Site RequirementsThe section summarizes the site requirements for the DCA.
• Floor Space Requirements
• Power and Cooling Requirements
• Power Cord Specifications
• Enviromental Requirements
• Air Quality Requirements
Floor Space Requirements
The following table describes the physical footprint of the DCA:
Table 2.1 DCA Physical Dimensions
Height Width Depth1
1. with door attached
Weight
GP10 (quarter-rack)75 in
190 cm
24 in
61 cm
41.6 in
104 cm
940 lbs
GP100 (half-rack)75 in
190 cm
24 in
61 cm
41.6 in
104 cm
1200 lbs
GP1000 (one-rack)75 in
190 cm
24 in
61 cm
41.6 in
104 cm
1700 lbs
GP1000+1(two-rack)75in
190 cm
48 in
122 cm
41.6 in
104 cm
3400 lbs
Power and Cooling Requirements
The following table describes the power and cooling requirements of the DCA:
Table 2.2 EMC Greenplum Data Computing Appliance Physical Dimensions
Total Power VA Power Connections Cooling (BTU/HR)
GP10 (quarter-rack) 2478 2 8450
Confirming Site Requirements 12
Greenplum DCA Site Preparation Guide – Chapter 2: Preparing the Data Center Environment
Power Cord Specifications
Table 2.3 Power Cord Specifications
Power Cord Connector
Country Power Cord Model Descriptions
USA, Japan DCA1-US-15 DCA - Single Phase, 30Amp, 15ft ext cords with L6-30P plug
DCA1-US-21 DCA - Single Phase, 30Amp, 21ft ext cords with L6-30P plug
Australia DCA1-ASTL-15 DCA - Single Phase, 30Amp, 15ft ext cords with CLIPSAL 56PA332 plug
DCA1-ASTL-21 DCA - Single Phase, 30Amp, 21ft ext cords with CLIPSAL 56PA332 plug
Other Countries
DCA1-IEC3-15 DCA - Single Phase, 30Amp, 15ft ext cords with IEC309-332P6 plug
DCA1-IEC3-21 DCA - Single Phase, 30Amp, 21ft ext cords with IEC309-332P6 plug
Other Power Cord Types
DCA1-RUS-15 DCA - Single Phase, 30Amp, 15ft ext cords with RUSSELLSTOLL 3750DP plug
DCA1-RUS-21 DCA - Single Phase, 30Amp, 21ft ext cords with RUSSELLSTOLL 3750DP plug
GP100 (half-rack) 3980 2 13600
GP1000 (one-rack) 6980 4 23800
GP1000+1(two-rack) 13960 8 47600
Table 2.2 EMC Greenplum Data Computing Appliance Physical Dimensions
Confirming Site Requirements 13
Greenplum DCA Site Preparation Guide – Chapter 2: Preparing the Data Center Environment
Enviromental Requirements
Table 2.4 Environmental Requirements
+15°C to +32°C (59°F to 89.6°F) site temperature
40% to 55% relative humidity
0 to 2439 meters (0 to 8,000 feet) above sea level operating altitude
Air Quality Requirements
EMC products are designed to be consistent with the requirements of the American Society of Heating, Refrigeration and Air Conditioning Engineers (ASHRAE) Environmental Standard Handbook and the most current revision of Thermal Guidelines for Data Processing Environments, Second Edition, ASHRAE 2009b.
The data center should maintain a cleanliness level as identified in ISO 14664-1, class 8 for particulate dust and pollution control. The air entering the data center should be filtered with a MERV 11 filter or better. The air within the data center should be continuously filtered with a MERV 8 or better filtration system. In addition, efforts should be maintained to prevent conductive particles, such as zinc whiskers, from entering the facility.
The allowable relative humidity level is 20 to 80% non condensing, however, the recommended operating environment range is 40 to 55%. For data centers with gaseous contamination, such as high sulfur content, lower temperatures and humidity are recommended to minimize the risk of hardware corrosion and degradation. In general, the humidity fluctuations within the data center should be minimized. It is also recommended that the data center be positively pressured and have air curtains on entry ways to prevent outside air contaminants and humidity from entering the facility.
For facilities below 40% relative humidity, it is recommended to use grounding straps when contacting the equipment to avoid the risk of Electrostatic discharge (ESD), which can harm electronic equipment.
As part of an ongoing monitoring process for the corrosiveness of the environment, it is recommended to place copper and silver coupons (per ISA 71.04-1985, Section 6.1 Reactivity), in airstreams representative of those in the data center. The monthly reactivity rate of the coupons should be less than 300 Angstroms. When monitored reactivity rate is exceeded, the coupon should be analyzed for material species and a corrective mitigation process put in place.
Confirming Site Requirements 14
Greenplum DCA Site Preparation Guide – Chapter 2: Preparing the Data Center Environment
This EMC® cabinet ventilates from front to back; you must provide adequate clearance to service and cool the system. Depending on component-specific connections within the cabinet, the available power cord length may be somewhat shorter than the 15-foot standard.
Figure 2.1 Access and Ventilation Requirements
Optional Securing BracketsIf you intend to secure the optional stabilizer brackets to your site floor, prepare the location for the mounting bolts. The additional brackets help to prevent the cabinet from tipping while you service cantilevered levels, or from rolling during minor seismic events. The brackets provide three levels of protection for stabilizing the unit.
• Anti-Tip Bracket
• Anti-Move Bracket
• Seismic Restraint Bracket
Optional Securing Brackets 15
Greenplum DCA Site Preparation Guide – Chapter 2: Preparing the Data Center Environment
Anti-Tip Bracket
Use this bracket to provide an extra measure of anti-tip security. One or two kits may be used. For cabinets with components that slide, EMC recommends that you use two kits.
Figure 2.2 Anti-Tip Bracket Placement
Anti-Move Bracket
Use this bracket to permanently fasten the unit to the floor.
Figure 2.3 Anti-Move Bracket Placement
Optional Securing Brackets 16
Greenplum DCA Site Preparation Guide – Chapter 2: Preparing the Data Center Environment
Seismic Restraint Bracket
Use this bracket to provide the highest protection from moving or tipping.
Figure 2.4 Seismic Restraint Bracket Placement
Optional Securing Brackets 17
Greenplum DCA Site Preparation Guide – Chapter 2: Preparing the Data Center Environment
Cabinet PositioningThe cabinet bottom includes four caster wheels. The front wheels are fixed; the two rear casters swivel in a 1.75-inch diameter. Swivel position of the caster wheels will determine the load-bearing points on your site floor, but does not affect the cabinet footprint. Once you have positioned, leveled, and stabilized the cabinet, the four leveling feet determine the final load-bearing points on your site floor.
Figure 2.5 Cabinet Positioning
When the cabinet is centered over two typical 24 in. (60.96 cm) by 24 in. (60.96 cm) floor tiles:
• Cutouts should be 8 in. (20.32 cm) by 6 in. (15.24 cm).
• Cutouts should be centered on the tiles, 9 in. (22.86 cm) from the front and rear and 8 in. (20.32 cm) from sides.
Cabinet Positioning 18
Greenplum DCA Site Preparation Guide – Chapter 2: Preparing the Data Center Environment
Package Dimensions and ClearanceMake certain your doorways and elevators are wide enough and tall enough to accommodate the shipping pallet and cabinet. Use a mechanical lift or pallet jack to position the packaged cabinet in its final location.
Figure 2.6 Door Clearance
Leave approximately 2.43 meters (8 feet) of clearance at the back of the cabinet to unload the unit and roll it off the pallet.
Figure 2.7 Unloading Clearance
Package Dimensions and Clearance 19
Greenplum DCA Site Preparation Guide – Chapter 3: Gathering Site-Specific Information
3. Gathering Site-Specific Information
In order to complete an installation of an EMC Greenplum DCA, the following information should be gathered from the customer’s network and database personnel:
Table 3.1 Site-Specific Information
Information Description
External IP and hostname of the primary master
This is the IP address and hostname that the customer will use to connect to the primary master host from their public LAN.
The master hostname is also used for client connections to Greenplum Database.
External IP and hostname of the standby master
This is the IP address and hostname that the customer will use to connect to the standby master host from their public LAN.
Netmask Netmask of the customer’s network.
Gateway Default gateway of the customer’s network and the IP address and interface name of the router.
NTP server IP The IP address or hostname of the customer’s preferred NTP (Network Time Protocol) server.
DNS name server IP The IP address of the customer’s DNS name server.
iDRAC password This is the password used for remote access to the master, standby master and segment hosts using the integrated Dell Remote Access Controller (iDRAC) interface. The default iDRAC password is calvin.
root password Customer supplied root password for the master, standby master and segment hosts. The default root password is changeme.
gpadmin password Customer supplied Greenplum Database superuser password. The default gpadmin password is changeme.
System locale The preferred locale to be used on the master, standby master and segment hosts. en_US.UTF-8 is the default locale for the Greenplum DCA (U.S. English and Unicode character set encoding).
A locale identifier consists of a language identifier and a region identifier, and optionally a character set encoding. For example, sv_SE is Swedish as spoken in Sweden, en_US is U.S. English, and fr_CA is French Canadian. If more than one character set encoding can be useful for a locale, then the specifications look like this: en_US.UTF-8 (locale specification and character set encoding).
System timezone The local timezone to be used on the master, standby master and segment hosts. The default timezone is PST.
20
Greenplum DCA Site Preparation Guide – Chapter 3: Gathering Site-Specific Information
Database character set encoding
UNICODE (UTF-8) is the default character set encoding for Greenplum Database (server-side encoding). This is usually the best choice, as it allows the customer to store all possible Unicode characters from any language, but if all data you are storing is from a single language (now and in the future), it does entail a slight storage space penalty compared to an encoding specific to that language.
If the space savings is key, the customer should consider Latin-1, Latin-9, or WIN1252 for US or Western European installations, since those encodings use a single byte per character. Likewise in Thailand you might consider WIN874 to store Thai, since it uses a single byte per character. However, keep in mind that this prevents storing any data outside those character sets. Even in the US or Western Europe, customers might find that some of their data is Latin-1, while some is Latin-9 or Win1252, so any choice of single-byte encoding will not accommodate all of their data needs. See the Greenplum Database Administrator Guide for a list of all supported character set encodings.
Software Tools Connection to the DCA for setup and management requires an SSH utility. EMC recommends Putty or Cygwin.
Hardware Tools The following hardware tools will be required during installation of the DCA:
• Utility Knife
• 9/16’’ Socket Wrench
• ESD (electro-static discharge) kit
Power Connection for Service Laptop
Power for external devices should not be drawn from the DCA cabinet. A power connection is required for the EMC personnel service laptop. The connection should be a standard AC 100-240V~1.5A, 50-60hz outlet.
Dial-home Connectivity The DCA supports dial-home for event notification to EMC Global Services support center. Communication from the DCA to EMC is done via FTPS. Firewall access should be setup to allow FTPS traffic from the DCA’s external IP address to the following EMC addresses:
corpusfep3.emc.comcorpusfep4.emc.com
Table 3.1 Site-Specific Information
Information Description
21
22
Greenplum DCA Site Preparation Guide – Next Steps:
4. Next Steps
The following documentation may be used during the next steps in implementing your Data Computing Appliance:
EMC Greenplum Data Computing Appliance Getting Started Guide
http://powerlink.emc.com/km/live1/en_US/Offering_Technical/Technical_Documentation/300-011-454.pdf
Greenplum Database 4.0 Administrator Guide
http://powerlink.emc.com/km/live1/en_US/Offering_Technical/Technical_Documentation/300-011-538.pdf
Greenplum Database 4.0 Release Notes
http://powerlink.emc.com/km/live1/en_US/Offering_Technical/Technical_Documentation/300-011-650.pdf