Supermicro NVIDIA GPU Server Solutions
TRANSCRIPT
TAIPEI | SEP. 21-22, 2016
SUPERMICRO GPU SERVER SOLUTIONS WWW.SUNUCUDEPOSU.COM Benedict Khoo (FAE Manager, APAC Region), September 21st, 2016
We Keep IT Green™
Earth-friendly Solutions
ABOUT THE COMPANY…
INNOVATION
• Server Building Blocks methodology
• Application optimized
• First to market
ENERGY EFFICIENCY
• Excellence in thermal / cooling design
• Titanium power supply
• Perfection in green computing
OPEN PLATFORM
• True open platform
• Commoditization with innovation
Our DNA…
Supermicro® (NASDAQ:SMCI) is a global leader in high-performance,
high-efficiency server technology and innovation. We
develop and provide end-to-end green computing solutions to the
datacenter, cloud computing, enterprise IT, big data, HPC, and
embedded markets. Our solutions range from complete servers,
storage, blades, and workstations to full racks, networking devices,
server management software, and technology support and services.
We perform the majority of our R&D efforts in-house, which
increases communication and collaboration between design
teams, streamlines the development process, and reduces
time-to-market. We have developed a set of design principles that allow us
to aggregate individual industry-standard components and materials
to develop truly optimized server boards, chassis, power supplies,
networking, and storage devices. This building-block approach allows
us to provide a broad range of SKUs and enables us to build and
deliver application-optimized solutions based upon customers'
requirements.
KEY ADVANTAGES / FEATURES
Widest range of supported solutions (up to 7U)…
Highest-density solutions, supporting up to 10 GPGPUs per node
Designs that maximize performance per watt, per sq. ft., and per dollar…
Unique green computing architecture features…
Full-bandwidth capable for optimal I/O performance…
50+ SKUs offered…
GPU Solutions – HPC/Grid Optimized
✓ High efficiency power supplies at full capacity
✓ Excellent thermal design
✓ Non-blocking air-flow
✓ Greatest performance layout
✓ No re-driver required; no latency
GPU Innovation - Latency and Performance Optimized
2008 – GPGPU: where it started…
2009 – Hybrid Computing Pioneer
2011 – GPU Blades
2013 – GPU FatTwin™
2015 – 1U 4-GPU Optimized
2016 – Next Gen
Products shown: Tesla S1070; 1U Twin™; PCI-E x16; Integrated GPU Server; GPU Server & Workstation; 7U 10-blade, 20 GPUs; 1U 4-GPU Standalone Server
Callout: The fastest 1U server in the world
“The most comprehensive
product line in the Industry”
SUPERMICRO SUPERBLADE REVIEW HTTP://WWW.SERVETHEHOME.COM/SUPERMICRO-SUPERBLADE-GPU-SYSTEM-REVIEW-SBE-710Q-R90-CHASSIS/
Using the Supermicro GPU SuperBlade platform we quickly saw the benefits in terms of: higher density, higher power supply efficiency, easier maintenance, significantly reduced cabling, and easier upgrades/expansion.
We were impressed by how easy it was to use and manage the system.
3rd Party Server Reviews
Rating:
97%
“As we have said before, the case used for the 7048GR-TR Workstation is
simply the best in quality, craftsmanship, and features. … it is our go-to case
every time. The 7048GR-TR Workstation is designed for maximum uptime with
hot-swappable drives and cooling fans, and includes dual redundant power
supplies.”
— TweakTown
“Overall, for those looking to cram four GPU’s into a small 1U form factor for dense
compute or even VDI applications, the Supermicro 1028GQ-TRT is an excellent
solution. With 10Gbase-T networking, the server is easy to integrate into existing
datacenter infrastructure so long as the rack is able to handle higher-power rated
gear.”
“… we find the 4028GR-TR is a well designed system that has the ability to
handle high performance work loads. Moving to a large 4U server allows larger
capacity cooling systems to be installed that keep the system cool while running
extreme work loads. This is a trade off vs smaller 1U systems which have higher
density but operate at close to maximum heat load capacities.”
— ServeTheHome
Rating:
9.7
STAC-A2 Benchmarks
The STAC-A2 Benchmark suite is the industry
standard for testing technology stacks used for
compute-intensive analytic workloads involved
in pricing and risk management. In all, the
STAC-A2 specifications deliver nearly 200 test
results related to performance, scaling,
efficiency, and quality, which are detailed in this
report.
Test System: Supermicro SYS-1028GR-TR server
World Record Results
Fastest warm time to date in the baseline end-to-end
Greeks benchmark (GREEKS.TIME.WARM).
This was 1.27x the speed of the next-fastest
system (SUT ID: INTC150811).
Supermicro 1U GPU Solution at GSIC Center
https://www.supermicro.com.tw/products/nfo/Green500.cfm
- Ranked 1st on the World's Green500 List of Computer Systems
Optimized Portfolio with Highest Rack-level GPU Density
Best-in-class technology designed for highly parallel applications to deliver ultimate
performance, flexibility, and scalability
1018GR (2 GPUs): Cost Effective
• Single Haswell/Broadwell CPU
• 8 DDR4 DIMMs
• 6x 2.5” HS HDD bays
• 2 Double-Width GPUs
• 1 x8 PCIe 3.0 slot
• 1x 1400W Platinum PWS
1028GR (3 GPUs): Mainstream
• Dual Haswell/Broadwell CPUs
• 16 DDR4 DIMMs
• 4x 2.5” HS HDD bays
• 3 Double-Width GPUs
• 1 x8 PCIe 3.0 slot
• 2x 1600W Platinum PWS
1028GQ (4 GPUs): Parallel Optimized
• Dual Haswell/Broadwell CPUs
• 16 DDR4 DIMMs
• 4x 2.5” HS HDD bays
• 4 Double-Width GPUs
• Active/Passive GPUs
• 2 x8 PCIe 3.0 slots
• 2x 2000W Platinum PWS
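Rack-level density is usually limited by power as well as by space, which is the caveat the ServeTheHome review raises about higher-power gear. The sketch below estimates GPUs per rack for the three 1U models. It rests on assumptions that are not in the slides: a 42U rack, a hypothetical 20 kW rack power budget, and a worst-case draw per node equal to the rating of a single power supply. The function name `gpus_per_rack` is illustrative only.

```python
def gpus_per_rack(gpus_per_node, node_height_u, node_peak_watts,
                  rack_height_u=42, rack_power_watts=20_000):
    """Estimate GPUs per rack, limited by either rack space or rack power.

    rack_height_u and rack_power_watts are assumed values, not from the slides.
    node_peak_watts is taken as the rating of one power supply (an assumption).
    """
    nodes_by_space = rack_height_u // node_height_u
    nodes_by_power = rack_power_watts // node_peak_watts
    nodes = min(nodes_by_space, nodes_by_power)
    return nodes * gpus_per_node


if __name__ == "__main__":
    # (model, GPUs per node, height in U, single-PSU rating in watts)
    models = [
        ("1018GR", 2, 1, 1400),
        ("1028GR", 3, 1, 1600),
        ("1028GQ", 4, 1, 2000),
    ]
    for name, gpus, height_u, watts in models:
        print(name, gpus_per_rack(gpus, height_u, watts), "GPUs per rack")
```

Under these assumptions every model is power-limited rather than space-limited, so raising the rack power budget changes the result more than adding rack units would.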
Optimized Portfolio with Highest Node-level GPU Density
Best-in-class technology designed for highly parallel applications to deliver ultimate
performance, flexibility, and scalability
7048GR (4 GPUs): Mission Critical
• 4U Chassis
• Dual Haswell/Broadwell CPUs w/ IPMI
• 16 DDR4 DIMMs
• 8x 3.5” HS HDD bays
• 4 Double-Width GPUs
• x16/x8/x4 – 4/2/1**
• 2x 2000W Titanium PWS
2028GR (6 GPUs): Mainstream
• 2U Chassis
• Dual Haswell/Broadwell CPUs
• 16 DDR4 DIMMs
• 10x 2.5” HS HDD bays
• 6 Double-Width GPUs
• 1 x8 PCIe 3.0 slot
• 2x 2000W Platinum PWS
4028GR (8 GPUs): Parallel Optimized
• 4U Chassis
• Dual Haswell/Broadwell CPUs
• 24 DDR4 DIMMs
• 24x 2.5” HS HDD bays
• 8 Double-Width GPUs
• 2 x8 PCIe 3.0 slots; 1 x4 PCIe 2.0 slot
• 4x 1600W Platinum PWS
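The two portfolio slides can be compared on a common footing by dividing GPUs per node by chassis height. The sketch below does that with the GPU and CPU counts listed on the slides. The 1U height of the 1018GR, 1028GR, and 1028GQ is inferred from the "(1U)" ratios and review quotes elsewhere in this deck rather than stated on their spec slide, and the class and function names are illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GpuServer:
    model: str
    rack_units: int  # chassis height in U
    gpus: int        # double-width GPUs per node
    cpus: int        # CPU sockets per node

    @property
    def gpus_per_u(self) -> float:
        return self.gpus / self.rack_units


# Counts from the portfolio slides; 1U heights for the 10xx models are inferred.
PORTFOLIO = [
    GpuServer("1018GR", 1, 2, 1),
    GpuServer("1028GR", 1, 3, 2),
    GpuServer("1028GQ", 1, 4, 2),
    GpuServer("7048GR", 4, 4, 2),
    GpuServer("2028GR", 2, 6, 2),
    GpuServer("4028GR", 4, 8, 2),
]


def rank_by_density(servers):
    """Return servers ordered from most to fewest GPUs per rack unit."""
    return sorted(servers, key=lambda s: s.gpus_per_u, reverse=True)


if __name__ == "__main__":
    for s in rank_by_density(PORTFOLIO):
        print(f"{s.model}: {s.gpus_per_u:.1f} GPUs per U, GPU:CPU = {s.gpus}:{s.cpus}")
```

This metric ignores cooling headroom, which the ServeTheHome quote identifies as the trade-off between dense 1U systems and larger 4U systems.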
Widest Portfolios
[Chart: portfolio by form factor (TOWER, RACK, MULTI-NODE) and tier (GPU ENABLED, GPU OPTIMIZED), arranged toward HIGHER DENSITY. RATIO: GPU:CPU]
Ratios shown: 8:2 (4U); 4:2 (1U); 2:2 (7U / 10Node); 3:2; 4:2; 6:2 (2U); 3:2 (1U); 2:1 (1U); (4U/4Node); 6:2 (4U / 2Node); 3*:2 (4U); 3*:2 (WS); 1:1 (WS); 3:2 (2U); 4:2 (2U); 1:2 (1U); 1:2 (2U/2Node)
*Supports a maximum of 2 double-width GPUs
GPU Optimized Server Portfolio
THE LEADING SOLUTIONS (NEW)
New Generation High Performance Optimal Solutions…
CUSTOMER PAIN POINTS
• Machine Learning / AI applications have large datasets well beyond one single GPU.
• Latency is a major bottleneck, based on many 8x GPU designs
PROBLEM
• With constant communication, the QPI + PCIe is a major constraint. Symmetric PCIe design is NOT efficient for Machine Learning Applications.
SOLUTION
• Aggregate GPU resources to tackle large dataset computation, in conjunction with high speed connectivity to minimize latency
[Diagram: Generic ARCHITECTURE (QPI)]
MAXWELL/PASCAL READY
• Active/Passive GPU support
• Supports the latest Maxwell/Pascal GPUs
• Supports a 10-GPU configuration
Highest Density NVIDIA GPU Solution
The most flexible parallel computing solution in the market. Optimized for GPU peering, this
architecture enables faster Machine Learning training by up to […] with […] GPUs under a
single CPU root!
X10 SUPERMICRO ADVANTAGE
● PERFORMANCE: GPUs under single CPU root
● FLEXIBILITY: Supports up to 10x Active/Passive GPUs
● GPU RDMA: Direct internode GPU interconnect
● EFFICIENCY: Titanium-rated power supply
● DESIGN: No GPU preheating
ADVANTAGES
Single Root Complex Design for World Class Latency Optimized Solution
• A GPU compute unit on one root can train twice as fast and explore networks twice as large.
• Distributed training across eight GPUs allows scaling the size and speed of the networks by
another factor of two.
Super High Computing Capability
Highest Performance/Watt Capabilities
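To illustrate why a symmetric two-socket PCIe layout is contrasted with a single CPU root, the sketch below counts how many GPU pairs sit behind different CPUs and therefore have to cross the CPU-to-CPU (QPI) link when they exchange data. Both layouts are hypothetical simplifications (8 GPUs, no PCIe switch detail), and the function name `cross_socket_pairs` is illustrative, not part of any Supermicro or NVIDIA tool.

```python
from itertools import combinations


def cross_socket_pairs(gpu_to_cpu):
    """Return (pairs crossing the CPU-to-CPU link, total GPU pairs).

    gpu_to_cpu maps a GPU index to the CPU socket whose PCIe root it hangs off.
    """
    pairs = list(combinations(sorted(gpu_to_cpu), 2))
    crossing = [(a, b) for a, b in pairs if gpu_to_cpu[a] != gpu_to_cpu[b]]
    return len(crossing), len(pairs)


# Hypothetical symmetric layout: 8 GPUs split evenly across two CPU sockets.
symmetric = {gpu: gpu // 4 for gpu in range(8)}

# Hypothetical single-root layout: all 8 GPUs behind CPU 0.
single_root = {gpu: 0 for gpu in range(8)}

if __name__ == "__main__":
    print("symmetric:", cross_socket_pairs(symmetric))
    print("single root:", cross_socket_pairs(single_root))
```

In the symmetric layout more than half of the GPU pairs cross the inter-socket link, while the single-root layout has none. The count says nothing about actual latency, which depends on the PCIe switches and the workload.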
PASCAL GPU READY
• Performance: 10 TFLOPS FP32
• NVLink Advanced Technology
• 3D Memory: 2x memory bandwidth
Optimized Solution With NVIDIA Pascal GPU Architecture
Unparalleled 1U platform for the most highly parallel applications. No one else can do so much in
a 1U! Up to […] NVIDIA GPUs with Pascal architecture in […], supporting optimized GPU
RDMA
X10 SUPERMICRO ADVANTAGE
● PERFORMANCE: 8x Pascal with GPUs in 1U/4U
● NVLINK: 80GB/s high-bandwidth GPU interconnect
● RDMA FABRIC: 4x direct low-latency data access
● EFFICIENCY: Titanium-rated power supply
● DESIGN: No GPU preheating
ADVANTAGES
• All GPUs are capable of peer-to-peer direct access to all other GPUs’ memory, as well as
direct transfer (memcpy) operations via NVLink at high bandwidth
• High performance for collective communications
• PCIe bandwidth fully available for host and/or NIC communication during inter-GPU communication
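For a rough sense of scale, the sketch below compares the 80GB/s NVLink figure quoted on this slide with the theoretical peak of a PCIe 3.0 x16 link (8 GT/s per lane with 128b/130b encoding, per direction). The slide does not say whether 80GB/s is per direction or aggregate, so the ratio is indicative only; the 12 GB buffer size is an arbitrary example, and real transfers achieve less than these peak rates.

```python
PCIE3_GT_PER_S = 8.0        # PCIe 3.0 raw signalling rate per lane, in GT/s
PCIE3_ENCODING = 128 / 130  # 128b/130b line encoding efficiency
NVLINK_GB_S = 80.0          # figure quoted on the slide


def pcie3_bandwidth_gb_s(lanes=16):
    """Theoretical peak PCIe 3.0 bandwidth in GB/s, one direction."""
    return PCIE3_GT_PER_S * PCIE3_ENCODING * lanes / 8


def transfer_seconds(size_gb, bandwidth_gb_s):
    """Ideal time to move size_gb gigabytes at the given peak bandwidth."""
    return size_gb / bandwidth_gb_s


if __name__ == "__main__":
    pcie = pcie3_bandwidth_gb_s()
    size_gb = 12.0  # arbitrary example buffer size
    print(f"PCIe 3.0 x16 peak: {pcie:.2f} GB/s")
    print(f"NVLink / PCIe peak ratio: {NVLINK_GB_S / pcie:.1f}x")
    print(f"{size_gb} GB over PCIe:   {transfer_seconds(size_gb, pcie):.3f} s")
    print(f"{size_gb} GB over NVLink: {transfer_seconds(size_gb, NVLINK_GB_S):.3f} s")
```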
THANK YOU
For more information, please talk to our representatives
WWW.SUPERMICRO.COM/GPU