15th PC Cluster Symposium
Introduction to Storage Products and Data Solutions
Cray Japan Inc.
Safe Harbor Statement
This presentation may contain forward-looking statements that are
based on our current expectations. Forward-looking statements may
include statements about our financial guidance and expected operating
results, our opportunities and future potential, our product development
and new product introduction plans, our ability to expand and penetrate
our addressable markets and other statements that are not historical
facts. These statements are only predictions and actual results may
materially vary from those projected. Please refer to Cray's documents
filed with the SEC from time to time concerning factors that could affect
the Company and these forward-looking statements.
Cray Adaptive Supercomputing
Supercomputing and Big Data
Storage & Data Management, Computation, Analytics
Urika-XA, Urika-GD, XC40 system, CS400 system
Innovation (timeline figure)
Year markers: 1972, 1976-77, 1988, 1996-2K, 2004, 2007, 2011-12, 2012, 2013, 2014, 2015, 2016
Labels: Cray 1 Los Alamos; Cray Y-MP; NCAR; Cray to SGI; Cielo DMI; Lustre; Red Storm Trinity; Cray DVS & Cray DataWarp; Cray Sonexion; iVEC / Pawsey; NCSA Blue Waters; PGS
The Cray DataWarp technology in the Trinity
system will provide the first multi-petabyte
multi-terabyte/sec IO burst handling
capability, ever.
--Gary Grider, HPC Division Leader, LANL
Storage Trends: Convergence of Supercomputing and Big Data
Computing Memory and Storage Trends
Current Model
On Node: CPU, Memory (DRAM)
Off Node (External): Parallel Storage (HDD), Archive Storage (HDD & Tape)
Future Model
On Node: CPU, Near Memory (HBM/HMC), Far Memory (DRAM/NVDIMM)
Off Node (Internal/HSN): Near Storage (Flash)
Off Node (External): Parallel Storage (HDD), Archive Storage (HDD & Tape)
Cray Storage Solutions - Span Data Lifecycle
Sonexion Tier:
• Fast throughput
• Fully parallel access
• True scalability
TAS Tier:
• Best cost efficiency
• Data fully accessible
• Extensive scalability
Cray TAS Connector
Chart axes: Hours, Days, Weeks, Months, Years; Capacity (PB); Throughput (GB/sec)
Cray Storage Offerings
I/O Acceleration
• Pure performance
• Breakthrough efficiencies
• Balanced and cohesive design
Efficient Long-term Storage
• Protect and store
• Access data forever
• Easily sustain long-term storage
High Performance Storage
• Efficiently scale
• Innovate faster
• Be confident
Summary – Optimized End-to-End Storage Solutions
Application I/O Optimization
Parallel File Systems Leadership
Scalable Networking Experts
Best in Class Storage Systems
DataWarp, Sonexion, TAS
Aries, InfiniBand, 40GbE
Cray System Architectures
End-to-End Optimization
Performance Optimization
High Performance Flash Storage & I/O Acceleration
Cray DataWarp™
Performance
• Scales from 70 thousand to 40 million IOPS
• Accommodates a wide range of workloads
Efficiencies
• 5x the performance of disk at the same cost
• Offloads I/O-intensive workloads
Cohesive
• Flexible usage models
• Automated workflows
Bridges the performance gap between compute and storage
Flash Storage I/O Acceleration System for Cray XC40
DataWarp Benefits
● DataWarp absorbs spikes
● Reduces size of underlying PFS
● Provision PFS for sustained performance instead of peak
● Accommodates range of applications
● Improves efficiency
● Flash tier: 3-5x the performance of disk at the same cost
● Machine efficiency improved
● Better use of compute resources
Figure callouts: "Spikes drive up cost of storage"; "DataWarp absorbs spikes & reduces size of PFS"
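The idea of provisioning the PFS for sustained rather than peak performance can be illustrated with a small sketch. The demand trace, the bandwidth figures, and the function name `required_burst_capacity` are hypothetical and chosen only for illustration; they are not Cray measurements or a Cray sizing method.

```python
def required_burst_capacity(demand_gbs, pfs_gbs, interval_s=1.0):
    """Return the peak backlog (GB) a flash tier would have to hold when the
    PFS drains at a fixed rate and application demand arrives in bursts."""
    backlog = 0.0
    peak = 0.0
    for d in demand_gbs:
        # Data arriving faster than the PFS can take it accumulates in flash;
        # in quiet intervals the backlog drains down to the PFS.
        backlog = max(0.0, backlog + (d - pfs_gbs) * interval_s)
        peak = max(peak, backlog)
    return peak

# Hypothetical I/O demand trace in GB/s, one sample per second.
demand = [10, 10, 200, 200, 10, 10, 10, 10]
sustained = sum(demand) / len(demand)  # average demand: 57.5 GB/s
peak_demand = max(demand)              # spike: 200 GB/s

# PFS sized for the spike: no flash backlog, but the PFS is far larger.
print(required_burst_capacity(demand, pfs_gbs=peak_demand))
# PFS sized for the average: the flash tier has to absorb the spike.
print(required_burst_capacity(demand, pfs_gbs=sustained))
```

With this made-up trace, a PFS sized for the 57.5 GB/s average needs a flash tier able to hold a 285 GB backlog, while a PFS sized for the 200 GB/s spike needs none.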
Use Case: Local Storage on Demand
Per Node Scratch (/tmp)
• Each compute node in a job is assigned a private part of the allocated SSD space
• Much faster than “faking it” with a parallel file system
Per Node Swap Space
• Dynamic compute node swap space
Use Case: Shared Fast /ssd
Shared Fast Scratch
• High-bandwidth access to shared files
• Files can be striped across multiple DataWarp nodes
• Space can be temporary for the job, or be marked as persistent to work between jobs
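Striping a file across several DataWarp nodes can be pictured with a generic round-robin layout. This is a minimal sketch of ordinary striping arithmetic, not a description of DataWarp's actual on-SSD layout; the stripe size, node count, and function name are assumptions made for illustration.

```python
def stripe_location(offset, stripe_size, num_nodes):
    """Map a byte offset in a file to (node index, offset within that node's
    share of the file) under simple round-robin striping."""
    stripe_index = offset // stripe_size
    node = stripe_index % num_nodes
    # Each node stores every num_nodes-th stripe back to back.
    local_offset = (stripe_index // num_nodes) * stripe_size + offset % stripe_size
    return node, local_offset

STRIPE = 8 * 1024 * 1024  # hypothetical 8 MiB stripe size
NODES = 4                 # hypothetical number of DataWarp nodes

for off in (0, STRIPE, 4 * STRIPE + 100):
    print(off, stripe_location(off, STRIPE, NODES))
```

Consecutive stripes land on different nodes, so a large sequential transfer is spread over all of them, which is where the aggregate bandwidth comes from.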
Use Case: Checkpoint / Restart
Fast Checkpoint / Restart (SSD burst)
• User asks for enough SSD to cover the number of concurrently resident checkpoints
• High-bandwidth checkpoints are written to SSDs
• Followed by an asynchronous explicit or transparent copy out to rotating storage
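The sizing rule in this use case (enough SSD for the number of concurrently resident checkpoints) reduces to simple arithmetic. The sketch below uses hypothetical figures and helper names; it also checks, as an added assumption, that the asynchronous copy-out of one checkpoint finishes before the next one is written.

```python
def ssd_capacity_needed(checkpoint_tb, resident_checkpoints):
    """SSD space (TB) needed to hold the requested number of checkpoints."""
    return checkpoint_tb * resident_checkpoints

def drain_keeps_up(checkpoint_tb, pfs_tb_per_s, interval_s):
    """True if copying one checkpoint out to rotating storage takes no longer
    than the time between checkpoints (assumed condition, not from the slides)."""
    return checkpoint_tb / pfs_tb_per_s <= interval_s

# Hypothetical job: 50 TB per checkpoint, two checkpoints kept on SSD.
checkpoint_tb = 50
resident = 2

print(ssd_capacity_needed(checkpoint_tb, resident))
print(drain_keeps_up(checkpoint_tb, pfs_tb_per_s=0.1, interval_s=3600))
```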
Automated Coordination Across Storage Tiers
Diagram labels: Scheduler Directives; Cray HPC DataWarp; Data: Loosely Coupled; Data: Tightly Coupled; Cray Sonexion Storage, Lustre Parallel File System; Cray TAS Archive Storage (HDD Pool, Tape Pool); Persistent Namespace; Automated Policies
High Performance Primary Storage
Cray Sonexion™
Innovate Faster
• Deploy faster
• Achieve results faster
Scale Efficiently
• Sustained performance over 1 TB/s
• 25% fewer components
• Over 3 PB usable in one 42U rack
Be Confident
• Proven Cray architectures
• Single point of support
Optimize performance and capacity at maximum density
Efficient, proven, scale-out Lustre system for any Linux HPC environment
Efficiency Comparison – 1 TB/s Example
Deployment Comparison
                      Modular Storage         Cray Sonexion        Monolithic Storage
                      with External Servers   (NCSA Blue Waters)   with External Servers
Capacity              22 PBytes               22 PBytes            32 PBytes
Bandwidth             1 TByte/sec.            1 TByte/sec.         1 TByte/sec.
LNET routers          942                     482                  440
Storage units         472                     180                  360
Hard drives           28,320                  15,120               20,160
External servers      942                     0                    294
Director IB switches  6                       0                    2
IB cables             5,468                   482                  1,512
Racks                 94                      30                   40
Cost                  $$$                     $                    $
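The component counts in the Deployment Comparison can be turned into percentage reductions with a few lines of arithmetic. The figures below are copied from the comparison; the dictionary layout and the helper name `reduction_pct` are illustrative choices only.

```python
# Figures copied from the Deployment Comparison (1 TByte/sec example).
deployments = {
    "modular":    {"racks": 94, "hard_drives": 28320, "ib_cables": 5468},
    "sonexion":   {"racks": 30, "hard_drives": 15120, "ib_cables": 482},
    "monolithic": {"racks": 40, "hard_drives": 20160, "ib_cables": 1512},
}

def reduction_pct(baseline, value):
    """Percent reduction of value relative to baseline."""
    return 100.0 * (baseline - value) / baseline

for other in ("modular", "monolithic"):
    for item in ("racks", "hard_drives", "ib_cables"):
        pct = reduction_pct(deployments[other][item], deployments["sonexion"][item])
        print(f"Sonexion vs {other}: {pct:.0f}% fewer {item}")
```

On these figures the Sonexion column has 25% fewer racks and 25% fewer hard drives than the monolithic column. That is consistent with the "25% fewer components" bullet on the Sonexion slide, although the slides do not say how that bullet was derived.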
Simplify Management
Tiered Data Management
Cray Tiered Adaptive Storage (TAS)
Efficient storage management with tiers
Protect
• Preserve assets efficiently
• Transparently migrate data
Access
• Continuously accessible
• Flexible access models
Sustain
• Break vendor lock-in
• Multigenerational data preservation
Protect and access your high-value data when you need it, for as long as you need it
Traditional HSM for HPC – Complex
Diagram labels: IB Fabric; fs1, fs2, fs3; QDR, FDR, FC, Ethernet; DM (x6), Lustre Movers; Ethernet; HSM (x6), HSM Movers; Disk Cache; Archive Media (x6); Data Ingest
Simplified Data Management for Big Data and HPC
Diagram labels: IB or FC Fabric; fs1, fs2, fs3; QDR, FDR, FC, Ethernet; Data Movement and Transparent User Access; Shared Virtualized Storage Pool
Common Access Protocols: Lustre, NFS, SMB, HTTP, FTP
Protect Data over its Lifespan
● Transparent tiering for users
  ● Data always accessible regardless of tier
  ● File system appears infinitely large
  ● Files always visible from the file system
● Automated data management
  ● Policy-based data management
  ● 24x7 data management
  ● Multiple copies and disaster recovery
● Works with any Lustre 2.5
  ● Cray TAS Connector for Lustre HSM
Diagram labels: Users and Applications; Lustre File System; Cray TAS Connector; File System; Policy Engine; Tier 1, Tier 2, Tier 3, Tier 4
Policy-based Data Movement
● Familiar Actions & Policies
  ● Transparently Archive from disk cache to archive media
  ● Manage disk space or Release archived files from disk
  ● Automatically Stage released files back when accessed
Diagram labels: Archive, Release, Stage
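The three actions on this slide (Archive, Release, Stage) behave like a small state machine over where a file's data lives. The sketch below is a toy model for illustration only: the state names, the `TRANSITIONS` table, and the helper functions are assumptions and do not reflect the TAS or Lustre HSM implementation.

```python
from enum import Enum

class State(Enum):
    ONLINE = "on disk only"
    ARCHIVED = "on disk and on archive media"
    RELEASED = "on archive media only"

# Allowed transitions for the three actions named on the slide.
TRANSITIONS = {
    ("archive", State.ONLINE): State.ARCHIVED,    # copy to archive media
    ("release", State.ARCHIVED): State.RELEASED,  # free the disk copy
    ("stage", State.RELEASED): State.ARCHIVED,    # bring the data back to disk
}

def apply(action, state):
    """Return the new state, or raise if the action is not valid here."""
    try:
        return TRANSITIONS[(action, state)]
    except KeyError:
        raise ValueError(f"cannot {action} a file that is {state.value}") from None

def read(state):
    """Accessing a released file triggers an automatic stage first."""
    return apply("stage", state) if state is State.RELEASED else state

state = State.ONLINE
for action in ("archive", "release"):
    state = apply(action, state)
    print(action, "->", state.name)
print("read ->", read(state).name)
```

Note that in this model a file can only be released after it has been archived, which matches the slide's wording "Release archived files from disk".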
Sustain Long-term Repositories
● Open data format
  ● Based on POSIX TAR
  ● Data is accessible without TAS
  ● No vendor lock-in
● Data protected at scale
  ● Support for 100s of PB of managed data
  ● Integration with Lustre HSM
● Future-ready technology migration
  ● Support for multigenerational data management
  ● Migrate with technology
Timeline axis: 2000, 2005, 2010, 2015, 2020, 2025, 2030, 2035
Callout: the default setting is Open
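The practical meaning of an open, tar-based format is that generic tools can read the data back. The sketch below shows this with Python's standard `tarfile` module on an in-memory archive. It illustrates the general property only: the slides do not describe the exact on-media layout TAS uses, and the file name and payload here are made up.

```python
import io
import tarfile

# Build a small POSIX ustar archive in memory to stand in for archived data.
payload = b"simulation output\n"
buf = io.BytesIO()
with tarfile.open(fileobj=buf, mode="w", format=tarfile.USTAR_FORMAT) as tar:
    info = tarfile.TarInfo(name="results/run001.dat")  # hypothetical file name
    info.size = len(payload)
    tar.addfile(info, io.BytesIO(payload))

# Read it back using nothing but the standard library.
buf.seek(0)
with tarfile.open(fileobj=buf, mode="r") as tar:
    names = tar.getnames()
    recovered = tar.extractfile("results/run001.dat").read()

print(names, recovered)
```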
Parallel Archiving with Cray TAS
Diagram labels: TAS Gateway; SANbox 5600 switches (x4), Storage Networks; rack-unit numbering 01 to 42 (x2); Archiver (x4, each labeled "ES T"); Customer Network
Cray TAS – Tiered Data Management Summary
● Store and Protect
  ● Up to 5 copies, across any media
  ● Transparently migrate data across tiers, from ingest to archive
● Access Data Forever
  ● All data always accessible to apps and users
  ● Your choice of file protocol
● Stay Open – Sustainable Infrastructure
  ● Open formats break vendor lock-in
  ● Data transparently migrates across generations of storage infrastructure, disk and tape
Protect and access your high-value data when you need it, for as long as you need it
The future is seldom the same as the past
--Seymour Cray, June 4, 1995