Power-Saving in Large-Scale Storage Systems with Data Migration

Koji Hasebe, Tatsuya Niwa, Akiyoshi Sugiki, and Kazuhiko Kato
University of Tsukuba, Japan

Post on 12-Sep-2021

Page 1: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving in Large-Scale Storage Systems with Data Migration

Koji Hasebe, Tatsuya Niwa, Akiyoshi Sugiki, and Kazuhiko Kato
University of Tsukuba, Japan

Page 2: Power-Saving in Large-Scale Storage Systems with Data Migration

Background

IT systems consume 1-2% of the total energy in the world. (Source: Green IT: A New Industry Shock Wave, Gartner Symp/ITxpo, 2007)

In large data centers, storage systems consume <40% of the total power. (Source: StorageIO, Greg Schulz)

Power-saving in storage systems is a central issue.

Page 3: Power-Saving in Large-Scale Storage Systems with Data Migration

Previous Studies

Commonly-observed technique:

[Figure: workload at peak time and at off-peak time; disks in low-power mode]

In the literature…
MAID [Colarelli-Grunwald, '02], PDC [Pinheiro-Bianchini, '04]
DIV [Pinheiro et al., '06], Pergamum [Storer et al., '08]
RIMAC [Yao-Wang, '06], eRAID [Wang-Zhu-Li, '08]
Hibernator [Zhu et al., '05], PARAID [Weddle et al., '07], etc.

Page 4: Power-Saving in Large-Scale Storage Systems with Data Migration

Previous Studies

Limitations:
• Central controller to manage data accesses
• Relatively small number of disks (up to several dozen)

Harnik et al. [IPDPS'09] propose an efficient allocation of replicated data.

[Figure: replicated data d1, d2, d3]

Page 6: Power-Saving in Large-Scale Storage Systems with Data Migration

Previous Studies

Limitations:
• Central controller to manage data accesses
• Relatively small number of disks (up to several dozen)

Harnik et al. [IPDPS'09] propose an efficient allocation of replicated data.

[Figure: replicated data d1, d2, d3; some disks in low-power mode]

Page 7: Power-Saving in Large-Scale Storage Systems with Data Migration

Motivation and Objective
• Apply the skewing technique to large storage systems.
• Explore an efficient technique based on data migration, instead of the replication approach.

Page 8: Power-Saving in Large-Scale Storage Systems with Data Migration

Motivation and Objective
• Apply the skewing technique to large storage systems.
• Explore an efficient technique based on data migration, instead of the replication approach.

[Figure: data items placed across the storage nodes]

Page 9: Power-Saving in Large-Scale Storage Systems with Data Migration

Motivation and Objective
• Apply the skewing technique to large storage systems.
• Explore an efficient technique based on data migration, instead of the replication approach.

[Figure: data items placed across the storage nodes; some nodes in low-power mode]

Page 10: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (1): Underlying System

[Figure: physical nodes P1-P12 grouped into Blocks 1-4, linked by parent/child relations]

• Assume that 3 physical nodes are required at off-peak time.
• May increase up to four-fold.

Page 11: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (1): Underlying System

[Figure: physical nodes P1-P12 in Blocks 1-4; virtual nodes V1-V9]

Managed by a distributed hash table (DHT)

Cf. Chord [Stoica et al., '01]
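
The two-level mapping mentioned on this slide (data to virtual nodes through a DHT, virtual nodes to physical nodes) might be sketched as follows. This is only an illustration in the spirit of the consistent hashing used by Chord. The ring size, the hash function (SHA-1 truncated to 32 bits), and the names `ring_id`, `VirtualRing`, `lookup`, and `placement` are assumptions made for the example and do not come from the slides.

```python
import bisect
import hashlib

RING_BITS = 32  # assumed size of the identifier space


def ring_id(key: str) -> int:
    """Hash a string onto the identifier ring (SHA-1 truncated to 32 bits)."""
    digest = hashlib.sha1(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % (1 << RING_BITS)


class VirtualRing:
    """Consistent-hashing ring whose members are virtual nodes."""

    def __init__(self, virtual_nodes):
        # Sort the virtual nodes by their position on the ring.
        self._ring = sorted((ring_id(v), v) for v in virtual_nodes)
        self._ids = [i for i, _ in self._ring]

    def lookup(self, data_key: str) -> str:
        """Return the virtual node that succeeds the key on the ring."""
        idx = bisect.bisect_left(self._ids, ring_id(data_key))
        return self._ring[idx % len(self._ring)][1]  # wrap around the ring


# Virtual nodes V1..V9 hosted on the three physical nodes of Block 1
# (assumed off-peak placement: three virtual nodes per physical node).
virtual_nodes = [f"V{i}" for i in range(1, 10)]
placement = {v: f"P{(i // 3) + 1}" for i, v in enumerate(virtual_nodes)}

ring = VirtualRing(virtual_nodes)
vnode = ring.lookup("file-0001")
print(vnode, "->", placement[vnode])
```

Under this sketch, migrating a virtual node only changes its entry in `placement`; the mapping from data keys to virtual nodes is left untouched.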

Page 12: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (1): Underlying System

[Figure: virtual nodes V1-V9 hosted on P1-P3 (Block 1); physical nodes P4-P12 in Blocks 2-4 carry the labels 1-9 of the virtual nodes assigned to them]

Page 13: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (2): Migration of Virtual Nodes

[Figure: virtual nodes V1-V9 hosted on P1-P3 (Block 1); P4-P6 (Block 2) carry the labels 1-9; label: Overloaded]

Page 14: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (2): Migration of Virtual Nodes

Divide V1 into two

[Figure: V1 is divided into V1(1/2) and V1(2/2); V2-V9 on P1-P3 (Block 1); P4-P6 (Block 2) carry the labels 1-9; label: Overloaded]

Page 15: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (2): Migration of Virtual Nodes

[Figure: V1(2/2) stays on P1 and V1(1/2) is placed in Block 2 (P4-P6), whose remaining labels are 2-9; label: Overloaded]

Page 17: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (2): Migration of Virtual Nodes

[Figure: V4 is divided into V4(1/2) and V4(2/2); V1 is already split into V1(1/2) and V1(2/2); label: Overloaded]

Page 18: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (2): Migration of Virtual Nodes

[Figure: V7 is divided into V7(1/2) and V7(2/2); V1 and V4 are already split; remaining labels in Block 2: 2, 3, 5, 6, 7, 8, 9; label: Overloaded]

Page 19: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (2): Migration of Virtual Nodes

[Figure: every virtual node V1-V9 is divided into halves Vi(1/2) and Vi(2/2), spread over P1-P3 (Block 1) and P4-P6 (Block 2)]

Page 20: Power-Saving in Large-Scale Storage Systems with Data Migration

Central Idea (2): Migration of Virtual Nodes

[Figure: physical nodes P1-P12 in Blocks 1-4 with parent/child relations; data items (d) distributed across the blocks]

Page 21: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithms
• Short-term optimization
  - Extension
  - Reduction
• Long-term optimization

Page 22: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 1: Short-term Optimization (Extension)

Procedure
1. Each physical node checks its own workload.
2. If the workload exceeds its capacity, then one of the virtual nodes is split and migrated to its child block.

[Figure: parent block P1-P3 holds V1(2/2), V2, V3, V4(2/2), V5, V6, V7(2/2), V8, V9; child block P4-P6 holds the migrated halves V1(1/2), V4(1/2), V7(1/2) and the empty slots (2), (3), (5), (6), (8), (9)]
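
The two-step Extension procedure might look roughly like the sketch below. It is not the authors' implementation: the `PhysicalNode` class, the choice of the busiest whole virtual node as the one to split, the even division of its workload, and the use of a single child node are all assumptions made to keep the example small.

```python
from dataclasses import dataclass, field


@dataclass
class PhysicalNode:
    name: str
    capacity: float                              # workload the node can serve
    vnodes: dict = field(default_factory=dict)   # fragment name -> workload
    active: bool = True

    def load(self) -> float:
        return sum(self.vnodes.values())


def extend(parent: PhysicalNode, child: PhysicalNode) -> None:
    """Split virtual nodes of an overloaded parent and migrate one half."""
    while parent.load() > parent.capacity:
        # Only whole (not yet split) virtual nodes are divided in this sketch.
        whole = [v for v in parent.vnodes if "(" not in v]
        if not whole:
            break  # nothing left to split
        # Assumed policy: split the busiest whole virtual node first.
        victim = max(whole, key=parent.vnodes.get)
        half = parent.vnodes.pop(victim) / 2      # assumed even split
        parent.vnodes[f"{victim}(2/2)"] = half    # half kept by the parent
        child.vnodes[f"{victim}(1/2)"] = half     # half migrated to the child
        child.active = True                       # wake the child if needed


p1 = PhysicalNode("P1", capacity=10, vnodes={"V1": 6, "V2": 4, "V3": 3})
p4 = PhysicalNode("P4", capacity=10, active=False)
extend(p1, p4)
print(p1.vnodes, p4.vnodes)
```

In this example P1 starts with a load of 13 against a capacity of 10, so one virtual node is split and half of it is moved to P4, which is switched on as a side effect.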

Page 23: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 1: Short-term Optimization (Extension)

Notes:
• Reusing the data stored on the previous day allows the migration to be done by copying only the difference.
• The mapping of virtual nodes effectively skews the workload.

[Figure: parent block P1-P3 holds V1(2/2), V2, V3, V4(2/2), V5, V6, V7(2/2), V8, V9; child block P4-P6 holds the migrated halves V1(1/2), V4(1/2), V7(1/2) and the empty slots (2), (3), (5), (6), (8), (9)]

Page 24: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

Problem

[Figure: parent block P1-P3 and child block P4-P6; V1, V5, V9 are whole and V2, V3, V4, V6, V7, V8 are split between parent and child]

The remaining capacity of physical nodes: (1) (1) (2)

The workload of each virtual node = 1

Page 25: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

Wrong migration

[Figure: parent block P1-P3 and child block P4-P6; V1, V5, V9 are whole and V2, V3, V4, V6, V7, V8 are split between parent and child; remaining capacities (1) (1) (2)]

Page 26: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

Wrong migration

[Figure: V3 and V6 are whole again while V2, V4, V7, V8 remain split; remaining capacities (1) (1) (2), with (0) (0) after the migration]

Page 27: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

The solution

[Figure: parent block P1-P3 and child block P4-P6; V1, V5, V9 are whole and V2, V3, V4, V6, V7, V8 are split between parent and child; remaining capacities (1) (1) (2), with (0) (0) (0) after the migration]

Page 28: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

The solution

[Figure: V1, V2, V4, V5, V7, V8, V9 are whole; only V3 and V6 remain split; remaining capacities (1) (1) (2), with (0) (0) (0) after the migration]

Page 29: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

Procedure
1. C → P: the information about the workloads for every virtual node
2. P lists all possible combinations of a subset of physical nodes s.t. P can absorb their virtual nodes

Page 30: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

Procedure
1. C → P: the information about the workloads for every virtual node
2. P lists all possible combinations of a subset of physical nodes s.t. P can absorb their virtual nodes

P1: {P4, P5}
P2: {P4, P5}, {P5, P6}
P3: {P4, P5}
Candidates:

[Figure: parent nodes P1-P3 and child nodes P4-P6 for the case in which only V3 is split; capacities (1) (∞) (∞)]

Page 31: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

Procedure
1. C → P: the information about the workloads for every virtual node
2. P lists all possible combinations of a subset of physical nodes s.t. P can absorb their virtual nodes

P1: {P4, P5}, {P4, P6}
P2: {P4, P5}, {P5, P6}
P3: {P4, P5}
Candidates:

[Figure: parent nodes P1-P3 and child nodes P4-P6 for the case in which only V2 is split; capacities (1) (∞) (∞)]

Page 32: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

Procedure
1. C → P: the information about the workloads for every virtual node
2. P lists all possible combinations of a subset of physical nodes s.t. P can absorb their virtual nodes
3. P → C: the result of Step 2
4. C calculates the intersection for all possible combinations of the results.

P1: {P4, P5}, {P4, P6}
P2: {P4, P5}, {P5, P6}
P3: {P4, P5}
Candidates: {P4, P5}

[Figure: parent nodes P1-P3 and child nodes P4-P6 for the case in which V3 and V6 are split; remaining capacities (1) (1) (2)]

Page 33: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

Procedure
1. C → P: the information about the workloads for every virtual node
2. P lists all possible combinations of a subset of physical nodes s.t. P can absorb their virtual nodes
3. P → C: the result of Step 2
4. C calculates the intersection for all possible combinations of the results.

P1: {P4, P5}, {P4, P6}
P2: {P4, P5}, {P5, P6}
P3: {P4, P5}
Candidates: {P4, P5}, {P5}

[Figure: parent nodes P1-P3 and child nodes P4-P6 for the case in which V3 and V4 are split; remaining capacities (1) (1) (2)]

Page 34: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 2: Short-term Optimization (Reduction)

Procedure
1. C → P: the information about the workloads for every virtual node
2. P lists all possible combinations of a subset of physical nodes s.t. P can absorb their virtual nodes
3. P → C: the result of Step 2
4. C calculates the intersection for all possible combinations of the results.

P1: {P4, P5}, {P4, P6}
P2: {P4, P5}, {P5, P6}
P3: {P4, P5}
Candidates: {P4, P5}, {P5}, {P4}

Solution

[Figure: parent nodes P1-P3 and child nodes P4-P6 with V3 and V6 split; remaining capacities (1) (1) (2)]
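
Step 4 of the procedure can be reproduced on the slide's own example with a few lines of code. The per-parent lists below are copied from the table; the function names and the rule of choosing the largest candidate as the solution are assumptions, since the slides do not state how the final candidate is selected.

```python
from itertools import product

# Result of Step 2 (taken from the slide): for each parent node, the sets of
# child nodes whose virtual nodes that parent could absorb.
absorbable = {
    "P1": [{"P4", "P5"}, {"P4", "P6"}],
    "P2": [{"P4", "P5"}, {"P5", "P6"}],
    "P3": [{"P4", "P5"}],
}


def shutdown_candidates(absorbable):
    """Step 4: intersect one choice per parent, over every combination."""
    candidates = set()
    for choice in product(*absorbable.values()):
        common = frozenset(set.intersection(*choice))
        if common:  # an empty intersection frees no child node
            candidates.add(common)
    return candidates


def pick_solution(candidates):
    """Assumed rule: prefer the candidate that frees the most child nodes."""
    return max(candidates, key=len, default=frozenset())


candidates = shutdown_candidates(absorbable)
solution = pick_solution(candidates)
print(sorted(sorted(c) for c in candidates))
print(sorted(solution))
```

The enumeration yields the same three candidates as the slide, {P4, P5}, {P5}, and {P4}; the fourth combination has an empty intersection and is dropped.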

Page 35: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 3: Long-term Optimization

Effective power-saving requires load balancing within each block.

Example:

[Figure: V1-V9 on parent nodes P1-P3 with the slots (1)-(9) on child nodes P4-P6; label: High workload]

Page 36: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 3: Long-term Optimization

Effective power-saving requires load balancing within each block.

Example:

[Figure: V1, V2, V3 are split between the parent and child blocks; V4-V9 stay whole with the slots (4)-(9) unused; label: Low workload]

Page 37: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 3: Long-term Optimization

Effective power-saving requires load balancing within each block.

Example:

[Figure: V1-V9 on parent nodes P1-P3 with the slots (1)-(9) on child nodes P4-P6]

Page 38: Power-Saving in Large-Scale Storage Systems with Data Migration

Power-Saving Algorithm 3: Long-term Optimization

Effective power-saving requires load balancing within each block.

Example:

[Figure: virtual nodes regrouped across P1-P3 in the order V1, V5, V9, V4, V2, V6, V7, V8, V3, with the slots (1)-(9) on child nodes P4-P6]

Load is balanced
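
The slides do not spell out how the long-term optimization regroups virtual nodes. As a stand-in, the sketch below uses a generic greedy heuristic (heaviest virtual node first, always onto the currently lightest physical node) to show what balancing the load within a block could look like. The function name `balance_block` and the workload figures are hypothetical.

```python
import heapq


def balance_block(vnode_loads, physical_nodes):
    """Greedy balancing: heaviest virtual node first, onto the lightest node."""
    heap = [(0.0, name) for name in physical_nodes]
    heapq.heapify(heap)
    assignment = {name: [] for name in physical_nodes}
    for vnode, load in sorted(vnode_loads.items(), key=lambda kv: -kv[1]):
        total, name = heapq.heappop(heap)          # lightest node so far
        assignment[name].append(vnode)
        heapq.heappush(heap, (total + load, name))
    return assignment


# Hypothetical loads: V1-V3 are busy, V4-V9 are not.
loads = {"V1": 3, "V2": 3, "V3": 3, "V4": 1, "V5": 1, "V6": 1,
         "V7": 1, "V8": 1, "V9": 1}
result = balance_block(loads, ["P1", "P2", "P3"])
totals = {n: sum(loads[v] for v in vs) for n, vs in result.items()}
print(result, totals)
```

With these numbers each physical node ends up hosting exactly one of the busy virtual nodes, which is the same kind of outcome as the regrouping shown in the example.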

Page 39: Power-Saving in Large-Scale Storage Systems with Data Migration

Evaluation (Simulation)

Purposes
• Evaluate the efficiency of skewing the workload.
• Evaluate the validity of long-term optimization.

Simulation environment

Number of physical nodes: 800
Number of virtual nodes: 10,000
Term of simulation: 1 day
Migration condition: Split: more than 90%; Merge: less than 70%
Workload of all virtual nodes: Initially at its lowest, increased until the middle of the day. The gap was sixfold.
Virtual node groups: The gap of the loads is twofold.
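
The migration condition and the workload profile in this table could be modelled as below. The 90% and 70% thresholds and the sixfold gap come from the table; reading the percentages as load relative to node capacity, the triangular shape of the daily curve, the midday peak at hour 12, and the names `migration_action` and `daily_workload` are assumptions for illustration.

```python
SPLIT_THRESHOLD = 0.90  # split when the load is more than 90%
MERGE_THRESHOLD = 0.70  # merge when the load is less than 70%


def migration_action(load: float, capacity: float) -> str:
    """Decide what a node should do, with a dead band between thresholds."""
    ratio = load / capacity
    if ratio > SPLIT_THRESHOLD:
        return "split"
    if ratio < MERGE_THRESHOLD:
        return "merge"
    return "keep"


def daily_workload(hour: float, base: float = 1.0,
                   peak_factor: float = 6.0) -> float:
    """Assumed triangular profile: lowest at hour 0, sixfold at hour 12."""
    distance = abs(hour - 12.0) / 12.0   # 0 at midday, 1 at midnight
    return base * (peak_factor - (peak_factor - 1.0) * distance)


for hour in (0, 6, 12, 18):
    print(hour, daily_workload(hour), migration_action(daily_workload(hour), 6.0))
```

Keeping a gap between the split and merge thresholds means that a node whose load sits between 70% and 90% triggers no migration at all.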

Page 40: Power-Saving in Large-Scale Storage Systems with Data Migration

Simulation Results (Average load)

[Figure: average load of active physical nodes (%) versus time (hour)]

Results
• Average load without long-term optimization: 57-69%
• Average load with long-term optimization: 67-74%

The long-term optimization algorithm improves the average load as expected.

Physical nodes run effectively, coping with the daily variation of workload.

Page 41: Power-Saving in Large-Scale Storage Systems with Data Migration

Simulation Results (Active nodes)

[Figure: the number of active physical nodes versus time (hour)]

Results
• Long-term optimization saves 7-14% on average, and up to 17-39%.

Optimization improves the power consumption consistently and continually.

Page 42: Power-Saving in Large-Scale Storage Systems with Data Migration

Evaluation (Prototype implementation)

Purposes
• Verify the efficiency of workload skewing on real machines.
• Verify whether the response time stays below the desired time.

Response time: the time from sending a request until the data are loaded into memory in the server.

Experiment environment

Number of physical nodes: 40 (Xeon 3.60 GHz CPU x2, about 2 GB memory, 36 GB SCSI HDD)
Number of files: 60,000 x 1 MB (total 60 GB)
Term of experiment: 1 day
Migration condition: Split: over 90%; Merge: under 70%
Workload of all virtual nodes: Initially at its lowest, increased until the middle of the day. The gap is sixfold.
Virtual node groups: Twofold gap between two groups
Amount of each migration: 10% of all the data

Page 43: Power-Saving in Large-Scale Storage Systems with Data Migration

Response Time

[Figure: response time per request (ms) versus time (hour)]

Results
• Average response time: 80 msec
• Maximum response time: 534 msec

Our algorithms keep the response time below the desired value almost all of the time.

Page 44: Power-Saving in Large-Scale Storage Systems with Data Migration

Average Load

[Figure: load of active physical nodes (%) versus time (hour)]

Results
• Overall average load: 67% of the capacity

The prototype can also skew the workload effectively, as in the simulation.

Page 45: Power-Saving in Large-Scale Storage Systems with Data Migration

Number of Active Physical Nodes

[Figure: the number of active physical nodes and the number of migrations versus time (hour)]

• Migration is done on
  - Average: 0.14 virtual nodes
  - Maximum: 20 virtual nodes

Our system adjusts the number of physical nodes to the variation of workloads and reduces power effectively.

Page 46: Power-Saving in Large-Scale Storage Systems with Data Migration

Conclusions
• Power-saving method for large-scale distributed storage systems.
• Short/long-term optimization algorithms for reducing power consumption.

Performance evaluation
• Simulation results showed that our method kept the average workload at 67-74%.
• Prototype implementation results showed that:
  - the overall average load was 67%;
  - it can maintain a preferred response time.

Page 47: Power-Saving in Large-Scale Storage Systems with Data Migration

Future work
• Implement a replication mechanism to improve reliability.
• Improve the long-term optimization algorithm.