
Post on 21-Feb-2019


Clustering with Minitab

Soft Computing Lab

Yonsei Univ.

S FT COMPUTING @ YONSEI UNIV . KOREA 16

1. http://sclab.yonsei.ac.kr/Dataset.zip

2. http://sclab.yonsei.ac.kr/ -> Courses -> Special lecture -> ->


Observations: 22 (public utility companies)

Variables: 8

(e.g., sales, fuel cost)


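Fixed_charge: fixed-charge coverage ratio (income/debt)

RoR: rate of return on capital

Cost: cost per kW of capacity in place

Load_factor: annual load factor

Demand_growth: growth in peak demand (kWh) from 1974 to 1975 (%)

Sales: sales (kWh use per year)

Nuclear: share of nuclear power (%)

Fuel Cost: total fuel cost (cents/kWh)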


->()->Y: , X: Sales


2~3

,

,

,


(Hierarchical methods)

: n

:

(Nonhierarchical methods)

K-means


()

(scale)

(value - mean) / standard deviation

->
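A minimal sketch of the scaling step on this slide, assuming it refers to the usual z-score standardization, (value - mean) / standard deviation, so that variables measured in very different units contribute comparably to a distance. The function name and the sample values are illustrative only and are not taken from the slides.

```python
from statistics import mean, stdev

def standardize(column):
    """Return z-scores: (value - mean) / sample standard deviation."""
    m = mean(column)
    s = stdev(column)  # sample standard deviation (n - 1 denominator)
    return [(x - m) / s for x in column]

# Hypothetical values on very different scales (illustrative only)
sales = [9077.0, 5088.0, 9212.0, 6423.0]
fuel_cost = [0.628, 1.555, 1.058, 0.700]

z_sales = standardize(sales)
z_fuel = standardize(fuel_cost)

# After standardization each column has mean 0 and standard deviation 1
print(round(mean(z_sales), 6), round(stdev(z_sales), 6))
print(round(mean(z_fuel), 6), round(stdev(z_fuel), 6))
```

Whether the sample or the population standard deviation is used changes the z-scores only by a constant factor per column, so it does not change which observations end up closest to each other.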


()


()

?

?

(, , ) ?

?

( )

( )


Linkage methods:

Average

Centroid

Complete

McQuitty

Median

Single

Ward
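To make the idea of a linkage method concrete, here is a small brute-force sketch of agglomerative (hierarchical) clustering with three common linkage rules: single, complete, and average. It is not Minitab's implementation. McQuitty, centroid, median, and Ward linkage are left out to keep it short, and the function names and sample points are hypothetical.

```python
from math import dist

def linkage_distance(a, b, points, method):
    """Distance between clusters a and b (lists of point indices)."""
    pairwise = [dist(points[i], points[j]) for i in a for j in b]
    if method == "single":      # closest pair of members
        return min(pairwise)
    if method == "complete":    # farthest pair of members
        return max(pairwise)
    if method == "average":     # mean over all pairs of members
        return sum(pairwise) / len(pairwise)
    raise ValueError(f"unknown linkage method: {method}")

def agglomerate(points, n_clusters, method="single"):
    """Merge the two closest clusters until n_clusters remain."""
    clusters = [[i] for i in range(len(points))]  # start: one cluster per point
    while len(clusters) > n_clusters:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                d = linkage_distance(clusters[x], clusters[y], points, method)
                if best is None or d < best[0]:
                    best = (d, x, y)
        _, x, y = best
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return clusters

# Hypothetical 2-D observations (illustrative only)
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0), (10.0, 0.0)]
result = agglomerate(points, 2, "single")
print(result)
```

The only thing that changes between linkage methods is how the distance between two clusters is derived from the distances between their members, which is why the same data can produce different dendrograms under different methods.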


2

-> ->

:

:

,


( )

-> ->

: , : Euclid, : 1


( )

-> ->


( )

-> ->

-> : C18


( )

C18


( )


( )

(2 )


( )

(3 )


( )


( )


:

(, , )

a B

B


,

,


(k- )

( )

k-means

k
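As an illustration of the k-means idea, the sketch below alternates between assigning each observation to its nearest center and recomputing each center as the mean of its members, stopping when the assignments no longer change. Using the first k observations as initial centers is a simplifying assumption made here for reproducibility. The function name and data are hypothetical.

```python
from math import dist

def kmeans(points, k, max_iter=100):
    """Basic k-means: returns (labels, centers)."""
    centers = [tuple(p) for p in points[:k]]  # assumption: first k points as initial centers
    labels = None
    for _ in range(max_iter):
        # Assignment step: index of the nearest center for every point
        new_labels = [min(range(k), key=lambda c: dist(p, centers[c])) for p in points]
        if new_labels == labels:
            break  # assignments are stable, so the centers will not move either
        labels = new_labels
        # Update step: each center becomes the mean of its members
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old center if a cluster becomes empty
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels, centers

# Hypothetical 2-D observations (illustrative only)
points = [(1.0, 1.0), (8.0, 8.0), (1.5, 2.0), (9.0, 9.5), (1.0, 0.5), (8.5, 9.0)]
labels, centers = kmeans(points, 2)
print(labels)
print(centers)
```

Unlike hierarchical clustering, the number of clusters k has to be chosen before running the procedure, and the result can depend on the initial centers.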


(k- )

-> ->K-means


(k- )

( ), : 6


(k- )

( ), : 6


:

77 , ,

. .

.

. ? ?


:

. .

-> ->, , 4~6


:

: , : 5

: 1

:

: , : 5


:

: , : 6

: 1

:

: , : 6


:


:

( )

( )


:

. ? ?

.

: mg g

.

: (protein, fat, sodium, sugar)


:

-> ->

(protein, fat, sodium, fiber), , 3


:

protein fat fiber 1


:

100%_Bran, All-Bran, All-Bran_with_Extra_Fiber


Wine


Wine

13 attributes:

Alcohol

Malic Acid

Ash

Alkalinity of ash

Magnesium

Total phenols

Flavanoids

Nonflavanoid phenols

Proanthocyanins

Color intensity

Hue

OD280/OD315 of diluted wines

Proline


1. Wine 2 . ( : )

2. 1 Wine 2 .

3. K-means 2 .

4. 3 .

5. .
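The later slides for this exercise store the first two principal components (PC1, PC2) in columns C15 and C16 and then cluster on them. The sketch below shows one way to compute two principal-component scores by hand: standardize the columns, form their correlation matrix, and extract the top two eigenvectors by power iteration with deflation. Working on the correlation matrix, the starting vector, and the tiny three-column sample are all assumptions made for illustration. This is not Minitab's algorithm and not the real Wine data.

```python
from statistics import mean, stdev

def standardize(rows):
    """Column-wise z-scores of a list of rows."""
    cols = list(zip(*rows))
    ms = [mean(c) for c in cols]
    ss = [stdev(c) for c in cols]
    return [[(x - m) / s for x, m, s in zip(r, ms, ss)] for r in rows]

def covariance(rows):
    """Covariance matrix of already-centered rows (n - 1 denominator)."""
    n, p = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / (n - 1) for j in range(p)]
            for i in range(p)]

def top_eigenpair(m, iters=500):
    """Largest eigenvalue and eigenvector of a symmetric matrix (power iteration)."""
    p = len(m)
    v = [1.0 / (i + 1) for i in range(p)]  # arbitrary non-symmetric start vector
    for _ in range(iters):
        w = [sum(m[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    value = sum(v[i] * sum(m[i][j] * v[j] for j in range(p)) for i in range(p))
    return value, v

def pca_scores(rows, n_components=2):
    """Scores of each row on the first n_components principal components."""
    z = standardize(rows)
    m = covariance(z)  # equals the correlation matrix because z is standardized
    components = []
    for _ in range(n_components):
        value, vec = top_eigenpair(m)
        components.append(vec)
        # Deflation: remove the component just found before searching for the next
        p = len(vec)
        m = [[m[i][j] - value * vec[i] * vec[j] for j in range(p)] for i in range(p)]
    return [[sum(x * c for x, c in zip(r, comp)) for comp in components] for r in z]

# Hypothetical measurements, three variables per sample (illustrative only)
rows = [
    [13.2, 1.8, 2.4],
    [12.4, 2.9, 2.1],
    [14.1, 1.6, 2.7],
    [12.0, 3.4, 2.0],
    [13.7, 2.0, 2.5],
    [12.8, 2.6, 2.6],
]
scores = pca_scores(rows)
for pc1, pc2 in scores:
    print(round(pc1, 3), round(pc2, 3))
```

The two score columns can then be plotted against each other or passed to a clustering routine, which is the sequence the exercise steps appear to follow.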


1 ()

Wine

[]->[ ] Wine.xls

[]->[ ]->[ ]


1 ()

2

:

C15, C16


1

1(PC1) C15 2(PC2) C16


2 ()

[]->[]

X, Y PCA


2

PC1 PC2

2~3


3 ()

[]->[ ]->[K-means ]

2 ,

c17


3 ()

C15


3 ()

[]->[]


3

[] C17


4 ()

[]->[ ]->[ ]

, , 4


4

3


5 ()

[]->[]


5 ()

C18-C30


5 ()

: []->[ ]


5 ()

, , Cluster


5

e.g. 1 2 15


.

Euclid, Manhattan, Pearson, squared Euclid, squared Pearson
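The distance measures named on this slide can be written out as follows. This is a sketch under one stated assumption: "Pearson" distance is taken here to be the Euclidean distance with each squared difference divided by that variable's variance, which is how Minitab's documentation appears to define it, and that should be checked against the Minitab help before relying on it. Function names and inputs are illustrative.

```python
def euclidean(x, y):
    """Straight-line distance."""
    return sum((a - b) ** 2 for a, b in zip(x, y)) ** 0.5

def manhattan(x, y):
    """Sum of absolute differences."""
    return sum(abs(a - b) for a, b in zip(x, y))

def pearson(x, y, variances):
    """Assumed definition: Euclidean distance with each term scaled by the variable's variance."""
    return sum((a - b) ** 2 / v for a, b, v in zip(x, y, variances)) ** 0.5

def squared_euclidean(x, y):
    return sum((a - b) ** 2 for a, b in zip(x, y))

def squared_pearson(x, y, variances):
    return sum((a - b) ** 2 / v for a, b, v in zip(x, y, variances))

# Two hypothetical observations and hypothetical per-variable variances
x, y = (0.0, 0.0), (3.0, 4.0)
variances = (9.0, 16.0)

print(euclidean(x, y))                    # 5.0
print(manhattan(x, y))                    # 7.0
print(squared_euclidean(x, y))            # 25.0
print(squared_pearson(x, y, variances))   # 2.0
```

The squared variants exaggerate large differences, and the variance-scaled variants reduce the influence of variables with large spread, which has a similar effect to standardizing the data first.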


IRIS


Iris

3 species (Setosa, Versicolor, Virginica)

Sepal length (cm)

Sepal width (cm)

Petal length (cm)

Petal width (cm)

Species (setosa / versicolor / virginica)


1. Iris . ( )

2. K-means .


1

1. Iris . ( )


2

2. K-means .


Boston House


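BostonHousing

CRIM : per capita crime rate by town

ZN : proportion of residential land zoned for lots over 25,000 sq. ft.

INDUS : proportion of non-retail business acres per town

CHAS : Charles River dummy variable (1 if the tract bounds the river, 0 otherwise)

NOX : nitric oxides concentration (parts per 10 million)

RM : average number of rooms per dwelling

AGE : proportion of owner-occupied units built before 1940

DIS : weighted distance to 5 Boston employment centers

RAD : index of accessibility to radial highways

TAX : full-value property tax rate per $10,000

PTRATIO : pupil/teacher ratio by town

B : 1000(Bk-0.63)^2 (Bk: proportion of Black residents by town)

LSTAT : lower-status population (%)

MEDV : median value of owner-occupied homes (unit: $1,000)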


1. BostonHousing RM MEDV .

2. K-means RM MEDV .

3. 2 .

4. Manhattan Pearson 4 .


1

1. BostonHousing RM MEDV .


2

2. K-means RM MEDV .


3

3. 2 .


4

4. 4 .

Manhattan Pearson


.mtw : 143

,

,

,

: 143 , ,

: 2, 78, 15 . (1=, 2=, 3=)

K-means , .


.

2 =1, 78=2, 15=3

0 .

K-means

K-means

K-means


-


-


1: .

2:


1


2


K-means vs


Telco-CAT (2001)

: ID , ,

: Churn(), Tariff(), Tariff_OK( )

: Peak( ), Off-Peak( ), Weekend( ) , International( )


1. (Behavior) . 8 .

Customer_ID( ID), Peak_calls_Sum ( )

AvePeak ( (), 1)

OffPeak_calls_Sum ( )

AveOffPeak ( (), 1)

Weekend_calls_Sum ( )

AveWeekend( (), 1)

International_min_Sum ( ())

2. .

(AvePeak, AveOffPeak, AveWeekend )

3. 2 .

4. K-means 2 .( )

5. K-means 3, 4 . ( )


10%

10~20%

Hot deck case substitution

Regression

Model-based methods

20%

Regression

Model-based methods
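Two of the simpler ways of filling in missing values can be sketched as follows: mean substitution, and a basic hot-deck fill that copies the value from the most similar complete case (the "donor"). Both are illustrative sketches rather than the procedure used in the slides. Choosing the donor by closeness on a single auxiliary variable is an assumption, and the function names and data are hypothetical. Regression and model-based imputation are not shown.

```python
from statistics import mean

def missing_fraction(values):
    """Share of missing entries (None) in a column."""
    return sum(v is None for v in values) / len(values)

def mean_impute(values):
    """Replace every missing entry with the mean of the observed entries."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]

def hot_deck_impute(values, aux):
    """Replace each missing entry with the value of the donor whose
    auxiliary variable is closest (assumed similarity rule)."""
    donors = [(a, v) for a, v in zip(aux, values) if v is not None]
    filled = []
    for a, v in zip(aux, values):
        if v is None:
            v = min(donors, key=lambda d: abs(d[0] - a))[1]
        filled.append(v)
    return filled

# Hypothetical column with missing entries and a fully observed auxiliary column
intl_minutes = [12.0, None, 40.0, 8.0, None]
peak_calls = [100, 110, 400, 90, 380]

print(missing_fraction(intl_minutes))           # 0.4
print(mean_impute(intl_minutes))                # [12.0, 20.0, 40.0, 8.0, 20.0]
print(hot_deck_impute(intl_minutes, peak_calls))  # [12.0, 12.0, 40.0, 8.0, 40.0]
```

Mean substitution shrinks the variance of the filled column, while hot-deck filling keeps values that actually occur in the data, which is one reason the choice of method is tied to how much of the column is missing.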


1


1


2

()

International_mins_Sum = 3237

= 3196

= 41


International_mins_Sum = 168.799



1. (Behavior) . 8 .( )

Customer_ID( ID)

Peak_calls_Sum ( )

AvePeak ( (), 1)

OffPeak_calls_Sum ( )

AveOffPeak ( (), 1)

Weekend_calls_Sum ( )

AveWeekend( (), 1)

International_min_Sum ( ())

2. .

3. 2 .

4. K-means 2 .( )

5. K-means 3, 4 . ( )


1, 2

8


3


3


4, 5


4, 5
