Kyungpook National University Database Lab Milestones (rosaec.snu.ac.kr/meet/file/20110625c.pdf)


Page 1

June 25, 2011

Page 2

Developed a framework for fairly analyzing the performance of existing indexing techniques in graph databases (VLDB 2010), and developed a tool for effectively analyzing the performance of multiple algorithms (SIGMOD 2011)

Research on ranked subsequence matching methods in time-series databases (SIGMOD 2011)

Research results

◦ Presented two papers at SIGMOD 2011 (including a demo)

◦ "iGraph in Action: Performance Analysis of Disk-Based Graph Indexing Techniques." Wook-Shin Han, Minh-Duc Pham, Jinsoo Lee, Romans Kasperovics, and Jeffrey Xu Yu. Proceedings of the 37th SIGMOD International Conference on Management of Data (SIGMOD'11). 2011.

◦ "A New Approach for Processing Ranked Subsequence Matching Based on Ranked Union." Wook-Shin Han, Jinsoo Lee, Yang-Sae Moon, Seung-won Hwang, and Hwanjo Yu. Proceedings of the 37th SIGMOD International Conference on Management of Data (SIGMOD'11). 2011.

Page 3

Presented two papers in total at VLDB 2010 and ICDE 2010

"iGraph: A Framework for Comparisons of Disk-Based Graph Indexing Techniques." Wook-Shin Han, Jinsoo Lee, Minh-Duc Pham, and Jeffrey Xu Yu. Proceedings of the VLDB Endowment (PVLDB). 3(1). 2010.

"On Supporting Effective Web Extraction." Wook-Shin Han, Wooseong Kwak, and Hwanjo Yu. 26th IEEE International Conference on Data Engineering (ICDE'10). 2010.

Page 4

Presented a paper at SIGMOD 2009

"Dependency-Aware Reordering for Parallelizing Query Optimization in Multi-Core CPUs." Wook-Shin Han and Jinsoo Lee. Proceedings of the 35th SIGMOD International Conference on Management of Data (SIGMOD'09). 2009.

Page 5

"StreamTX: Extracting Tuples from Streaming XML Data." Wook-Shin Han, Haifeng Jiang, Howard Ho, and Quanzhong Li. Proceedings of the VLDB Endowment (PVLDB). 1(1). 2008.

"Parallelizing Query Optimization." Wook-Shin Han, Wooseong Kwak, Jinsoo Lee, Guy M. Lohman, and Volker Markl. Proceedings of the VLDB Endowment (PVLDB). 1(1). 2008.

"Ranked Subsequence Matching in Time-Series Databases." Wook-Shin Han, Jinsoo Lee, Yang-Sae Moon, and Haifeng Jiang. Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB'07). 2007.

"Progressive Optimization in a Shared-Nothing Parallel Database." Wook-Shin Han, Jack Ng, Volker Markl, Holger Kache, and Mokhtar Kandil. Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data (SIGMOD'07). 2007.

Page 6

Page 7

What are time-series data?

Page 8

Page 9

Page 10

(c) Eamonn Keogh, [email protected]

Page 11

(c) Eamonn Keogh, [email protected]

Page 12

[Figure labels: Red Howler Monkey, Mantled Howler Monkey, Orangutan (juvenile), Borneo Orangutan]

(c) Eamonn Keogh, [email protected]

Page 13

[Figure: two annotated time series, each with a horizontal axis running from 0 to 90. Panel "Point": hand at rest, hand moving to shoulder level, steady pointing. Panel "Gun-Draw": hand at rest, hand moving above holster, hand moving down to grasp gun, hand moving to shoulder level, steady pointing.]

(c) Eamonn Keogh, [email protected]

Page 14

Page 15

What is Similarity Search?

Page 16

Page 17

Page 18

Euclidean Distance

Page 19

Page 20

$$D(Q, C) \equiv \sqrt{\sum_{i=1}^{n} (q_i - c_i)^2}$$

$Q = q_1 \dots q_n$

$C = c_1 \dots c_n$

(c) Eamonn Keogh, [email protected]
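As a minimal sketch, the Euclidean distance between a query Q = q1…qn and a candidate C = c1…cn (the square root of the summed squared differences) might be computed as follows. It assumes plain Python lists of equal length; the function name `euclidean_distance` and the sample values are illustrative and not taken from the slides.

```python
import math


def euclidean_distance(q, c):
    """Return sqrt(sum_i (q_i - c_i)^2) for two equal-length sequences."""
    if len(q) != len(c):
        raise ValueError("sequences must have the same length")
    return math.sqrt(sum((qi - ci) ** 2 for qi, ci in zip(q, c)))


Q = [0.0, 3.0, 1.0]
C = [0.0, 0.0, 5.0]
print(euclidean_distance(Q, C))  # sqrt(0 + 9 + 16) = 5.0
```

Every point of the series contributes one term, so the cost of a single comparison grows linearly with the series length n.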

Page 21

R* trees

Page 22

[Figure: R*-tree example. A spatial view shows regions A and B, regions 1 to 5, and data entries a to l; a tree view shows A and B at the top level, nodes 1 to 5 below them, and entries a to l at the leaves.]

Page 23

Dimensionality

Page 24

[Figure: elapsed time versus dimension, comparing search with an R* tree and without an R* tree.]

Page 25

Dimension Reduction!

M (256) → N (8)

Page 26

Piecewise Aggregate Approximation

Page 27

$$\bar{c}_i = \frac{N}{n} \sum_{j = \frac{n}{N}(i-1)+1}^{\frac{n}{N} i} c_j$$

Here $C = c_1 \dots c_n$ is the original series and $\bar{C} = \bar{c}_1 \dots \bar{c}_N$ is its approximation.
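A rough sketch of Piecewise Aggregate Approximation: a series of length n is reduced to N values by averaging N equal-width frames. The sketch assumes n is an exact multiple of N; the function name `paa` and the sample data are illustrative only.

```python
def paa(series, n_segments):
    """Reduce `series` to `n_segments` values, each the mean of one equal-width frame."""
    n = len(series)
    if n_segments <= 0 or n % n_segments != 0:
        raise ValueError("this sketch assumes len(series) is a positive multiple of n_segments")
    width = n // n_segments  # number of original points per frame (n / N)
    return [
        sum(series[i * width:(i + 1) * width]) / width
        for i in range(n_segments)
    ]


print(paa([1.0, 3.0, 2.0, 6.0, 5.0, 5.0, 0.0, 2.0], 4))  # [2.0, 4.0, 5.0, 1.0]
```

With the sizes mentioned earlier in the slides, a series of 256 points reduced to 8 values would use frames of 32 points each.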

Page 28

Important Lemma

Page 29

$$D(Q, C) < \epsilon \;\Rightarrow\; D(\mathrm{PAA}(Q), \mathrm{PAA}(C)) < \epsilon$$
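The lemma is what makes searching in the reduced space safe: a distance computed on PAA representations should never exceed the true Euclidean distance. The check below is an empirical illustration on random data, not a proof. It assumes the scaled PAA distance commonly used in the PAA literature, sqrt(n/N) times the Euclidean distance between the two PAA vectors; that scaling factor does not appear on the slide, and all function names here are hypothetical.

```python
import math
import random


def euclidean_distance(q, c):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(q, c)))


def paa(series, n_segments):
    width = len(series) // n_segments  # assumes len(series) is a multiple of n_segments
    return [sum(series[i * width:(i + 1) * width]) / width for i in range(n_segments)]


def paa_distance(q_paa, c_paa, n):
    """Scaled distance between PAA vectors of two series of original length n."""
    return math.sqrt(n / len(q_paa)) * euclidean_distance(q_paa, c_paa)


random.seed(0)
n, N = 256, 8
violations = 0
for _ in range(1000):
    Q = [random.gauss(0, 1) for _ in range(n)]
    C = [random.gauss(0, 1) for _ in range(n)]
    # Small tolerance guards against floating-point rounding only.
    if paa_distance(paa(Q, N), paa(C, N), n) > euclidean_distance(Q, C) + 1e-9:
        violations += 1
print(violations)  # 0: the reduced-space distance never exceeds the true distance
```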

Page 30

Pruning!

Page 31

We can prune any C such that D(PAA(Q), PAA(C)) > cur_top_kth_similar
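The pruning rule can be sketched as a linear scan that keeps the k best candidates seen so far and skips the full distance computation whenever the PAA distance already exceeds the current k-th best. This is an illustrative sketch under stated assumptions, not the method from the cited papers: it scans sequentially rather than using an index, it uses the scaled PAA distance (sqrt(n/N) times the distance between PAA vectors), and every name except `cur_top_kth_similar` is hypothetical.

```python
import heapq
import math


def euclidean_distance(q, c):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(q, c)))


def paa(series, n_segments):
    width = len(series) // n_segments  # assumes len(series) is a multiple of n_segments
    return [sum(series[i * width:(i + 1) * width]) / width for i in range(n_segments)]


def paa_distance(q_paa, c_paa, n):
    return math.sqrt(n / len(q_paa)) * euclidean_distance(q_paa, c_paa)


def top_k_similar(query, candidates, k, n_segments):
    """Return ([(distance, index), ...] nearest first, number of full distance computations)."""
    n = len(query)
    q_paa = paa(query, n_segments)
    best = []  # max-heap emulated with negated distances; holds at most k entries
    full_computations = 0
    for idx, cand in enumerate(candidates):
        if len(best) == k:
            cur_top_kth_similar = -best[0][0]
            if paa_distance(q_paa, paa(cand, n_segments), n) > cur_top_kth_similar:
                continue  # lower bound already worse than the k-th best: prune
        d = euclidean_distance(query, cand)
        full_computations += 1
        if len(best) < k:
            heapq.heappush(best, (-d, idx))
        elif d < -best[0][0]:
            heapq.heapreplace(best, (-d, idx))
    return sorted((-neg_d, idx) for neg_d, idx in best), full_computations


query = [0.0, 0.0, 1.0, 1.0]
candidates = [
    [0.0, 0.0, 1.0, 1.0],      # identical to the query
    [0.0, 1.0, 1.0, 1.0],      # differs in one point
    [10.0, 10.0, 10.0, 10.0],  # far away
    [5.0, 5.0, 5.0, 5.0],      # far away
]
result, full_computations = top_k_similar(query, candidates, k=2, n_segments=2)
print(result)             # [(0.0, 0), (1.0, 1)]
print(full_computations)  # 2: the last two candidates were pruned
```

Because the PAA distance is a lower bound on the true distance, a pruned candidate cannot belong to the top k, so the result matches what an exhaustive scan would return.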