trajectory pattern mining

37
Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping Trajectory Pattern Mining Reporter Hsieh, Hsun-Ping 解解解 (R96725019) Fosca Giannotti Mirco Nanni Dino Pedreschi Fa bio Pinelli Pisa KDD Laboratory KDD’07 ISTI - CNR, Area della Ricerca di Pisa, Via Giusepp e Moruzzi, 1 - 56124 Pisa, Italy Computer Science Dep., University of Pisa, Largo Po ntecorvo, 3 - 56127 Pisa, Italy

Upload: tyrell

Post on 19-Mar-2016

75 views

Category:

Documents


2 download

DESCRIPTION

Trajectory Pattern Mining. NTU IM Hsieh, Hsun-Ping. Trajectory Pattern Mining. Reporter : Hsieh, Hsun-Ping 解巽評 (R96725019) Fosca Giannotti Mirco Nanni Dino Pedreschi Fabio Pinelli Pisa KDD Laboratory KDD’07 - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Trajectory Pattern Mining

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Trajectory Pattern Mining

Reporter: Hsieh, Hsun-Ping 解巽評 (R96725019)Fosca Giannotti Mirco Nanni Dino Pedreschi Fabio Pinelli

Pisa KDD Laboratory KDD’07

ISTI - CNR, Area della Ricerca di Pisa, Via Giuseppe Moruzzi, 1 - 56124 Pisa, ItalyComputer Science Dep., University of Pisa, Largo Pontecorvo, 3 - 56127 Pisa, Italy

Page 2: Trajectory Pattern Mining

2/37

Trajectory Pattern Mining

Introduction

Background and Related Work

Problem Definition

Regions-Of-Interest

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Outline

Experiments & Conclusion

NTU IM Hsieh, Hsun-Ping

Page 3: Trajectory Pattern Mining

3/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Summery

Algorithm

TrajectoryPattern

Spatio-temporalpattern

Spatio-Temporal pattern :•Space & time element should be considered•In this paper, Spatio-Temporal pattern is used to apply on Trajectory Pattern Mining.

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Page 4: Trajectory Pattern Mining

4/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Summery

Algorithm

Spatio-temporalpattern

Trajectory pattern application:•Pervasiveness of location-acpuisition technologies (GPS, GSM network) •Spatio-temporal datasets to discovery usable knowledge about movement behaviour. •the movement of people or vehicles within a given area can be observed from the digital devices.•Useful in the domain of sustainable mobility and traffic management.Experiment &

Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

TrajectoryPattern

Page 5: Trajectory Pattern Mining

5/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Summery

Approach

Spatio-temporalpattern

Trajectory pattern :•a set of individual trajectories that share the property of visiting the same sequence of places with similar travel times.•Two notions are central: (i) the region of interest in the given space (ii) the typical travel time of moving object from region to region

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

TrajectoryPattern

Page 6: Trajectory Pattern Mining

6/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Summery

Approach

Spatio-temporalpattern Trajectory pattern example:

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

TrajectoryPattern

We only require that such trajectories visit the same sequence of places with similar transition times, even if they start at diffierent absolute times

Page 7: Trajectory Pattern Mining

7/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Summery

Spatio-temporalpattern Three Trajectory pattern mining Approach:

•Pre-conceived regions of interest

•Popular regions

•the identification of the regions of interest is dynamically intertwined with the mining of sequences with temporal information.

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

TrajectoryPattern

Approach

Page 8: Trajectory Pattern Mining

8/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Approach

Spatio-temporalpattern

Contributions in this paper:•(i) the definition of the novel trajectory pattern

•(ii) a density-based algorithm for discovering regions of interest

•(iii) a trajectory pattern mining algorithm with predefined regions of interest

• (iv) a trajectory pattern mining algorithm which dynamically discovers regions of interest.

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

TrajectoryPattern

Summery

Page 9: Trajectory Pattern Mining

9/37

TemporallyAnnotatedSequence

PreviousResearch

Spatio-temporal

sequential patternsBackground &

Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Spatio-temporal sequential patterns :•frequent sequential pattern (FSP) problem, is defined over a database of sequences D, where each element of each sequence is a time stamped set of items (itemset).

•FSP problem consists in finding all the sequences that are frequent in D.

•A sequence α = α1 → · · · →αk is a subsequence of β = β1 → ·· · → βm if there exist integers 1 ≤ i1 < . . . < ik ≤ m such that ∀1≤n≤k αn β⊆ in. •Define support suppD(S) of a sequence S as the percentage of transactions T D ∈•S is frequent w.r.t. threshold smin if suppD(S) ≥ smin.

TASexample

Page 10: Trajectory Pattern Mining

10/37

TemporallyAnnotatedSequence

Spatio-temporal

sequential patternsBackground &

Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

TASexample

Previous Research about trajectory :•The work in [3] considers patterns that are in the form of trajectory segments and searches approximate instances in the data.

• The work in [7] provides a clustering-based perspective, and considers patterns in the form of moving regions within time intervals

•Finally, a similar goal, but focused on cyclic patterns,is pursued in [8]

•This focused on the extraction of patterns over sequences of events that describe also the temporal relations between events,

PreviousResearch

Page 11: Trajectory Pattern Mining

11/37

PreviousResearch

Spatio-temporal

sequential patterns

•Example:

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Temporally Annotated Sequence :•Temporally annotated sequences (TAS), introduced in [5],are an extension of sequential patterns that enrich sequences with information about the typical transition times between their elements.

TASexample

TemporallyAnnotatedSequence

Page 12: Trajectory Pattern Mining

12/37

TASexample

PreviousResearch

Spatio-temporal

sequential patternsBackground &

Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Definition 1 :

TemporallyAnnotatedSequence

Page 13: Trajectory Pattern Mining

13/37

PreviousResearch

Spatio-temporal

sequential patternsBackground &

Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Example :

TemporallyAnnotatedSequence

if τ = 2 and smin = 1, not only T is frequent, but also any other variant of T having transition times t1, t2 where t1 ∈ [1, 5] and t2 ∈ [9, 13].

TASexample

Page 14: Trajectory Pattern Mining

14/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

•The key task in sequence mining consists in counting the occurrences of a pattern, i.e., those segments of the input data that match a candidate pattern.

•In this paper it requires locationsapproximated match and error tolerance when matching spatial .

•neighborhood function N : R2 →P(R2), which assigns to each pair (x, y) a set N (x, y) of neighboring points.

T-patternMining

Spatio-temporal

containment

T-pattern

Spatial containment

ST-sequence

Page 15: Trajectory Pattern Mining

15/37

T-patternMining

Spatio-temporal

containment

T-pattern

ST-sequenceBackground &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Spatial containment

Page 16: Trajectory Pattern Mining

16/37

T-patternMining

Spatio-temporal

containment

Spatial containment

ST-sequenceBackground &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

T-pattern

Page 17: Trajectory Pattern Mining

17/37

T-patternMining

T-pattern

Spatial containment

ST-sequenceBackground &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Spatio-temporal

containment

Page 18: Trajectory Pattern Mining

18/37

T-patternMining

T-pattern

Spatial containment

ST-sequenceBackground &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Figure 1 spatial and temporal constraints essentially form a spatio-temporal neighborhood around each point of the reference trajectory

Spatio-temporal

containment

Page 19: Trajectory Pattern Mining

19/37

Spatio-temporal

containment

T-pattern

Spatial containment

ST-sequenceBackground &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

T-patternMining

Page 20: Trajectory Pattern Mining

20/37

neighborhood function is used to model Regions-of-Interest (RoI), that represent a natural way to partition the space into meaningful areas and to associate spatial points with region labels.Background &

Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

ROIConstruction

Popualr pointsdetection

DiscoveringROI

TrajectoryPreproceing

Region-Of-Interest

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Integrating ROI & trajectories :•Input:set R of disjoint spatial regions•Neigborhood function:

•two points are considered similar iff they fall in the same region.

•points disregarded by R will be virtually deleted from trajectories and spatio-temporal patterns.

REGIONS-OF-INTEREST

Page 21: Trajectory Pattern Mining

21/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

ROIConstruction

Popular pointsdetection

DiscoveringROI

TrajectoryPreprocessing

Region-Of-Interest

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

•The regions associated with each point, i.e., NR(x, y), are essentially used as labels representing events of the form “the trajectory is in region NR(x, y) at time t”

•the methods developed for extracting frequent TAS’s can be directly applied to the translated input sequences, and each pattern (TAS) of the form A →B represents the set of T-patterns

Page 22: Trajectory Pattern Mining

22/37

When Regions-of-Interest are not provided by external means they have to be automatically computed through some heuristics.

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

ROIConstruction

Popualr pointsdetection

DiscoveringROI

TrajectoryPreproceing

Region-Of-Interest

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Static preprocessed spatial regions

The underlying idea is that locations frequently visited by moving objects probably represent interesting places

consideration the density of spatial regions.

Page 23: Trajectory Pattern Mining

23/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

ROIConstruction

Popular pointsdetection

DiscoveringROI

Region-Of-Interest

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

•Assuming to know a suitable set of RoI, applying them to the T-pattern mining problem simply consists in preprocessing the input sequences to corresponding sequences of RoI.

•provide a model for such point movement, e.g., linear regression.it is not obvious which time-stamp should be associated with the event “Region A” in the translated sequence.

•Different way in this paper: 1. if the trajectory starts at time t from a point already inside a region A, yield the couple (A, t);

2.An object can enter several times in a region, and each entry will be associated with a different time-stamp.

TrajectoryPreprocessing

Trajectory preprocessing

Page 24: Trajectory Pattern Mining

24/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

ROIConstruction

Popular pointsdetection

Region-Of-Interest

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

When Regions-of-Interest are not known a priori, someheuristics that enable to automatically identify them areneeded. Several different methods are possible:

• selecting among a database of candidate places • automatically computing candidate places through the analysis of trajectories, EX: selecting all minimal square regions that were visited by at least 10% of the objects;• mixing the two approaches

TrajectoryPreprocessing

Discovering Region-of-Interest

DiscoveringROI Second type:

•dense (i.e., popular) points in space are detected

•a set of significant regions are extracted to represent them succinctly.

Page 25: Trajectory Pattern Mining

25/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

ROIConstruction

Region-Of-Interest

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

•modeling the popularity of a point as the number of distinct moving objects that pass close to it w.r.t. a neighborhood function.

•discretize the working space through a regular grid with cells of small size.

•set cell width at a given fraction of the chosen neighborhood. Then, the density of cells is computed by taking each single trajectory and incrementing the density of all the cells that contain any of its points

TrajectoryPreprocessing

Popular points detection

DiscoveringROI

Popular pointsdetection

Page 26: Trajectory Pattern Mining

26/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Popular pointsdetection

Region-Of-Interest

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

DiscoveringROI

TrajectoryPreprocessing

ROI construction

ROIConstruction

In general, the set of popular regions can be extremely large – even infinite, if we work on a continuous space.Therefore, some additional constraints should be enforced to select a significant, yet limited, subset of them.

(i) Each r R ∈ forms a rectangular region(ii) sets in R are pairwise disjoint;(iii) all dense cells in G are contained in some set r R∈ ;

(iv) all r R ∈ have avg (i, j) ∈r G( i, j) ≥ δ(v) Assuming that r ∈ R has size h × k, all its rectangular supersets r r ⊇ of size (h + 1) × k or h × (k + 1) violate (iv) or r and r contain exactly the same number of dense cells.

Page 27: Trajectory Pattern Mining

27/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Popular pointsdetection

Region-Of-Interest

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

DiscoveringROI

TrajectoryPreprocessing

ROI construction

ROIConstruction

Page 28: Trajectory Pattern Mining

28/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

ROI construction example

Page 29: Trajectory Pattern Mining

29/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

T-Patterns with Static ROI

Page 30: Trajectory Pattern Mining

30/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Implement

A step-wiseheuristic

Dynamicneighborhood

Approach

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

T-Patterns with Dynamic ROI

T-patterns exacerbate the difficulty of the density estimation task in two ways:

(i) the dimensionality of working spaces of TAS’s grows less quickly

(ii) the sequence component in each TAS strongly limits the number of instances that can be found within each input sequence, making the density estimation task easier.

Page 31: Trajectory Pattern Mining

31/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Implement

Dynamicneighborhood

Approach

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

A step-wise heuristic

any frequent T-pattern of length n + 1 is the extension of some frequent T-pattern of length n, as stated by the following property:

A step-wiseheuristic

This property implies that the support of a T-pattern is less than or equal to the support of any its prefixes, allows us to adopt a level-wise approach by mining step-by-step

Page 32: Trajectory Pattern Mining

32/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Dynamicneighborhood

Approach

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Implementation of the method

A step-wiseheuristic

only a segment of such trajectories really needs to be searched, since we only need to find continuations of the pattern pn, and no point occurring before the end time of pn can be appended to pn to obtain pn+1. Therefore, any point occurring beforesuch end time can be removed from the trajectory.

Implement

Page 33: Trajectory Pattern Mining

33/37

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Implement

Dynamicneighborhood

Approach

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

A step-wiseheuristic

Page 34: Trajectory Pattern Mining

34/37

Experiment-Real data

Background &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

The real data used in these experiments describe the GPStraces of a fleet of 273 trucks in Athens, Greece, for a total of112203 points6. Running both the Static RoI T-pattern andDynamic RoI T-pattern algorithms with various parameter settings

Dynamic approach:

Static approach:

(Δt1,Δt2) ∈ [330, 445] × [116, 190] (Δt1,Δt2) ∈ [400, 513]×[41, 61]

Page 35: Trajectory Pattern Mining

35/37

Performance-synthetic dataBackground &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Page 36: Trajectory Pattern Mining

36/37

Performance-synthetic dataBackground &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

Page 37: Trajectory Pattern Mining

37/37

ConclusionBackground &Related Work

ProblemDefinition

Region-Of-Interest

Introduction

T-Patterns with Static ROI

T-Patterns with Dynamic ROI

Experiment &Conclusion

Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping

T-patterns are a basic building block for spatio-temporal data mining, around which more sophisticated analysis tools can be constructed, including:• integration with background geographic knowledge• adequate visualization metaphors for T-patterns• adequate mechanisms for spatio-temporal querying