joint with yang-jin kim, eun hee choi , chung mo nam

40
Nonparametric analysis of recurrent events with incomplete observation gaps Jinheum Kim University of Suwon Korea Joint with Yang-Jin Kim, Eun Hee Choi, Chung Mo Nam 2011-12-17 Joint 2011 Taipei Symposium 1

Upload: gusty

Post on 24-Feb-2016

39 views

Category:

Documents


0 download

DESCRIPTION

Nonparametric analysis of recurrent events with incomplete observation gaps Jinheum Kim University of Suwon Korea. Joint with Yang-Jin Kim, Eun Hee Choi , Chung Mo Nam . Recurrent event data. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Nonparametric analysis of recurrent events with incomplete observation gapsJinheum KimUniversity of SuwonKorea

Joint with Yang-Jin Kim, Eun Hee Choi, Chung Mo Nam 2011-12-17

Joint 2011 Taipei Symposium 1

Page 2: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Recurrent event data Recurrent events arise frequently in clinical

trials in which patients are followed longi-tudinally and response of interest is tran-sient

Examples Transient ischemic attacks in patients with

cerebrovascular disease Seizures in epileptic patients Tumor recurrences in cancer patients

2011-12-17Joint 2011 Taipei Symposium

2

Page 3: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Motivating example Young Traffic Offenders Program(YTOP)

data

What is YTOP? Conducted at the University of Missouri-Co-

lumbia Health Sciences Center and other medical centers since late 1987

One-day educational intervention for people under 24 convicted of speeding by 20+ mph over the posted speed limit

2011-12-17 Joint 2011 Taipei Symposium

3

Page 4: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

YTOP: objective

To evaluate the effect of the program on reducing the rates of speeding violation of young drivers

2011-12-17Joint 2011 Taipei Symposium

4

Page 5: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

YTOP: characteristics Data were collected on 192 young drivers about their

speeding violation information since they obtained their driving license

29 subjects(13: YTOP, 16: non-YTOP) received suspensions These subjects dropped out of the study when the suspen-

sion began, and then came back to the study after complet-ing suspension

These intermittent dropouts are called observation gaps The observation gap is determined by the starting and ter-

minating time of the suspension BUT, the observation gaps of YTOP data are incomplete

due to missing terminating times

2011-12-17 Joint 2011 Taipei Symposium

5

Page 6: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Previous works & their limitations

2011-12-17Joint 2011 Taipei Symposium

6

Farmer et al.(2000, Brain Injury): two-sample rank test Ignore the detailed information about the conviction

process Can assess only the long-term impact

Sun et al.(2001, JASA): nonparametric & semi-parametric methods Treat YTOP data as recurrent event data Evaluate both short- and long-term effects applying the

conviction process information Ignore the observation gaps

Page 7: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Previous works & their limitations

2011-12-17Joint 2011 Taipei Symposium

7

Zhao & Sun(2006, CSDA) Similar to Sun et al. (2001, JASA), but consider

the observation gaps Suspension periods: NOT deterministic, BUT vary-

ing and possibly unknown

Kim & Jhun(2008, StatMed) Treat incomplete observation gaps as terminating

times being interval-censored Only use the first suspension and ignore all the

other suspensions after the first one.

Page 8: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Objective of this talk Use the multivariate-interval censored

data to estimate the distribution of the terminating time

Propose a non-parametric test to com-pare the conviction rates of two groups

Evaluate the proposed test via simula-tion & reanalyze the YTOP data

2011-12-17Joint 2011 Taipei Symposium

8

Page 9: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Illustrative data: notation

2011-12-17Joint 2011 Taipei Symposium

9

●: conviction time

X: starting time of the suspension

?: terminating times of the suspension (unknown)

■ : end of study time

Page 10: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Diagram10

??

? ??

Page 11: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

IC data: objective

2011-12-17Joint 2011 Taipei Symposium

11

Want to estimate the distribu-tion of the terminating time us-ing the interval censored data

Page 12: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Two types of IC data

2011-12-17Joint 2011 Taipei Symposium

12

Univariate IC data cases II, III, & V having one observa-

tion gaps eg: (5,9)], (7, ), (2,10]

Multivariate IC data case IV having two observation gaps eg: (2,5]. BUT, …

Page 13: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Multivariate IC data

2011-12-17Joint 2011 Taipei Symposium

13

How can we define the origin time of the second terminating time?

Page 14: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Constructing IC data

2011-12-17Joint 2011 Taipei Symposium

14

Set the origin to be the first conviction time after the previous suspension eg: For the second suspension of case IV,

Hence, estimate the distribution of the terminating time using

,9(7 55 2 4]

(2,4](5,9], (7, ), (2,10],(2,5]

Page 15: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

EM-ICM

2011-12-17Joint 2011 Taipei Symposium

15

Hybrid algorithm(Wellner & Zhan, 1997, JASA)

Use R package interval

Page 16: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Notation & assumption the number of occurrences of the event

up to time

function indicating if subject is under observation at time

cumulative mean (frequency) function (CMF)

independent2011-12-17Joint 2011 Taipei Symposium

16

( ) :iN t,t , ,1i n

( ) :iY t

& :i iN Y

ti

( ) { ( )}:i it N tE

Page 17: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Data

: the th recurrent event time of subject

: 0-1 indicator whether or not sub-ject has experienced observation gaps

: starting time of the th observa-tion gap of subject

2011-12-17Joint 2011 Taipei Symposium

17

{( , , , 1, , ; 1, ,) ; },1,ik i ij i it Y h i n k n mj D

iYikt

ijh

k

j

i

i

i

Page 18: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Unobservable terminating times

BUT, the terminating time of each observation gaps is NOT avail-able, that is, interval-censored

So, need to estimate the distribu-tion of the terminating time of the observation gap

2011-12-17Joint 2011 Taipei Symposium

18

Page 19: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

IC data

2011-12-17Joint 2011 Taipei Symposium

19

: minimum of greater than

: minimum of greater than

can be right-censored

{( , ], 1, , ; 1, , }ij ij iL R i n j m I

0ijt

1ijt, 1i jh

ijh

ijR

ikt

0 1 0,ij ij ij ij ij ijRL h t t t

ikt

Page 20: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Terminating time distribu-tion

2011-12-17Joint 2011 Taipei Symposium

20

: the terminating time for the th observation gap of subject

: the mid-point of the th equivalence class

: mass of at

ijSj

l( 1, , )l l qa

laijS( )l ij lP Sf a

i

Page 21: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Redefine a risk set

Replace by the esti-mated expected risk set defined as

2011-12-17Joint 2011 Taipei Symposium

21

0*

1 1

11

ˆ( ) ( )( ) 1

ˆ( ) ( )

i

i

m qij ij l ij l

i i m qj l

ij ij r ij rrj

J t I t t a R fY t Y

J t I L a R f

( )iY t

Page 22: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Equivalence classes

2011-12-17Joint 2011 Taipei Symposium

22

Case II: (L R]Case III: (L Case IV: (L R] (L R]Case V: (L R]Combine L R L L R R R

Equivalence classes: (2,4], (5,5], (7,9] Mid-points: 3, 5, 8

Assume with terminating time

1 2 3, ( 5)( 3) ,, ( 8)P S Pf S f P S f :S

1 2 3 1ff f

Page 23: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Unadjusted(naive) risk set23

1 1 1 1 1 1 1 1 1 1 1 1

1 1 1 1 1 1 0 0 0 1 1

1 1 1 1 1 1 1 1 0 0

1 1 1 0 0 1 1 0 0 1 1

1 1 1 0 0 0 0 0 0 0 1 1

Page 24: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Modified(proposed) risk set24

1 1 1 1 1 1 1 1 1 1 1 1

1 1 1 1 1 1 0 0 0 1 1 1 1 1 1 1 1 1 1 0 1 1 1 1 0 p 1 1 1 0 1 1 1 1 1 0 q q r r r 1 1 1

1 2

1 ,pf

f f

1

1 2 3

ff

qf f

1 3

21

2

f ff

rff

Page 25: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Estimated CMF Assuming that all subjects have the common CMF,

total number of events observed at time

total number of subjects at risk at time

2011-12-17Joint 2011 Taipei Symposium

25

*

*0

( )ˆ ( )( )

t dN st

Y s

1 2( ) ( ) ( ) ( ),nt tt t

* *

1

(( :) ) ( )n

i ii

Y t dd t tN N

* *

1

( :) ( )n

ii

Y tY t

t

t

Page 26: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Representation of estimated CMF

distinct event times across all individuals

Similar to the Nelson-Aalen estimator from survival analysis

Cook & Lawless(2007)

2011-12-17Joint 2011 Taipei Symposium

26

*

*:

( )ˆ ( )( )

jj t t

j

j

dN tt

Y t

1 2 :dt tt

Page 27: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Asymptotic properties Following Lin et al.(2001, JRSSB),

Consistent Asymptotically normal with consistently es-

timated variance,

Approximate pointwise CI:

2011-12-17Joint 2011 Taipei Symposium

27

* *2

* *1 :

( ) ( )ˆˆ { ( )} { [ ( ) ]}( ) ( )

j

ni j j

i ji j t t j j

Y t dN tvar t dN t

Y t Y t

1

/ 2ˆ ˆˆ( ) {log ( )exp(lo }g )t z var t

Page 28: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Comparison of two groups : the CMFs for group 1 and 2

: the number of subjects in the th group with

: the estimated CMF for the th group

2011-12-17Joint 2011 Taipei Symposium

28

g

g

1 2( ), ( )t t

1 2,n n1 2n nn

1 2ˆ ˆ( ), ( )t t

Page 29: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Two-sample test statistic Hypothesis:

Test statistic:

: maximum follow-up time across two groups

: non-negative predictable function

2011-12-17Joint 2011 Taipei Symposium

29

0 1 2: ( ) ( ), 0tH t t

1 20ˆ ˆ( ){ ( ) ( )}wT w s d s d s

( 0)

( )w t

Page 30: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Asymptotic distribution Following Cook et al.(1996, BCS) & Lin et

al.(2001, JRSSB), Asymptotically normal with consistently es-

timated variance,

Hence, asymptotically

2011-12-17Joint 2011 Taipei Symposium

30

*22

*01 1

( ) ˆˆ ( [ ( ) { ( ) )}]( )

) (gn

gkw gk

g k gg

Y svar T w s dN s d s

Y s

2 2ˆ ( ) (/ ~ 1)w wT var T

Page 31: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Simulation: data generationGenerate the first event time from an exponential distribu-tion with a hazard rate of

Generate

: probability of having an observation gap

2011-12-17Joint 2011 Taipei Symposium

31

1 ~ (1, )i B p

1it

p

i

Page 32: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Simulation: data generation If generate the starting time of

the first observation gap from

If generate the duration tine from an exponential distribution with a hazard rate of 2 resulting in the terminat-ing time

So, we get & (if ) 2011-12-17Joint 2011 Taipei Symposium

32

1 1( , 005)i iU t t

1iv

1ih

1it 1ih

1 1i ih v

1 1,i

1 1,i

1 1i

Page 33: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Simulation: the first cycle of data genera-tion

2011-12-17Joint 2011 Taipei Symposium

33

t

0 1it

0 w/ observation gap

1it

0 w/o observation gap1it

1 1i ih v

2it

1 2i it t

1ih

1iv 2it

1 1 2i i ih v t

?

Page 34: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Two-sample tests: design params

Event rate of each group

Weight: =0.05, 0.1, 0.2, 0.4 Fixed censoring at 5 Sample size: Replication: 1,000

2011-12-17Joint 2011 Taipei Symposium

34

( ) 1w t

1 1.5 2 1.5,1.4,1.3,1.2,1.1

50,100n

p

Page 35: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Two-sample tests: results(n=100)Tests

Proposed Naive Sun et al. Farmer et al.

0.05 1.5 0.047 0.055 0.047 0.0471.4 0.271 0.273 0.263 0.1901.3 0.755 0.740 0.740 0.5931.2 0.978 0.972 0.979 0.918

0.1 1.5 0.060 0.055 0.056 0.0391.4 0.246 0.231 0.248 0.1821.3. 0.704 0.668 0.677 0.5381.2 0.981 0.966 0.980 0.907

0.4 1.5 0.062 0.049 0.068 0.0541.4 0.190 0.151 0.179 0.1381.3. 0.633 0.440 0.576 0.4231.2 0.941 0.793 0.914 0.772

35

2p

Page 36: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

YTOP data: estimated CMFs

2011-12-17Joint 2011 Taipei Symposium

36

Page 37: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

YTOP data: p-values

2011-12-17Joint 2011 Taipei Symposium

37

Tests CovariateYTOP Gender

Proposed 0.836 0.135Naive 0.735 0.161Sun et al. 0.456 0.008Farmer et al. 0.113 0.533

Page 38: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Summary Propose the expected risk set to incorporate

the observation gaps

Our two-sample test shows to satisfy the level & has the higher power than the other tests

No significant difference in traffic conviction rates between YTOP and non-YTOP partici-pants & between male and female

2011-12-17Joint 2011 Taipei Symposium

38

Page 39: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Extend to …

2011-12-17Joint 2011 Taipei Symposium

39

A regression model to include a time-de-pendent covariate where denotes the time at which a sub-ject undergoes the traffic education pro-gram

A case to allow a termination event such as death

( ) ( ),Z t I T t T

0( ) { ( ) | ( )} ( ) exp{ ( )}i i i it E N t z t t z t

Page 40: Joint with Yang-Jin Kim,  Eun Hee Choi , Chung Mo Nam

Thank you!40