kevin cummins statistical phenomenon 02-09-16

40
1 Interesting Statistical Phenomenon San Diego State University DSW/IRSU Brown Bag 3/16 Kevin Cummins

Upload: kevin-cummins

Post on 13-Apr-2017

109 views

Category:

Education


1 download

TRANSCRIPT

Page 1: Kevin Cummins Statistical Phenomenon 02-09-16

1

Interesting Statistical Phenomenon

San Diego State UniversityDSW/IRSU Brown Bag

3/16

Kevin Cummins

Page 2: Kevin Cummins Statistical Phenomenon 02-09-16

2

Definitions• Principle: a comprehensive and

fundamental law, doctrine, or assumption

• Fallacy: a false or mistaken idea• Paradox: a statement that is seemingly

contradictory or opposed to common sense and yet is perhaps true

Page 3: Kevin Cummins Statistical Phenomenon 02-09-16

3

Outline

• Objective• Simpson’s Paradox• Will Roger’s Paradox • Lord’s Paradox • Berkson’s Paradox• Monte Hall Paradox • Others

Page 4: Kevin Cummins Statistical Phenomenon 02-09-16

4

Objectives

• Create awareness of several statistical issues that might arise during observational research

• Sneak in an introduction to mosaic plots• Learn how to win prizes on game shows

Page 5: Kevin Cummins Statistical Phenomenon 02-09-16

5

Outline

• Objective• Simpson’s Paradox• Will Roger’s Paradox• Lord’s Paradox • Berkson’s Paradox• Monte Hall Paradox• Others

Page 6: Kevin Cummins Statistical Phenomenon 02-09-16

6

Delayed On Time

AlaskaAirline

17813%

1,33888%

AmericaWest

66110%

5,80490%

Which Airline Should You Fly?

Cells contain counts and row %

Page 7: Kevin Cummins Statistical Phenomenon 02-09-16

7

Page 8: Kevin Cummins Statistical Phenomenon 02-09-16

8

Alaska Airlines America West

.11.05

.17 .14.08

.29

Page 9: Kevin Cummins Statistical Phenomenon 02-09-16

9

Simpson’s Paradox

Occurs when the relationship between two (categorical) variables is reversed after a third variables is considered.

The relationship between two variables differs within subgroups compared to that observed for the aggregated data.

Page 10: Kevin Cummins Statistical Phenomenon 02-09-16

10

Simpson’s Paradox: Remedies/Responses

Study DesignUse ExperimentsCollect appropriate covariate data

Know the Research SystemCollect appropriate covariate dataAnalytically introduce conditionals

(i.e. moderators/covariates)Use appropriate interpretations

Page 11: Kevin Cummins Statistical Phenomenon 02-09-16

11

Outline

• Objective• Simpson’s Fallacy • Will Roger’s Paradox • Lord’s Paradox• Berkson’s Paradox• Monte Hall Paradox• Others

Page 12: Kevin Cummins Statistical Phenomenon 02-09-16

WRP: Health Insurance Example

1996 1997

HMO $98/Subscriber $119/Subscriber

PPO $126/Subscriber $142/Subscriber

PPO No Longer Free

Young et al. 1999

Cells are cost to employer (a hospital system)

Expected Lower Expenditures

Page 13: Kevin Cummins Statistical Phenomenon 02-09-16

13

Will Roger’s Paradox

“When the Okies left Oklahoma and moved to California, they raised the average intelligence level in both states.”

IC: uspsstamps.com

Page 14: Kevin Cummins Statistical Phenomenon 02-09-16

14

The Will Rogers Paradox (WRP) is observed when moving an element from one set to another set the mean values of both sets change in the same direction.

The effect will occur when both of these conditions are met:

1. The element being moved is below average for its current set.

2. The element being moved is above the current average of the set it is entering.

Page 15: Kevin Cummins Statistical Phenomenon 02-09-16

15

WRP: Effect of Shifting One Observation

Page 16: Kevin Cummins Statistical Phenomenon 02-09-16

WRP: Health Insurance Example

1996 1997

HMO $98/Subscriber $119/Subscriber

PPO $126/Subscriber $142/Subscriber

The 1997 migration moved lower utilization PPO subscribers into the HMO

Young et al. 1999

Low use

High use

Page 17: Kevin Cummins Statistical Phenomenon 02-09-16

17

Will Rogers: Remedies/Responses

Know Your SystemIn This Case:

Statistically adjust/stratify for baseline costs

Page 18: Kevin Cummins Statistical Phenomenon 02-09-16

18

Outline

• Objective• Simpson’s Fallacy• Will Roger’s Paradox• Lord’s Paradox • Berkson’s Paradox • Monte Hall Paradox• Others

Page 19: Kevin Cummins Statistical Phenomenon 02-09-16

19

Lord’s Paradox

• Occurs in situations where change score analysis and ANCOVA yield apparently conflicting results

Page 20: Kevin Cummins Statistical Phenomenon 02-09-16

20

An Extreme Example• Assessment of a supplemental educational program

• 10 schools, 5 schools opted into the programs (free-choice)

• 1 student from each school assessed

• Pre and post assessments given

• No random/sampling/measurement error (simplified)

Page 21: Kevin Cummins Statistical Phenomenon 02-09-16
Page 22: Kevin Cummins Statistical Phenomenon 02-09-16

22

Two StatisticiansStatistician One• Calculates

difference scores for each group

• Change scores are the same for both groups

Statistician Two• Adjusts for initial

score• Finds group

differences

Page 23: Kevin Cummins Statistical Phenomenon 02-09-16

23

Two StatisticiansPaired t-TestStatistician One

Data: group 1 vs. group 2

t = -0.002, df = 299, p-value = 0.99

ANCOVAStatistician Two

Coefficients: Value Pr(>|t|) (Intercept) 15.0 0.00 Pre 0.5 0.00 Group 20.0 0.00

Page 24: Kevin Cummins Statistical Phenomenon 02-09-16
Page 25: Kevin Cummins Statistical Phenomenon 02-09-16

25

Lord’s Paradox: Remedies/Responses

“With the data available…there is no logical or statistical procedure that can be counted on to make allowances for pre-existing conditions between groups.” Frederic Lord

•Know your system– Match your samples

•Use the best descriptive statement(s) that match your questions•Use and report multiple approaches (Wright 2006)

•Graph your data

Page 26: Kevin Cummins Statistical Phenomenon 02-09-16

26

Outline

• Objective• Simpson’s Fallacy• Will Roger’s Paradox• Lord’s Principle• Berkson’s Paradox• Monte Hall Paradox • Others

Page 27: Kevin Cummins Statistical Phenomenon 02-09-16

27

Berkson’s Paradox

An association reported from a hospital case-control study can be distorted

If cases and controls experience differential hospital admission rates with respect to the suspected causal factor

Page 28: Kevin Cummins Statistical Phenomenon 02-09-16

28

Typical Berkson Scenario

Example from Roberts et al. 1978

Investigated the relationship between circulatory and respiratory disease.

Sampled the general population and hospital populations.

Page 29: Kevin Cummins Statistical Phenomenon 02-09-16

29OR = 3.9 [95% CI: 1.4-10.9]

Circ

ulat

ory

Dis

ease

Page 30: Kevin Cummins Statistical Phenomenon 02-09-16

30

Circ

ulat

ory

Dis

ease

OR = 1.3 [95% CI: 0.9-2.3]

Page 31: Kevin Cummins Statistical Phenomenon 02-09-16

31

Berkson Example

Example from Lilienfeld and Stolley (1994)• No greater admission rate for subjects

with multiple conditions• Different rates of admission for cases and

controls• Results in an apparent association

between two conditions

Page 32: Kevin Cummins Statistical Phenomenon 02-09-16

Disease B

Disease A Case Control

Case 200 200

Control 800 800

Total 1000 1000

% with A 20 20

Disease B

Disease A Case Control

Case

Control

Total

% with A

110 17080 560

Community Hospital

P(H|A)=.50

P(H|B)=.10 P(H|!B)=70

100X .50X .10X .50X .70

100

X .70X .10

190 73058 23

OR=1 OR=4.5

Page 33: Kevin Cummins Statistical Phenomenon 02-09-16

33

Berkson’s: Remedies/Responses

– There is no safe analytical mitigation– Analysis of potential bias

• know your system• Sensitivity analysis

– Limit conclusions– Utilize multiple control pools– Consider alternative study design

Page 34: Kevin Cummins Statistical Phenomenon 02-09-16

34

Page 35: Kevin Cummins Statistical Phenomenon 02-09-16

35

Monty Hall Paradox

Page 36: Kevin Cummins Statistical Phenomenon 02-09-16

36

Miller et al. 1989

Page 37: Kevin Cummins Statistical Phenomenon 02-09-16

37

Review

Big PictureUse care to interpret observational studies

Know your system

Conditional ResponsesSimpson’sLord’sWill Roger’s

Perspective ProblemsBerkson’sMonte Hall

Page 38: Kevin Cummins Statistical Phenomenon 02-09-16

Doctor Tyrano, Look for a Covariate!

Cartoon used with with permission

Page 39: Kevin Cummins Statistical Phenomenon 02-09-16

39

Benford’s Law

Ones are the most common leading digit in most data.

Notice that if a data entry (base 10) begins with a 1, the entry has to be at least doubled to have a first significant digit of 2. However, if a leading digit begins with a 9, it only has to be increased by, at most, 11% to change the first significant digit into a 1.

Page 40: Kevin Cummins Statistical Phenomenon 02-09-16

40

Lindley’s Paradox

• Standard Sampling Theory VS. Bayesian Theory

Under some circumstances strong evidence against the null hypothesis doesn’t result in the null being rejected