linear regression analysis...

63
2017/12/20 1 臨床研究迴歸分析的應用 Wen, ShuHui [email protected]

Upload: others

Post on 27-Dec-2019

12 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 1

臨床研究迴歸分析的應用

Wen, ShuHui

[email protected]

Page 2: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

迴歸分析 (regression analysis)

2017/12/20 2

Response (研究感興趣的變數 , Y)

1. 連續型 e.g. BMI

(Multiple linear regression)

2.兩分類型e.g. 胖(Logistic regression)

3.存活時間 (追蹤研究) e.g.

time to death

王老師課程(Cox regression)

性別

年齡

教育程度

生化檢驗值

QoL

Page 3: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 3

Example

停經婦女之骨質密度BMD偏低,可能導致易骨折

older, heavier

高脂飲食者會有較高之LDL cholesterol,可能增加心血管疾病風險

They might be smokers and overweight.

Page 4: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 4

Multi-predictor analysis (regression model)

Potentially complex relationship in observational study A continuous outcome (Y, e.g. BMD, LDL) is related to a

risk factor (X1 e.g.停經, 高脂飲食)

But the risk factor of interest might be related to other factors (X2, e.g. age, BMI,smoke ) which also predict the outcome.

Similarly, for experiments (e.g. clinical trials) If randomization is implemented, confounding might not be

an issue.

For Multi-center trials, need to adjusted for clinical center.

When baseline differences are apparent between case and control group.

Y=α+β1X1+β2X2+...+βkXk+error

Page 5: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 5

以文獻(Å kesson et al. 2006)為例

探討鎘暴露對骨頭的影響

骨骼傷害(Y):因為鈣及磷酸的流失,以及因為腎損壞而抑制維他命D羊巠化反應,造成骨質疏鬆及軟化。

評估鎘的暴露量(X)和身體含量時,血中的鎘含量可表示最近的暴露量,尿中的鎘可表示身體的含量

採用multiple linear regression

可能還有其他影響因素(X2, X3,…,Xk)

Page 6: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 6

Statistical analyses

Data from two independent groups of subjects were compared by the Mann-Whitney U-test. We used Spearman rank correlation (rs) or Kendall’s tau to assess univariate associations (p ≤ 0.1). In multiple linear regression models, each bone-related variable was evaluated in relation to cadmium, potential confounders (factors associated with both cadmium and bone) and effect modifiers (factors associated with bone).We explored possible interactions in the model.

Page 7: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 7

Statistical analyses

Because the season of sampling correlated with blood and urinary cadmium, BMD, PTH, U-DPD, and urinary calcium, it was included in the models.

Residual and goodness-of-fit analyses indicated no deviation from a linear pattern in the regression models. The final regression model included, apart from cadmium, only statistically significant variables (p ≤ 0.05). All tests were two sided, and statistical evaluation was performed using SPSS (version 12.01; SPSS Inc., Chicago, IL, USA).

Page 8: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 8

Page 9: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 9

Page 10: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 10

Page 11: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 11

Outline

Correlation

Multiple linear regression

Predictor selection

Interaction

Other extended cases

Page 12: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 12

Example: FEV data

一秒最大呼氣量(FEV)

FEV 與抽菸的關係?

Other related factors, e.g. age, gender

Page 13: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 13

FEV data

Page 14: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 14

Analysis steps

Step1: Present the descriptive the clinical

features for FEV and other influencing factors.

Step2: Explore the correlation between FEV

and X1…Xk.

Step3: Build up the multiple linear regression

model and check for model adequacy.

Step4: Model revision or selection.

Step5: Interpretation the result (model).

Page 15: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 15

Step2: Explore the correlations

SPSS: Analyze Correlation Pairwise

相關係數處有三個選項

1.相關係數: For continuous Xs.

2.Kendall’s tau: For ordinal Xs

3.Spearman: For nominal Xs.

Page 16: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 16

Correlation matrix (recall Tab2 in paper

2006)

可將output的圖(選圖後滑鼠於圖上點兩下)直接編輯成下表

Page 17: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 17

Add the scatter plot Graph Scatter plotMatrix plot

Page 18: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 18

Matrix scatter plot

散佈圖與相關係數矩陣搭配著看

相關係數看出正相關且達顯著

散佈圖可看出是否為線性相關

Page 19: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 19

For nominal variables, Spearman rs is

more suitable.

Look at the correlation of FEV and gender(or

smoke)

Page 20: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 20

Spearman rs

FEV 與性別(0=female,1=male)有關,男性其FEV較大

FEV 與抽菸(0=No,1=Yes)有關,抽菸者其FEV較大

Page 21: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 21

FEV vs. smoke

抽菸者FEV

值大?

可能的原因是抽菸者多為男性或者年齡較大(體型較大)

Confounder?

Page 22: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 22

Summary for bivariate correlation

For continuous outcome (Y)

If factors (Xs) are continuous, we show the

Pearson correlation coefficient.

If factors (Xs) are categorical, we list the

Spearman correlation coefficient.

Also, provide the plots as possible.

Page 23: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 23

Summary of correlation analysis

FEV 與抽菸的關係?

Others related factors, e.g. age, gender

FEV 與 身高、年齡都呈正相關,且有統計上顯著相關(p<0.05)

FEV與性別有關(p<0.05),男性其FEV值越大

FEV與抽菸有關(p<0.05),抽菸者其FEV值越大,但此現象可能是有confounder造成,例如性別、年齡、身高尚未考慮

Page 24: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 24

Step3: Build up the multiple linear

regression model

Now, we want to build the model as

FEV=α+β1age+β2sex+β3Hgt+β4smoke

Page 25: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 25

Multiple linear regression

Page 26: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 26

Check for model adequacy.

點進”圖形”後選擇常態機率圖(為檢驗資料是否符合常態性假設)

畫殘差圖(Y axis:殘差值, X axis:FEV值)

為判斷同質性假設

若此兩假設不符

則後續檢定迴歸

係數之結果可能

會不對(not valid)

Page 27: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 27

Results-1:Pearson Correlation matrix

除了看出FEV與因子(Xs)間相關以外,Xs彼此也有些達統計相關e.g. age vs. Hgt

Page 28: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 28

Results-2: R-square

FEV的變異可被模式中所有因子共同解釋的變異比例為 0.774。換句話說,還有 22.6% 為誤差,可能還有其他影響FEV因素未被考慮。

Page 29: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 29

Result-3:Collinearity diagnosis(共線性)

Collinearity: 意指Xs彼此高相關而影響β值估計,如此則須再 revise the model.

檢查指標為VIF. 若VIF>10則表示該變項與其他變數高相關,可考慮拿掉

Page 30: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 30

Result-4-1: Normality

圖中直線若接近45度直線則表示常態性假設成立

通常sample size若夠大可不用太擔心常態性不成立

如果常態性不成立,一般會將Y轉換成log(Y) 重新做regression

Page 31: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 31

Result-4-2: Homogeneous

正常圖形應該看來是雜亂無pattern

右圖看來有點扇形(Fan shape)可能是違反同質性

另外Y-axis標準化殘差值落在(-3,3)之外的就是異常值

Page 32: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 32

Outliers

下表即為outliers. 一般也可以拿掉後重做regression. (Do you know how to do it?)

Page 33: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 33

Influential point

Criterion Bound

Leverage, h >2/n

Studentized residual, r >3

DFFIT >2

Cook's distance >1

High-leverage point could be x-outlier. Influential point, i.e. one or more β-hat would change by a large amount.

Page 34: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 34

Influential point (2)

Reference: Page 122 from Vittinghoff et al. 2005

Page 35: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 35

Outliers or influential points?

有outliers. 無影響點(max cook’s distance<1)

Page 36: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 36

Step4: Model revision or selection.

根據初步分析結果

FEV可被 age, gender, smoke. Height解釋變異之比例達77.4%

常態性符合,同質性雖不甚符合,但 n 夠大

無共線性問題,無影響點,有 5 個異常值

Model revision

試著將 outliers 去掉後再做一次

Page 37: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 37

先儲存標準化殘差,再利用selection功能將outlier去掉

執行完regression

後請到資料

選擇觀察值

Page 38: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 38

Delete outliers and do regression again

條件為 abs(ZRE_1) <=3

Page 39: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 39

Interpretation of regression analysis

重新做regression後的結果即可仿照 page 23-

33步驟 檢視統計結果

N=649 (原本有 654筆)

Page 40: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 40

Adjusted R-square (new)

R-square is 78.7%. A little larger than

previous one.

Page 41: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 41

Normality, Collinearity, Homogeneous

Normality 符合

常態機率圖 接近45度直線

Collinearity

VIF 皆小於 10, 無共線性

Homogeneous

殘差圖與之前一樣

Outliers

雖有但很輕微(很接近3)故不再排除

Page 42: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 42

Interpretation of regression analysis

Regression model

FEV=-4.521+0.057Age+0.131Sex-

0.067Smoke+0.042Hgt

Page 43: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 43

1. 拿掉outlier 後regression model影響不大

2. 與FEV顯著相關之變項仍是 Age, Sex, Height

有異常點

Page 44: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 44

整理成 paper 之表格 (供參考)

Beta 95% CI

Factors Coefficient lower bound upper bound p-value

Age(yr) 0.057 0.039 0.075 <0.001*

Sex 0.131 0.069 0.194 <0.001*

Smoke -0.067 -0.177 0.044 0.236

Height(cm) 0.042 0.038 0.046 <0.001*

Sex:0=female, 1=male. Smoke: 0=no, 1=yes. *: statistical significance

Table: Multiple linear regression analysis between FEV and factors.

Page 45: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 45

Solutions if Normality failed

對 Y 做轉換(特別在小樣本時) e.g, log(Y)

Model is log(Y)=α+βX

Interpretation of β

X每增加一單位,則Y會增加 (exp(beta)-1)%.

e.g. beta=0.2 change 22%

缺點:資料經轉換後,較不易解釋

How to do it?

先利用 compute 得到轉換後的Y

再利用剛剛學到的steps 2-4進行分析

Page 46: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 46

Solutions if Homogeneous failed

1. 亦可做轉換(尤其小樣本時) e.g. log(Y), 1/Y

2. 利用加權最小平方法(請洽 statisticians)

Page 47: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 47

Solutions if Collinearity exists

Model selection

利用模式選取的方式,放入較顯著的變項,以避免Xs之間之高相關

Forward, Backward, Stepwise regression

Stepwise 較常使用

Page 48: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 48

Stepwise regression

Page 49: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 49

Results

Page 50: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 50

Selected model

Model is FEV=-4.449+0.041Hgt+0.061Age+0.161Sex

(here is for all data, plz use data without outliers)

Page 51: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 51

Interaction

若Z與X對Y的交互作用存在,則Z的值不同時,X與Y的關係會改變

統計角度,可畫出 mean plot for each X*Z group

模式中要加入interaction effect, 作法是

加入X與Z的加乘項XZ,檢定XZ的迴歸係數是否為0,若顯著則X與Z之 interaction 存在

Page 52: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 52

Sex vs. Smoke?

Page 53: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 53

Check for mean FEV

2

2.5

3

3.5

4

Female Male

Mea

n F

EV

Nonsmoker

smoker

由敘述性統計值看來

男生的FEV值與女生的FEV值之差異會因抽菸狀態不同而不同

可能有交互作用存在(from statistical viewpoint)

2

2.5

3

3.5

4

nonsmoker smoker

Mea

n F

EV

Female

Male

此處尚未考慮Age, Height的影響喔,若加入confounder後關係會再改變!

(Multiple regression)

Page 54: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 54

Add interaction effects

檢驗抽菸與性別之交互作用

1. 先新增加乘項(name it as “interaction”)

Page 55: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 55

Build up the model

將 interaction 選入自變數清單

Page 56: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 56

Results (here is for all data, plz use data

without outliers)

Regression model

抽菸與性別之交互作用存在,此時的smoke 主效應亦存在

Page 57: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 57

Which one is the final model?

Add the interaction. (here is for all data)

Mean FEV=-4.422+0.066age+0.135Sex-0.183Smoke+0.041Hgt+0.234Interaction

Page 58: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 58

Interpretation

Sex Smoke InteractionEstimated FEV

(adjusted for age, height)

female(0) No(0) 0 baseline

female Yes(1) 0 -0.183

male(1) No 0 0.135

male Yes 1 0.186

Mean FEV= -4.422+0.066age+0.135Sex-0.183Smoke

+0.041Hgt+0.234Interaction

女性者抽菸其FEV值會較未抽菸者低0.183(l),男性者抽菸其FEV值會較未抽菸者高0.051(l)。可能原因是?

Page 59: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 59

會是身高影響?

Page 60: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 60

Further issues

What if Y is not continuous?

If Y is binary, say disease vs. healthy. Suggest use the logistic regression (future topic).

What if Y are repeated measure, say pre/post Y?

Might use post-Y as response variable, and adjusted for pre-Y and Xs. (For 2 time points)

For several time points, suggest use “repeated-measure” ANOVA. (請洽statisticians)

Page 61: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

在職進修: 歡迎報考慈大公衛所

2017/12/20 61

Page 62: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 62

References

1. M. Pagano, K. Gauvereau. Principles of Biostatistics(2nd Ed). Australia ; Pacific Grove, CA : Duxbury, 2000. (歐亞書局代理)

2. Rosner B. (2006) Fundamentals of Biostatistics (6th ed). Belmont, CA : Thomson-Brooks/Cole (歐亞代理)

3. Vittinghoff E., Glidden D.V., Shiboski S.C., McCulloch C.E. Regression Methods in Biostatistics. Spreinger 2005.

4. 史麗珠 (2005),進階應用生物統計學。學富文化,台北。

Page 63: Linear regression analysis 線性迴歸分析hlm.tzuchi.com.tw/epi-stat/images/class/2015/2015_class9.pdf · 迴歸分析(regression analysis) 2017/12/20 2 Response (研究感興趣

2017/12/20 63