logistic regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_machinelearning/04... · 2016....

24
Machine Learning 2015.06.20. Logistic Regression

Upload: others

Post on 13-Sep-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶

Machine Learning

𝑠𝑖𝑔𝑚𝑎 𝜶

2015.06.20.

Logistic Regression

Page 2: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 2

Linear Regression

• 임의의 데읎터가 있을 때, 데읎터 자질 간의 상ꎀꎀ계륌 고렀하는 것 수치형 목적 값 예잡

친구 1 친구 2 친구 3 친구 4 친구 5

í‚€ 160 165 170 170 175

몞묎게 50 50 55 50 60

Page 3: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 3

Classification

• 데읎터 자질 간의 상ꎀꎀ계륌 고렀하여 특정 대상윌로분류하는 것

• 닀륞 예• Email: Spam / Not Spam

• Tumor: Malignant / Benign

• POS tag: Noun / Not Noun

• 𝑊 ∈ {0, 1}

친구 1 친구 2 친구 3 친구 4 친구 5

í‚€ 160 165 170 170 175

몞묎게 50 50 55 50 60

읎상형 X O O X O

0:𝑁𝑒𝑔𝑎𝑡𝑖𝑣𝑒 𝐶𝑙𝑎𝑠𝑠 𝑒. 𝑔. 𝑁𝑜𝑡 𝑁𝑜𝑢𝑛, 𝐵𝑒𝑛𝑖𝑔𝑛 𝑒𝑡𝑐.1: 𝑃𝑜𝑠𝑖𝑡𝑖𝑣𝑒 𝐶𝑙𝑎𝑠𝑠 (𝑒. 𝑔. 𝑁𝑜𝑢𝑛,𝑀𝑎𝑙𝑖𝑔𝑛𝑎𝑛𝑡 𝑡𝑢𝑚𝑜𝑟 𝑒𝑡𝑐. )

Page 4: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 4

Classification

읎상형조걎

(Yes) 1

(No) 0

Classification of linear regression- Incorrect classification

Page 5: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 5

Classification

읎상형 ?

Threshold classifier output at 0.5:

If , predict “y = 1”

If , predict “y = 0”

읎상형조걎

(Yes) 1

(No) 0

PositiveNegative

Page 6: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 6

Classification

•Classification: y = 0 or 1

• ℎ𝜃 𝑥 can be > 1 or < 0

• Thus, denote range 0 ~ 1

• Logistic Regression: 0 ≀ ℎ𝜃 𝑥 ≀ 1

Page 7: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 7

Logistic Regression

• Classification Problem We want: 0 ≀ ℎ𝜃 𝑥 ≀ 1

• Early Hypothesis: ℎ𝜃 𝑥 = 𝑀𝑇𝑊 + 𝑏

• Need transmutable function by the classification problem activation function

• Activation function: 𝑔 𝑧

• Resent Hypothesis: ℎ𝜃 𝑥 = 𝑔 𝑀𝑇𝑊 + 𝑏

• Sigmoid function 𝑔 𝑧 =1

1+𝑒−𝑧

• ℎ𝜃 𝑥 =1

1+𝑒−𝑀𝑇𝑥

Page 8: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 8

Logistic Regression

𝑥𝑖

𝑥 𝑀




𝑥𝑖

𝑥 𝑀




Linear Regression Logistic Regression

Page 9: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 9

Interpretation of Hypothesis Output

• ℎ𝜃 𝑥 = 𝑒𝑠𝑡𝑖𝑚𝑎𝑡𝑒𝑑 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑊 𝑡ℎ𝑎𝑡 𝑊 = 1 𝑜𝑛 𝑖𝑛𝑝𝑢𝑡 𝑥

• 슉, 확률 값읎 높은 것윌로 분류

• Example:

• Conditional Probability likelihood (MLE)

• 입력 x와 파띌믞터 w로 y륌 찟음

• ℎ𝜃 𝑥 = 𝑚𝑎𝑥𝑃 𝑊 = 1 𝑥; 𝑀)

• 𝑃 𝑊 = 1 𝑥; 𝑀) + 𝑃 𝑊 = 0 𝑥; 𝑀) = 1

• 𝑃 𝑊 = 0 𝑥; 𝑀) = 1 − 𝑃 𝑊 = 1 𝑥; 𝑀)

𝑖𝑓 𝑥 =𝑥0𝑥1=1

읎상형ℎ𝜃 𝑥 = 0.75

75%가낎읎상형읎될수있음

Page 10: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 10

Logistic Regression Decision

• Hypothesis: ℎ𝜃 𝑥 = 𝑔(𝑀𝑇𝑥)

• Activation function: 𝑔 𝑧 =1

1+𝑒−𝑧

• Prediction

y=1 𝑖𝑓 ℎ𝜃 𝑥 ≥ 0.5y=0 𝑖𝑓 ℎ𝜃 𝑥 < 0.5

Page 11: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 11

Decision Boundary

Andrew Ng

Page 12: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 12

Non-linear decision boundaries

Andrew Ng

Page 13: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 13

Cost Function

Training set:

How to choose parameters ?

𝑚 examples

Page 14: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 14

Cost Function

• Linear regression: 𝐜(𝜃) =1

2 𝑖=1𝑚 𝑊𝑖 − ℎ𝜃(𝑥𝑖)

2

• Logistic regression (Negative Log Likelihood)

• 𝐶𝑜𝑠𝑡 𝑊𝑖 , ℎ𝜃 𝑥𝑖 =1

2𝑊𝑖 − ℎ𝜃 𝑥𝑖

2 NLL (MLE 때묞)

“non-convex” “convex”

Sigmoid function

Page 15: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 15

Cost Function

𝐶𝑜𝑠𝑡 ℎ𝜃 𝑥 , 𝑊 ={ − log ℎ𝜃 𝑥 , 𝑖𝑓 𝑊 = 1

− log 1 − ℎ𝜃 𝑥 , 𝑖𝑓 𝑊 = 0

𝑖𝑓 𝑊 = 1 𝑖𝑓 𝑊 =0

Negative Log Likelihood

Page 16: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 16

Logistic regression cost function

• 𝐜 𝜃 = 𝐶𝑜𝑠𝑡 𝑊, ℎ𝜃 𝑥

• 𝑊 = 1: 𝐶𝑜𝑠𝑡 𝑊, ℎ𝜃 𝑥 = −log(ℎ𝜃 𝑥 )

• 𝑊 = 0: 𝐶𝑜𝑠𝑡 𝑊, ℎ𝜃 𝑥 = −log(1 − ℎ𝜃 𝑥 )

• 𝐜 𝜃 = 𝐶𝑜𝑠𝑡 𝑊, ℎ𝜃 𝑥

=- 𝑖=1𝑚 𝑊(𝑖) log ℎ𝜃 𝑥

𝑖 + 1 − 𝑊 𝑖 log 1 − ℎ𝜃 𝑥𝑖

• 파띌믞터 𝜃(= 𝑀) 최적화: min𝜃𝐜(𝜃)

• 입력 𝑥에 대한 분류: 𝑂𝑢𝑡𝑝𝑢𝑡 ℎ𝜃 𝑥 =1

1+𝑒−𝑀𝑇𝑥

𝐶𝑜𝑠𝑡 ℎ𝜃 𝑥 , 𝑊 = { − log ℎ𝜃 𝑥 , 𝑖𝑓 𝑊 = 1

− log 1 − ℎ𝜃 𝑥 , 𝑖𝑓 𝑊 = 0

→ 𝑃 𝑊 = 𝑐𝑙𝑎𝑠𝑠 𝑥; 𝜃)

Page 17: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 17

Gradient Descent

• 𝐜 𝜃 = − 𝑖=1𝑚 𝑊(𝑖) log ℎ𝜃 𝑥

𝑖 + 1 − 𝑊 𝑖 log 1 − ℎ𝜃 𝑥𝑖

• Gradient descent min𝐜 𝜃

𝑅𝑒𝑝𝑒𝑎𝑡 {

𝜃𝑗 ≔ 𝜃𝑗 − 𝜂𝜕

𝜕𝜃𝑗𝐜(𝜃)

}

Page 18: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 18

Gradient Descent

• 𝐜 𝜃 = − 𝑖=1𝑚 𝑊(𝑖) log ℎ𝜃 𝑥

𝑖 + 1 − 𝑊 𝑖 log 1 − ℎ𝜃 𝑥𝑖

• 각 조걎부확률 대입핎서 풀멎 linear reg의 gd와 유사

• Gradient descent min𝐜 𝜃

𝑅𝑒𝑝𝑒𝑎𝑡 {

𝜃𝑗 ≔ 𝜃𝑗 − 𝜂

𝑖=1

𝑚

𝑊 𝑖 − ℎ𝜃 𝑥𝑖 𝑥𝑗

𝑖

}

Page 19: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 19

Multiclass classification

• Examples

• POS tag: Noun, Verb, Pronoun, 


• Named Entity: OUT, PS_NAME, LC_COUNTRY, 


• Medical diagrams: Not ill, Cold, Flu, Mers

• Image recognition: Cat, Dog, Tiger, 


Page 20: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 20

Multiclass classification

Page 21: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 21

Multiclass classification

Page 22: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 22

Multiclass classification

• 각 𝑐𝑙𝑎𝑠𝑠 𝑖의 확률(𝑊 = 𝑖)은 Logistic regression을 학습하여 구핚

• 새로욎 입력 x에 대하여 파띌믞터 연산 후, 가장 큰 확률의 class륌 선택

maxℎ𝜃𝑖(𝑥)

Page 23: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 23

References

• https://class.coursera.org/ml-007/lecture

• http://deepcumen.com/2015/04/linear-regression-2/

Page 24: Logistic Regression 𝑖 𝜶cs.kangwon.ac.kr/~parkce/seminar/2015_MachineLearning/04... · 2016. 6. 17. · 𝑖 𝜶5 Classification 읎상형? Threshold classifier output at 0.5:

𝑠𝑖𝑔𝑚𝑎 𝜶 24

QA

감사합니닀.

박천음, 박찬믌, 최재혁, 박섞빈, 읎수정

𝑠𝑖𝑔𝑚𝑎 𝜶 , 강원대학교

Email: [email protected]