ml 94 1 chap2 supervised learning

Upload: -

Post on 20-Feb-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    1/32

    (01-805-11-13)

    1394

    http://faculties.sbu.ac.ir/~a_mahmoudi/

    Machine Learning

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    2/32

    2

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    3/32

    .

    .

    .

    .

    :

    x1: price, x2: engine power 3

    Class learning is f indin g a descr ipt ion that is shared by all pos i t ive

    examples and none of the negative examples.

    Prediction

    Knowledge extraction

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    4/32

    4

    N

    t

    tt,r 1}{ xX

    negativeisif

    positiveisif

    x

    x

    0

    1r

    2

    1

    x

    xx

    Training set

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    5/32

    Hypothesis class H

    5

    2121 powerengineANDprice eepp

    Hypothesis Class

    hH Hypothesis

    .

    .

    .

    .

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    6/32

    negativeissaysif

    positiveissaysif)(

    x

    x

    x

    h

    hh

    0

    1

    6

    N

    t

    tt rhhE1

    1 x)|( XError of h on H

    empirical error (training error)

    Ch

    .

    h

    .

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    7/32

    .

    .

    .

    7

    Generalization

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    8/32

    Version space

    8

    most specific hypothesis, S

    most general hypothesis, G

    h H, between S and G is consistent

    and make up the version space(Mitchell, 1997)

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    9/32

    .

    .

    9

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    10/32

    .

    .

    C

    .

    10

    0( |X)E h

    doubt

    capacity

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    11/32

    VC Dimension

    N

    2N

    )

    ).

    2N

    HN

    (

    )

    shatter .

    N

    .

    NVapnik-Chervonenkis (VC)Dimension VC(H ) = N

    .

    11

    hH

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    12/32

    VC Dimension

    12

    3 points shattered 4 points impossible

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    13/32

    .

    C

    PAC-learnable

    1

    .

    ,>0

    (N)

    1

    13

    Probably Approximately Correct (PAC) Learning

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    14/32

    14

    Each strip is at most /4

    Pr that we miss a strip 1

    /4Pr that N instances miss a strip (1 /4)N

    Pr that N instances miss 4 strips is at most 4(1 /4)N

    4(1 /4)N and (1 x)exp( x)

    4exp( N/4) and N (4/)log(4/)

    P{Ch} 1-

    Probably Approximately Correct (PAC) Learning

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    15/32

    .

    .

    .

    15

    Teacher noise

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    16/32

    .

    16

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    17/32

    .

    .

    .

    bias

    .

    .

    .

    .

    17

    Less var iance

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    18/32

    Occam

    Occam William of Ockham

    .

    14

    Occam

    .

    .

    18

    Occams razor

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    19/32

    (k)

    19

    N

    t

    tt,r 1}{ xX

    ,i f

    i f

    ijr

    j

    t

    i

    t

    t

    iC

    C

    x

    x

    0

    1

    ,i f

    i f

    ijh

    j

    t

    i

    t

    t

    iC

    C

    x

    x

    x

    0

    1

    Train hypotheses

    hi(x), i =1,...,K:

    r

    k

    K

    Reject(doubt)

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    20/32

    .

    .

    20

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    21/32

    :

    .

    (

    )

    .

    (g(x)).

    21

    tN

    t

    tt rrx ,1

    X

    tt xfr

    ttt zxfr ,*

    ()

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    22/32

    (

    )...

    22

    :

    N

    t

    ttxgr

    NgE

    1

    21X|

    Pattern Recog nit ion andMachine Learning (Bisho p)

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    23/32

    :g(x)

    :

    23

    N

    t

    tt

    wxwrNwwE 1

    2

    0101

    1

    X|,

    01

    02211 wxwwxwxwgd

    j

    jj

    x

    ()

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    24/32

    ()...

    :

    24

    N

    t

    tt wxwrNw

    E

    1

    01

    0

    12

    N

    t

    ttt xwxwrNw

    E

    1

    01

    1

    2

    xwrw 10

    221 xNxNrxrx

    w

    t

    t

    t

    tt

    N

    xx tt

    N

    rr t

    t

    N

    t

    ttwxwr

    NwwE

    1

    2

    0101

    1X|,

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    25/32

    :

    25

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    26/32

    Pattern Recog nit ion andMachine Learning (Bisho p)26

    Over fitting

    Best fit

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    27/32

    ill-posed

    .

    .

    .

    27

    M d l S l ti

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    28/32

    .

    inductive bias .

    .

    .

    .

    28

    Model Selection

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    29/32

    .

    underfitting .

    .

    validation

    (validation error) .

    overfitting

    .

    .

    29

    T d ff

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    30/32

    Tradeoff

    tradeoff

    :

    Hc (H)

    30

    N

    E

    c (

    )

    f i rst E

    and then E

    Cross Val idat ion

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    31/32

    (validation set)

    .

    validation

    .

    validation

    .

    31

    Cross-Val idat ion

    Test set (pub l icat ion set)

  • 7/24/2019 ML 94 1 Chap2 Supervised Learning

    32/32

    (cost or loss function)

    .

    .

    |xg

    t

    ttgrLE |,| xX

    X|minarg* E