Transcript
  • (Mel-frequency cepstral coefficients)

  • :X(n) (n) E(n)

  • :

  • ce(n)n c(n)n

  • (Masking Effect)

    (Frequency Masking)(Temporal Masking)

  • (narrowband sound stimulus)(Critical Band)

    24

  • 105010021001502003200250300430035040054004505106510570630763070077087708409209920100010801010801170127011127013701480121480160017201317201850200014200021502320

  • 1523202500270016270029003150173150340037001837004000440019440048005300205300580064002164007000770022770085009500239500105001200024120001350015500

  • (Mel Scale):

  • 22050Hz:11025Hz 3176.32Hz3176.32/(4+1)=635.264

    0529.7011461.253097.585972.8311025

    0635.2641270.5281905.7922541.0563176.32

  • 10529.7011461.252529.7011461.253097.5831461.253097.585972.8343097.585972.8311025

    0529.7011461.253097.585972.8311025

  • 22050Hz400500Hz ?

    =/: 22050/400=55.125Hz500/55.12510500Hz10

  • Filtering

    J j

  • DCT:

    LMFCC

  • (Delta Cepstrum Coefficients)

    m = 1,2,,L

  • M1:Cm(t)Cm(t+)Cm(t-)

    D1

    D2

    D4

    D3

    D5

    D6

    Frame1

    Frame2

    Frame3

    Frame4

    Frame5

  • **


Top Related