speech perception1 dynamics of speech perception of english speech sounds. cues for...

32
Speech Perception 1 Speech Perception Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing • Suprasegmentals Context dependence Categorical perception Hearing Loss and speech perception

Upload: donna-medler

Post on 14-Dec-2015

226 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 1

Speech Perception

• Dynamics of Speech• Perception of English speech sounds.

• Cues for manner-place-voicing• Suprasegmentals• Context dependence• Categorical perception• Hearing Loss and speech perception

Page 2: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

• intensity• frequency• suprasegmentals

Dynamics

of

Speech

Page 3: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

• intensity– whisper - 20 dB HL– normal conversational speech - 50 to 60 dB HL– loud speech - 70 dB HL– shouting - 90 dB HL

Dynamics

of

Speech

Page 4: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

• frequency– about 250 to 8000 Hz– most important speech sounds - 500 through 6000 Hz

Dynamics

of

Speech

Page 5: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Dynamics

of

Speech

Dynamics of Specific Speech Sounds

Class Intensity Frequency

Vowels High Low

Nasals Low Low

Stops Varies Mid-high

Fricatives Low High

F2 transitions Varies Mid-high

Page 6: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Dynamics

of

Speech

Page 7: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 7

Perception of English Speech Sounds

• Vowels• Diphthongs• Semivowels• Nasal consonants• Stop/plosives• Fricatives and affricates

Page 8: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 8

Vowels• Most important cue is relationship between the F1 & F2 formant frequencies.

Page 9: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 9

Vowels

• Other cues– For high-front vowels: F3 formant

– For low-back vowels: A strong F1 frequency

– Contextual cues

Page 10: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 10

Vowels

Page 11: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 11

Diphthongs

• Most important cue is gliding formants

Page 12: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 12

Diphthongs

• Specifically …• Rate of frequency change. The faster the rate of change the easier it is to perceive

• Brief steady state formant pattern at the beginning and the end of the diphthong also improves perception.– I.e., tense monothongs are more difficult to perceive because they lack the long steady state formant.

Page 13: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 13

Diphthongs

Page 14: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 14

Semivowels

• Glides -----> /j/ and /w/• Liquids -----> /r/ and /l/• As with diphthongs, perception relies on vowel formant transitions.

• Most important transition is F2• For /r/ and /l/, the F3 formant is also important

• Semivowels differ from diphthongs since they have more rapid F2 transitions which makes them consonant like.

Page 15: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 15

Semivowels (continued)

Page 16: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 16

Nasals

• Most important decisions when perceiving nasals.– Is the speech segment nasal or non-nasal?– If it is a nasal, which nasal is it?

• Important Cues– Transition between nasal and adjacent vowel– Weakening of upper formants– Lengthening of nasal tract causes primary resonant frequency to drop from 500 to about 250 Hz.

– Lowering of intensity when compared to vowels.

Page 17: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 17

Nasals (continued)

Page 18: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 18

Nasals (continued)

• In perceiving specific nasals the most important cues are…

• Transition to and from adjacent vowels are similar to different stops…– /m/ same as /p-b/– /n/ same as /t-d/– “ng” same as /k-g/

Page 19: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 19

Nasals (continued)

• Frequency cues in F2 transition from adjoining vowels– /m/ lowest in frequency and shortest in duration

– /n/ higher in frequency and a bit longer in duration

– /ng/ highest in frequency and shortest in duration.

Page 20: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 20

Nasals (continued)

• Resonance and anti-resonant patterns are different for different nasals. This is important for discriminating between /m/ and /n/, but not important for perceiving /ng/

• Nasal murmur

Page 21: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 21

Stops

• Manner-place of articulation-voicing are important cues.

Page 22: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 22

Stops (manner)

• Cues for manner– Oral occlusion is heard either as a …

•Silence in the voiceless stop /p-t-k/•Brief attenuation of sound in voiced stops /b-d-g/.

– Stopped air is often released and heard as a plosive (i.e., and aperiodic sound source)

– Stops have much shorter transition to and from adjacent vowels than semi-vowels.

Page 23: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 23

Stops (manner)

Page 24: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 24

Stops (place of articulation)

• To review…• /p-b/ …. Bilabial• /t-d/ …. Alveolar• /k-g/ …. Velar (or palatal)

Page 25: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 25

Stops (place of articulation)

• Cues for place include …• Frequency position of the burst in relation to F1 and F2 formants.

•When burst is above 3000 Hz, the sound is perceived as a /t/.

•When burst is below F2 formant the vowel was perceived as a /p/, with the exception of high-back vowels.

•When bust was just above F2 formant, phoneme perceived as a /k/

Page 26: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 26

Stops (place of articulation)

Page 27: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 27

Stops (place of articulation)

• F2 transition. Depending upon direction of transition, different stop plosives will be perceived. Refer to previous slide.

Page 28: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 28

Stops (voicing)

• Voicing cues include• Low frequency voice bar.

Page 29: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 29

Stops (voicing)

• Presence of aspiration• Onset of F1 Formant for a following vowel (F1 cutback)

Page 30: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 30

Stops (voicing)

Page 31: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 31

Stops (voicing)

• Voice onset time (VOT)– Longer the VOT prior to a vowel and after a stop… the more likely you will perceive it as voiceless.

• Vowel duration preceding stop.– Longer duration of vowel, the more likely it will be perceived as voiced.

Page 32: Speech Perception1 Dynamics of Speech Perception of English speech sounds. Cues for manner-place-voicing Suprasegmentals Context dependence Categorical

Speech Perception 32

Summary