investigation on inter-speaker variability in the feature space
DESCRIPTION
Investigation on Inter-Speaker Variability in The Feature Space. Presenter : 陳彥達. Reference. R. Haeb-Umbach, “Investigation on Inter-Speaker Variability in The Feature Space”, ICASSP 99. Outline. Introduction A measure of inter-speaker variability Vocal tract normalization - PowerPoint PPT PresentationTRANSCRIPT
Investigation on Inter-Speaker Variability in The Feature Space
Presenter : 陳彥達
Reference
R. Haeb-Umbach, “Investigation on Inter-Speaker Variability in The Feature Space”, ICASSP 99.
Outline Introduction A measure of inter-speaker variability Vocal tract normalization Cepstral mean and variance normalization
Introduction Adaptation
Reduce mismatch by adapting feature vectors or model parameters to the target environment.
Introduction(2) Normalization
Compute feature or model parameters that are insensitive to undesired variations of the speech signal.
Introduction(3) Fisher discriminant analysis
An early assessment of a feature set without running recognition first
The ratio of feature variability due to different phonemes and due to different speakers
A measure of inter-speaker variability
Good feature vector space Close together when belonging to the same
phoneme class Separated from each other when belonging to
the different phoneme class
A measure of inter-speaker variability(2)
: cepstral feature vectors
: cepstral mean feature vector
: class mean vector
: total mean vector
)(tx
cphrtxcr
cr txN .,)(,
, )(1
CcN
mcphrrcr
cc ,,1;
1
}.|{,
C
c cphrr
C
ccccr mN
NNm
1 }.|{ 1,
11
A measure of inter-speaker variability(3)
: cepstral mean feature vector : class mean vector
: total mean vector
: between class covariance matrix
: within class covariance matrix
m
cr ,
cm
Tccr
C
c cphrrccrW mm
NS )()(
1,
1 }.|{,
Tc
C
cc
cB mmmm
N
NS )()(
1
A measure of inter-speaker variability(4)
Fisher variate analysis = the sum of the eigenvalues
of The radius of the scattering volume Higher
lower recognition error rate
)( 1BSStr
W
BW SS 1
)( 1BSStr
W
Vocal tract normalization Reduce inter-speaker variability by a speaker-
specific frequency warping Differences in vocal tract length are compensated
for by a linear warping factor
)110(7001
)( 2595 melfk
melHz fkf
Vocal tract normalization(2)
42 male + 42 female 42 male
Vocal tract normalization(3)
a normalization on a per sentence basis performs better than a normalization on a per speaker basis
Cepstral mean and variance normalization
: input cepstral feature : estimate of the mean of the input cepstral
feature : estimate of the standard deviation of the
input cepstral feature : the mean and variance normalized feature : number of features
Ddt
txtxty
d
ddd ,,1;
)(ˆ)()(
)(
)(tyd
)(txd)(txd
)(ˆ td
D
Cepstral mean and variance normalization(2)
42 male + 42 female 42 male