DocumentCode :
3424440
Title :
Directional dependency of cepstrum on vocal tract length
Author :
Saito, Daisuke ; Matsuura, Ryo ; Asakawa, Satoshi ; Minematsu, Nobuaki ; Hirose, Keikichi
Author_Institution :
Grad. Sch. of Frontier Sci., Univ. of Tokyo, Tokyo
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4485
Lastpage :
4488
Abstract :
In this paper, we prove that the direction of cepstrum vectors depends strongly on vocal tract length and that this dependency is represented as a rotation in the n-dimensional cepstrum space. In speech recognition studies, vocal tract length normalization (VTLN) techniques are widely used to cancel age and gender differences. In VTLN, frequency warping is often carried out, and it can be implemented as a linear transformation in the cepstrum space: c' = Ac. However, the geometric properties of this transformation matrix A have not been well discussed. In this study, its properties are made clear using n-dimensional geometry, and it is shown that the matrix rotates every cepstrum vector in a similar and clearly observable way. Experimental results using resynthesized speech demonstrate that cepstrum vectors extracted from a speaker 180 cm in height and those from another speaker 120 cm in height are approximately orthogonal. This result clarifies one of the reasons why children's speech is very difficult for conventional speech recognizers to handle adequately.
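The linear relation c' = Ac described in the abstract can be illustrated numerically. The sketch below is not the authors' implementation: it assumes a first-order all-pass (bilinear) warping function with a hypothetical parameter `alpha`, which the abstract does not specify, and builds A by numerical integration of the cosine expansion of the log spectrum. The names `warp_matrix` and `angle_deg` are illustrative only.

```python
import numpy as np

def warp_matrix(alpha, dim, n_grid=4096):
    """Matrix A such that c' = A c for a frequency-warped log spectrum.

    Assumption: the warping is the first-order all-pass function
    w(omega) = omega + 2*atan(alpha*sin(omega) / (1 - alpha*cos(omega))),
    with |alpha| < 1. The log spectrum is modeled as
    c[0] + 2 * sum_{k>=1} c[k] * cos(k * omega), truncated to `dim` terms.
    """
    # Midpoint grid on [0, pi] for numerical integration.
    omega = (np.arange(n_grid) + 0.5) * np.pi / n_grid
    warped = omega + 2.0 * np.arctan2(alpha * np.sin(omega),
                                      1.0 - alpha * np.cos(omega))
    idx = np.arange(dim)
    basis_out = np.cos(np.outer(idx, omega))   # cos(n * omega)
    basis_in = np.cos(np.outer(idx, warped))   # cos(k * w(omega))
    # A[n, k] ~ (2/pi) * integral of cos(n*omega) * cos(k*w(omega)) d(omega)
    A = (2.0 / n_grid) * (basis_out @ basis_in.T)
    A[:, 0] *= 0.5  # the c[0] term has weight 1 rather than 2 in the expansion
    return A

def angle_deg(u, v):
    """Angle in degrees between two vectors."""
    cos = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return np.degrees(np.arccos(np.clip(cos, -1.0, 1.0)))

if __name__ == "__main__":
    dim = 12
    rng = np.random.default_rng(0)
    c = rng.standard_normal(dim)          # synthetic cepstrum, not real speech
    for alpha in (0.0, 0.1, 0.2, 0.3):    # hypothetical warping strengths
        c_warped = warp_matrix(alpha, dim) @ c
        print(alpha, angle_deg(c, c_warped))
```

With `alpha = 0` the warping is the identity and A reduces to the identity matrix; larger values of `alpha` move the warped vector further from the original direction. Whether A behaves as a rotation, and how `alpha` maps to body height, is the subject of the paper and is not established by this sketch.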
Keywords :
matrix algebra; speech recognition; cepstrum; dimensional geometry; directional dependency; linear transformation; speech recognition; transformation matrix; vocal tract length normalization; Acoustics; Cepstrum; Frequency; Geometry; Information science; Loudspeakers; Microphones; Space technology; Speech recognition; Vectors; cepstrum; frequency warping; rotation; rotation matrix; vocal tract length;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518652
Filename :
4518652