DocumentCode
2423765
Title
Speaker normalization using dynamic frequency warping
Author
Huang, Zhenhua ; Hou, Limin
Author_Institution
Sch. of Commun. & Inf. Eng., Shanghai Univ., Shanghai
fYear
2008
fDate
7-9 July 2008
Firstpage
1091
Lastpage
1095
Abstract
To reduce the degradation in gender-independent isolated word recognition performance caused by variation among speakers, a dynamic frequency warping approach to speaker normalization is investigated. Differences in vocal tract length among speakers cause considerable discrepancies in the frequency domain. Dynamic frequency warping (DFW) is an exact analog of dynamic time warping (DTW); it is used to reduce discrepancies in the frequency scale of speech and to normalize that scale accurately. In this paper, the DFW method is introduced to normalize the frequency scale of speech and is then applied to a gender-independent isolated word recognition system. Experimental results show a large improvement in average word error rate.
Keywords
speaker recognition; speech synthesis; average word error rate; dynamic frequency warping; dynamic time warping; gender-independence isolated word recognition performance; speaker normalization; vocal tract length difference; Character recognition; Degradation; Error analysis; Frequency domain analysis; Frequency estimation; Loudspeakers; Piecewise linear techniques; Speech analysis; Speech recognition; Testing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-1723-0
Electronic_ISBN
978-1-4244-1724-7
Type
conf
DOI
10.1109/ICALIP.2008.4590058
Filename
4590058