DocumentCode :
2423765
Title :
Speaker normalization using dynamic frequency warping
Author :
Huang, Zhenhua ; Hou, Limin
Author_Institution :
Sch. of Commun. & Inf. Eng., Shanghai Univ., Shanghai
fYear :
2008
fDate :
7-9 July 2008
Firstpage :
1091
Lastpage :
1095
Abstract :
In an effort to reduce the degradation in a gender-independence isolated word recognition performance caused by variation character among different speaker, a dynamic frequency warping approach to speaker normalization is investigated. There are a lot of discrepancy in frequency domain which caused by vocal tract length difference among different speakers. Dynamic frequency warping (DFW) is an exact analog of dynamic time warping (DTW) which is used to reduce the discrepancy frequency scale of speech and normalize the frequency accurately. In this paper, the DFW method is to be introduced to normalize the frequency scale of speech and then applied it to a gender-independence isolated word recognition system. The results of experiments show a large improvement in average word error rate.
Keywords :
speaker recognition; speech synthesis; average word error rate; dynamic frequency warping; dynamic time warping; gender-independence isolated word recognition performance; speaker normalization; vocal tract length difference; Character recognition; Degradation; Error analysis; Frequency domain analysis; Frequency estimation; Loudspeakers; Piecewise linear techniques; Speech analysis; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1723-0
Electronic_ISBN :
978-1-4244-1724-7
Type :
conf
DOI :
10.1109/ICALIP.2008.4590058
Filename :
4590058
Link To Document :
بازگشت