DocumentCode :
3423155
Title :
Target-oriented phone tokenizers for spoken language recognition
Author :
Tong, Rong ; Ma, Bin ; Li, Haizhou ; Chng, Eng Siong
Author_Institution :
Inst. for Infocomm Res., Singapore
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4221
Lastpage :
4224
Abstract :
This paper presents a new strategy for designing the parallel phone recognizers for spoken language recognition. Given a collection of parallel phone recognizers, we select a subset of phones from each phone recognizer for each target language to construct a target-oriented phone tokenizer (TOPT). As a result, the collection of target-oriented phone tokenizers is more effective than the original parallel phone recognizers. This approach improves system performance significantly without requesting for additional transcribed training samples. We validate the effectiveness of the proposed strategy within the framework of the parallel phone recognizer followed by vector space modeling backend, or PPR-VSM. We achieve equal-error-rate of 2.21% and 3.65% on the 2003 and 2005 NIST LRE databases, respectively, for 30-second trials.
Keywords :
natural language processing; speech recognition; parallel phone recognizers; spoken language recognition; target-oriented phone tokenizer; vector space modeling backend; Classification algorithms; Data mining; Humans; Natural languages; Space technology; Speech recognition; Stacking; Support vector machine classification; Support vector machines; Target recognition; Spoken language recognition; parallel phone tokenizer; target-oriented phone tokenizer;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518586
Filename :
4518586
Link To Document :
بازگشت