Title :
Target-oriented phone tokenizers for spoken language recognition
Author :
Tong, Rong ; Ma, Bin ; Li, Haizhou ; Chng, Eng Siong
Author_Institution :
Inst. for Infocomm Res., Singapore
fDate :
March 31 2008-April 4 2008
Abstract :
This paper presents a new strategy for designing the parallel phone recognizers for spoken language recognition. Given a collection of parallel phone recognizers, we select a subset of phones from each phone recognizer for each target language to construct a target-oriented phone tokenizer (TOPT). As a result, the collection of target-oriented phone tokenizers is more effective than the original parallel phone recognizers. This approach improves system performance significantly without requesting for additional transcribed training samples. We validate the effectiveness of the proposed strategy within the framework of the parallel phone recognizer followed by vector space modeling backend, or PPR-VSM. We achieve equal-error-rate of 2.21% and 3.65% on the 2003 and 2005 NIST LRE databases, respectively, for 30-second trials.
Keywords :
natural language processing; speech recognition; parallel phone recognizers; spoken language recognition; target-oriented phone tokenizer; vector space modeling backend; Classification algorithms; Data mining; Humans; Natural languages; Space technology; Speech recognition; Stacking; Support vector machine classification; Support vector machines; Target recognition; Spoken language recognition; parallel phone tokenizer; target-oriented phone tokenizer;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518586