Title :
Tree-structured model selection and simulated-data adaptation for environmental and speaker robust speech recognition
Author :
Thatphithakkul, Nattanun ; Kruatrachue, Boontee ; Wutiwiwatchai, Chai ; Marukatat, Sanparith ; Boonpiam, Vataya
Author_Institution :
King Mongkut´´s Inst. of Technol., Bangkok
Abstract :
This paper proposes the use of tree-structured model selection and simulated-data in maximum likelihood linear regression (MLLR) adaptation for environment and speaker robust speech recognition. The objective of this work is to solve major problems in robust speech recognition system, namely unknown speaker and unknown environmental noise. The proposed solution is composed of two components. The first one is based on a tree-structured model for selecting a speaker-dependent model that best matches to the input speech. The second component uses simulated-data to adapt the selected acoustic model to fit with the unknown noise. The proposed technique can thus alleviate both problems simultaneously. Experimental results show that the proposed system achieves a higher recognition rate than the system using only the input speech in adaptation and the system using a multi-conditioned acoustic model.
Keywords :
maximum likelihood estimation; regression analysis; speech recognition; trees (mathematics); maximum likelihood linear regression adaptation; multi-conditioned acoustic model; robust speech recognition system; simulated-data adaptation; speaker robust speech recognition; speaker-dependent model; tree-structured model selection; Acoustic noise; Computational modeling; Computer simulation; Loudspeakers; Maximum likelihood linear regression; Noise robustness; Paper technology; Speech enhancement; Speech recognition; Working environment noise;
Conference_Titel :
Communications and Information Technologies, 2007. ISCIT '07. International Symposium on
Conference_Location :
Sydney,. NSW
Print_ISBN :
978-1-4244-0976-1
Electronic_ISBN :
978-1-4244-0977-8
DOI :
10.1109/ISCIT.2007.4392267