Title :
A comparative study of fMPE and RDLT approaches to LVCSR
Author :
Jian Xu ; Zhi-Jie Yan ; Qiang Huo
Author_Institution :
Dept. of Autom., Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
This paper presents a comparative study of two discriminatively trained feature transform approaches, namely feature-space minimum phone error (fMPE) and region-dependent linear transform (RDLT), to large vocabulary continuous speech recognition (LVCSR). Experiments are performed on an LVCSR task of conversational telephone speech transcription using about 2,000 hours training data. Starting from a maximum likelihood (ML) trained GMM-HMM based baseline system, recognition accuracy and run-time efficiency of different variants of the above two methods are evaluated, and a specific RDLT approach is identified and recommended for deployment in LVCSR applications.
Keywords :
hidden Markov models; learning (artificial intelligence); maximum likelihood estimation; speech processing; speech recognition; transforms; Gaussian mixture models; LVCSR task; conversational telephone speech transcription; discriminatively trained feature transform approaches; fMPE; feature-space minimum phone error; hidden Markov models; large vocabulary continuous speech recognition; maximum likelihood trained GMM-HMM based baseline system; recognition accuracy; region-dependent linear transform; run-time efficiency; specific RDLT approach; Acoustics; Context; Speech; Speech recognition; Training; Transforms; Vectors; LVCSR; RDLT; discriminative training; fMPE; feature transform;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423511