مرکز منطقه ای اطلاع رساني علوم و فناوري - A comparative study of fMPE and RDLT approaches to LVCSR

DocumentCode :

3124780

Title :

A comparative study of fMPE and RDLT approaches to LVCSR

Author :

Jian Xu ; Zhi-Jie Yan ; Qiang Huo

Author_Institution :

Dept. of Autom., Univ. of Sci. & Technol. of China, Hefei, China

fYear :

2012

fDate :

5-8 Dec. 2012

Firstpage :

Lastpage :

Abstract :

This paper presents a comparative study of two discriminatively trained feature transform approaches, namely feature-space minimum phone error (fMPE) and region-dependent linear transform (RDLT), to large vocabulary continuous speech recognition (LVCSR). Experiments are performed on an LVCSR task of conversational telephone speech transcription using about 2,000 hours training data. Starting from a maximum likelihood (ML) trained GMM-HMM based baseline system, recognition accuracy and run-time efficiency of different variants of the above two methods are evaluated, and a specific RDLT approach is identified and recommended for deployment in LVCSR applications.

Keywords :

hidden Markov models; learning (artificial intelligence); maximum likelihood estimation; speech processing; speech recognition; transforms; Gaussian mixture models; LVCSR task; conversational telephone speech transcription; discriminatively trained feature transform approaches; fMPE; feature-space minimum phone error; hidden Markov models; large vocabulary continuous speech recognition; maximum likelihood trained GMM-HMM based baseline system; recognition accuracy; region-dependent linear transform; run-time efficiency; specific RDLT approach; Acoustics; Context; Speech; Speech recognition; Training; Transforms; Vectors; LVCSR; RDLT; discriminative training; fMPE; feature transform;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on

Conference_Location :

Kowloon

Print_ISBN :

978-1-4673-2506-6

Electronic_ISBN :

978-1-4673-2505-9

Type :

conf

DOI :

10.1109/ISCSLP.2012.6423511

Filename :

6423511

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3124780