DocumentCode :
2176301
Title :
Investigation of acoustic units for LVCSR systems
Author :
Liu, X. ; Gales, M.J.F. ; Hieronymus, J.L. ; Woodland, P.C.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge, UK
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4872
Lastpage :
4875
Abstract :
One important issue in designing state-of-the-art LVCSR systems is the choice of acoustic units. Context dependent (CD) phones remain die dominant form of acoustic units. They can capture the co-articulatory effect in speech via explicit modelling. However, for other more complicated phonological processes, they rely on the implicit modelling ability of the underlying statistical models. Alternatively, it is possible to construct acoustic models based on higher level linguistic units, for example, syllables, to explicitly capture these complex patterns. When sufficient training data is available, this approach may show an advantage over implicit acoustic modelling. In this paper a wide range of acoustic units are investigated to improve LVCSR system performance. Significant error rate gains up to 7.1% relative (0.8% abs.) were obtained on a state-of-the-art Mandarin Chinese broadcast audio recognition task using word and syllable position dependent triphone and quinphone models.
Keywords :
speech recognition; CD phones; LVCSR systems; Mandarin Chinese broadcast audio recognition task; acoustic unit investigation; complicated phonological processes; context dependent phones; explicit modelling; quinphone models; syllable position dependent triphone model; Acoustics; Context; Decision trees; Hidden Markov models; Speech; Speech recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947447
Filename :
5947447
Link To Document :
بازگشت