Title :
Exploiting complementary aspects of phonological features in automatic speech recognition
Author :
Momayyez, Parya ; Waterhouse, James ; Rose, Richard
Author_Institution :
McGill Univ., Montreal
Abstract :
This paper presents techniques for exploiting complementary information contained in multiple definitions of phonological feature systems. Three different feature systems, differing in their structure and in the acoustic phonetic features they represent, are considered. A two stage process involving a mechanism for frame level phonological feature detection and a mechanism for decoding phoneme sequences from features is implemented for each phonological feature system. Two methods are investigated for integrating these features with MFCC based ASR systems. First, phonological feature and MFCC based systems are combined in a lattice re-scoring paradigm. Second, confusion network based system combination (CNC) is used to combine phone networks derived from phonological distinctive feature (PDF) and MFCC based systems. It is shown, using both methods, that phone error rates can be reduced by as much as 15% relative to the phone error rates obtained for any individual feature stream.
Keywords :
acoustic signal processing; decoding; error statistics; feature extraction; sequences; speech coding; speech recognition; MFCC based automatic speech recognition system; acoustic phonetic feature detection; confusion network-based system combination; lattice re-scoring paradigm; phone error rate; phoneme sequence decoding; phonological feature detection system; Acoustic signal detection; Automatic speech recognition; Computer numerical control; Computer vision; Decoding; Detectors; Hidden Markov models; Lattices; Mel frequency cepstral coefficient; Speech recognition; Acoustic Modeling; Phonological Features; Speech Recognition;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430082