Title : 
Out-of-vocabulary word recognition with a hierarchical doubly Markov language model
         
        
            Author : 
Kokubo, Hiroaki ; Yamamoto, Hiroshi ; Ogawa, Yoshihiko ; Sagisaka, Yoshinori ; Kikui, Genichiro
         
        
            Author_Institution : 
ATR Spoken Language Translation Res. Labs, Kyoto, Japan
         
        
        
            fDate : 
30 Nov.-3 Dec. 2003
         
        
        
        
            Abstract : 
We describe a novel language model for task-dependent out-of-vocabulary (OOV) words. OOV words, such as personal names and place names in a new task, can make language model adaptation difficult. To cope with this problem, we propose a hierarchical, 2-layered language model consisting of inter-word constraints and intra-word constraints. Stochastic properties of OOV words in the two constraints are represented by multi-class modeling and trained as independent Markov models. Occurrence probabilities of an OOV word are expressed by statistics of two Markov models (namely, doubly Markov model). The proposed model has been tested in a Japanese conversational speech database of appointment making. The correct word rate was improved by 7.5% from 78.2% to 86.7% when the new language model was used to recognize sentences with OOV words.
         
        
            Keywords : 
Markov processes; learning (artificial intelligence); natural languages; speech recognition; Japanese conversational speech database; hierarchical doubly Markov language model; independent Markov models; inter-word constraints; intra-word constraints; out-of-vocabulary word recognition; Adaptation model; Databases; Natural languages; Probability; Speech; Statistics; Stochastic processes; Testing; Training data; Vocabulary;
         
        
        
        
            Conference_Titel : 
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
         
        
            Print_ISBN : 
0-7803-7980-2
         
        
        
            DOI : 
10.1109/ASRU.2003.1318498