Title :
Double bigram-decoding in phonotactic language identification
Author :
J. Navratil;W. Zuhlke
Author_Institution :
Dept. of Commun. & Meas., Tech. Univ. Ilmenau, Germany
Abstract :
This paper describes a phonotactic language identification system that employs a multilingual phone recognizer with multiple language-dependent grammars to tokenize the speech signal into several phone streams. For each stream, an independent set of language models computes the language scores, which are subsequently processed by two classification stages. Thus, the system acquires information from both the original-label and the decoded-phone statistics. A discriminative weighting method is applied in the second stage to better distinguish between similar languages. A modified bigram language model, the so-called skip-gram, is introduced; it exploits a wider phonotactic context without increasing the estimation cost of a standard bigram. Measured on the NIST'95 evaluation set, the described system outperforms state-of-the-art phonotactic components that use multiple recognizers while being less computationally expensive.
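The skip-gram idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the model's exact definition, so the code assumes that a skip-gram conditions each phone on the phone two positions back, skipping the one in between. The function names `pair_counts` and `log_score`, the add-one smoothing, and the toy phone sequence are all hypothetical and are not taken from the paper.

```python
from collections import Counter
import math

def pair_counts(phones, gap=1):
    """Count (context, phone) pairs where the context lies `gap` positions back.

    gap=1 yields ordinary bigram counts; gap=2 is the assumed skip-gram,
    which skips one intermediate phone.
    """
    return Counter(zip(phones, phones[gap:]))

def log_score(phones, counts, vocab_size, gap=1):
    """Add-one smoothed log-likelihood of a phone sequence (smoothing is an assumption)."""
    context_totals = Counter()
    for (ctx, _), n in counts.items():
        context_totals[ctx] += n
    total = 0.0
    for ctx, cur in zip(phones, phones[gap:]):
        p = (counts[(ctx, cur)] + 1) / (context_totals[ctx] + vocab_size)
        total += math.log(p)
    return total

# Toy phone stream (hypothetical labels, not real recognizer output).
train = ["a", "b", "c", "a", "b", "d"]
bigram = pair_counts(train, gap=1)  # pairs of adjacent phones
skip = pair_counts(train, gap=2)    # pairs separated by one skipped phone
```

Both tables are indexed by a pair of phones, so under this assumption the skip-gram has at most as many parameters as a standard bigram while looking one position further back.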
Keywords :
"Decoding","Natural languages","Statistics","Signal processing","Context modeling","Costs","Time measurement","Performance evaluation","Acoustic testing","Automatic testing"
Conference_Title :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596137