DocumentCode :
3327192
Title :
Language identification using parallel syllable-like unit recognition
Author :
Nagarajan, T. ; Murthy, Hema A.
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Madras, India
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
Automatic spoken language identification (LID) is the task of identifying the language from a short utterance of the speech signal. The most successful approach to LID uses phone recognizers of several languages in parallel. The basic requirement to build a parallel phone recognition (PPR) system is annotated corpora. A novel approach is proposed for the LID task which uses parallel syllable-like unit recognizers, in a framework similar to the PPR approach in the literature. The difference is that unsupervised syllable models are built from the training data. The data is first segmented into syllable-like units. The syllable segments are then clustered using an incremental approach. This results in a set of syllable models for each language. Our initial results on the OGI MLTS corpora show that the performance is 69.5%. We further show that if only a subset of syllable models that are unique (in some sense), are considered, the performance improves to 75.9%.
Keywords :
learning (artificial intelligence); natural languages; parallel processing; speech recognition; LID; annotated corpora; automatic language identification; parallel phone recognition system; parallel syllable-like unit recognition; speech signal; spoken language identification; training data; unsupervised syllable models; Automatic speech recognition; Computer science; Frequency; Humans; Information resources; Natural languages; Performance analysis; Signal processing; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326007
Filename :
1326007
Link To Document :
بازگشت