DocumentCode :
3441744
Title :
Speaker and text independent language identification using predictive error histogram vectors
Author :
Gu, Qian-Rong ; Shibata, Tudushi
Author_Institution :
Dept. of Electron. Eng., Univ. of Tokyo, Japan
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
A predictive vector quantization (Gray, M., 1984; Jain, A.K. et al., 2000) based speaker and text independent language identification system is proposed, which uses the statistical distribution of predictive error vectors to recognize the language spoken by native speakers. According to Stan C. Kwasny et al. (see Proc. 5th Midwest Artificial Intelligence and Cognitive Science Soc. Conf., p.53-7, 1993), most high level features of speech, such as tone of voice, rhythm, style, pace, accent, etc., appear to be related to distributional patterns or statistical aggregates of speech waveforms. We further develop the method used by Qian-Rong Gu and Tadashi Shibata (6th World Multiconference on Systemics, Cybernetics and Informatics - SCI2002, 2002) to extract these statistical distributional patterns directly from raw speech waveforms and then use them to identify language. The system has been trained and. tested by speech from English and Japanese native speakers. A best identification ratio of 76.8% can be achieved by our system.
Keywords :
natural languages; speech processing; speech recognition; statistical analysis; vector quantisation; English native speakers; Japanese native speakers; predictive error histogram vectors; predictive error vectors; predictive vector quantization; speaker independent language identification; speech features; speech waveforms; statistical aggregates; statistical distribution; statistical distributional patterns; text independent language identification; Aggregates; Artificial intelligence; Cognitive science; Histograms; Natural languages; Rhythm; Speech; Statistical distributions; Text recognition; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198710
Filename :
1198710
Link To Document :
بازگشت