Title :
Malaysian English accents identification using LPC and formant analysis
Author :
Yusnita, M.A. ; Paulraj, M.P. ; Yaacob, Sazali ; Bakar, Shahriman Abu ; Saidatul, A.
Author_Institution :
Fac. of Electr. Eng., Univ. Teknol. MARA Malaysia, Shah Alam, Malaysia
Abstract :
In Malaysia, most people speak one of several varieties of English collectively known as Malaysian English (MalE); there is no uniform version because the population is multi-ethnic. Malaysians commonly speak with a local Malay, Chinese or Indian English accent. Because most commercial speech recognizers have been developed using standard English, achieving high performance is challenging when accented speech is presented to such systems. Accent identification (AccID) can serve as a subsystem of a speaker-independent automatic speech recognition (SI-ASR) system to address this performance degradation. In this paper, the most important speech features of MalE speakers from three ethnic groups are extracted as Linear Predictive Coding (LPC), formant and log energy feature vectors. In the subsequent stage, the accent identity of a speaker is predicted from the extracted features using a K-Nearest Neighbors (KNN) classifier. Beforehand, the preprocessing parameters and the LPC order are investigated so that the speech features are extracted properly. The study is conducted on a small speech corpus developed as a pilot study to determine the feasibility of automatic AccID for MalE speakers, which has not been reported before. The experimental results indicate a promising recognition accuracy of 94.2% using the fused feature set of LPC, formants and log energy.
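The pipeline described in the abstract (LPC, formant and log energy features fused into one vector, then classified with KNN) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not state how the LPC coefficients or formants were computed, so the autocorrelation method with Levinson-Durbin recursion, the root-based formant estimate, the Euclidean majority-vote KNN, and the default values (LPC order 12, three formants, k = 3, a 90 Hz lower cutoff) are all assumptions. The function names are hypothetical.

```python
import numpy as np

def log_energy(frame):
    """Natural log of the frame's total energy (small offset avoids log(0))."""
    return float(np.log(np.sum(frame ** 2) + 1e-12))

def lpc_coefficients(frame, order):
    """LPC via the autocorrelation method and Levinson-Durbin recursion.

    Returns a[0..order] with a[0] = 1, so that A(z) = 1 + sum_k a[k] z^-k.
    Assumption: a Hamming window is applied before the autocorrelation.
    """
    windowed = frame * np.hamming(len(frame))
    full = np.correlate(windowed, windowed, mode="full")
    r = full[len(windowed) - 1 : len(windowed) + order]  # lags 0..order
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0] + 1e-12  # prediction error energy, guarded against silence
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient for this stage
        prev = a.copy()
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= (1.0 - k * k)
    return a

def estimate_formants(a, fs, n_formants=3):
    """Formant candidates from the angles of the LPC polynomial roots.

    Assumption: no bandwidth check is applied; only roots above 90 Hz in the
    upper half-plane are kept, and missing formants are padded with zeros.
    """
    roots = np.roots(a)
    roots = roots[np.imag(roots) > 0]
    freqs = np.sort(np.angle(roots) * fs / (2.0 * np.pi))
    freqs = freqs[freqs > 90.0]
    out = np.zeros(n_formants)
    out[: min(n_formants, len(freqs))] = freqs[:n_formants]
    return out

def extract_features(frame, fs, order=12, n_formants=3):
    """Fuse LPC coefficients, formants and log energy into one vector."""
    a = lpc_coefficients(frame, order)
    formants = estimate_formants(a, fs, n_formants)
    return np.concatenate([a[1:], formants, [log_energy(frame)]])

def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k nearest training vectors (Euclidean)."""
    dists = np.linalg.norm(train_x - query, axis=1)
    nearest = train_y[np.argsort(dists)[:k]]
    labels, counts = np.unique(nearest, return_counts=True)
    return labels[np.argmax(counts)]
```

In use, extract_features would be applied to each preprocessed speech frame (or averaged per utterance), and knn_predict would compare a query vector against labelled vectors from the three accent groups. Because formants in Hz are much larger in magnitude than LPC coefficients, a real system would likely normalize each feature dimension before computing distances.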
Keywords :
feature extraction; learning (artificial intelligence); linear predictive coding; pattern classification; sensor fusion; speech coding; speech recognition; AccID; LPC; MalE speakers; Malaysian English accents identification; SI-ASR; accent identification; accented speech; feature fusion sets; formant analysis; k-nearest neighbors classifier; linear predictive coding; log energy feature vectors; multiethnic population; speaker accent identity; speaker independent automatic speech recognition system; speech features extraction; speech recognizers; Conferences; Databases; Feature extraction; Hidden Markov models; Resonant frequency; Speech; Speech recognition; Accent Identification; Formant; K-Nearest Neighbor; Linear Predictive Coding; Malaysian English;
Conference_Title :
Control System, Computing and Engineering (ICCSCE), 2011 IEEE International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-1640-9
DOI :
10.1109/ICCSCE.2011.6190572