Malaysian English accents identification using LPC and formant analysis

Author

Yusnita, M.A. ; Paulraj, M.P. ; Yaacob, Sazali ; Bakar, Shahriman Abu ; Saidatul, A.

Author_Institution

Fac. of Electr. Eng., Univ. Teknol. MARA Malaysia, Shah Alam, Malaysia

fYear

2011

fDate

25-27 Nov. 2011

Firstpage

472

Lastpage

476

Abstract

In Malaysia, most people speak several varieties of English known as Malaysian English (MalE) and there is no uniform version because of the existence of multi-ethnic population. It is a common scenario that Malaysians speak a particular local Malay, Chinese or Indian English accent. As most commercial speech recognizers have been developed using a standard English language, it is a challenging task for achieving highly efficient performance when other accented speech are presented to this system. Accent identification (AccID) can be one of the subsystem in speaker independent automatic speech recognition (SI-ASR) system so that deterioration issue in its performance can be tackled. In this paper, the most important speech features of three ethnic groups of MalE speakers are extracted using Linear Predictive Coding (LPC), formant and log energy feature vectors. In the subsequent stage, the accent identity of a speaker is predicted using K-Nearest Neighbors (KNN) classifier based on the extracted information. Prior, the preprocessing parameters and LPC order are investigated to properly extract the speech features. This study is conducted on a small set speech corpus developed as pilot study to determine the feasibility of automatic AccID of MalE speakers which has never been reported before. The experimental results indicate a highly promising recognition accuracy of 94.2% upon feature fusion sets of LPC, formants and log energy.

Keywords

feature extraction; learning (artificial intelligence); linear predictive coding; pattern classification; sensor fusion; speech coding; speech recognition; AccID; LPC; MalE speakers; Malaysian English accents identification; SI-ASR; accent identification; accented speech; feature fusion sets; formant analysis; k-nearest neighbors classifier; linear predictive coding; log energy feature vectors; multiethnic population; speaker accent identity; speaker independent automatic speech recognition system; speech features extraction; speech recognizers; Conferences; Databases; Feature extraction; Hidden Markov models; Resonant frequency; Speech; Speech recognition; Accent Identification; Formant; K-Nearest Neighbor; Linear Predictive Coding; Malaysian English;

fLanguage

English

Publisher

ieee

Conference_Titel

Control System, Computing and Engineering (ICCSCE), 2011 IEEE International Conference on

Conference_Location

Penang

Print_ISBN

978-1-4577-1640-9

Type

conf

DOI

10.1109/ICCSCE.2011.6190572

Filename

6190572