• DocumentCode
    1940518
  • Title

    Malaysian English accents identification using LPC and formant analysis

  • Author

    Yusnita, M.A. ; Paulraj, M.P. ; Yaacob, Sazali ; Bakar, Shahriman Abu ; Saidatul, A.

  • Author_Institution
    Fac. of Electr. Eng., Univ. Teknol. MARA Malaysia, Shah Alam, Malaysia
  • fYear
    2011
  • fDate
    25-27 Nov. 2011
  • Firstpage
    472
  • Lastpage
    476
  • Abstract
    In Malaysia, most people speak several varieties of English known as Malaysian English (MalE) and there is no uniform version because of the existence of multi-ethnic population. It is a common scenario that Malaysians speak a particular local Malay, Chinese or Indian English accent. As most commercial speech recognizers have been developed using a standard English language, it is a challenging task for achieving highly efficient performance when other accented speech are presented to this system. Accent identification (AccID) can be one of the subsystem in speaker independent automatic speech recognition (SI-ASR) system so that deterioration issue in its performance can be tackled. In this paper, the most important speech features of three ethnic groups of MalE speakers are extracted using Linear Predictive Coding (LPC), formant and log energy feature vectors. In the subsequent stage, the accent identity of a speaker is predicted using K-Nearest Neighbors (KNN) classifier based on the extracted information. Prior, the preprocessing parameters and LPC order are investigated to properly extract the speech features. This study is conducted on a small set speech corpus developed as pilot study to determine the feasibility of automatic AccID of MalE speakers which has never been reported before. The experimental results indicate a highly promising recognition accuracy of 94.2% upon feature fusion sets of LPC, formants and log energy.
  • Keywords
    feature extraction; learning (artificial intelligence); linear predictive coding; pattern classification; sensor fusion; speech coding; speech recognition; AccID; LPC; MalE speakers; Malaysian English accents identification; SI-ASR; accent identification; accented speech; feature fusion sets; formant analysis; k-nearest neighbors classifier; linear predictive coding; log energy feature vectors; multiethnic population; speaker accent identity; speaker independent automatic speech recognition system; speech features extraction; speech recognizers; Conferences; Databases; Feature extraction; Hidden Markov models; Resonant frequency; Speech; Speech recognition; Accent Identification; Formant; K-Nearest Neighbor; Linear Predictive Coding; Malaysian English;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control System, Computing and Engineering (ICCSCE), 2011 IEEE International Conference on
  • Conference_Location
    Penang
  • Print_ISBN
    978-1-4577-1640-9
  • Type

    conf

  • DOI
    10.1109/ICCSCE.2011.6190572
  • Filename
    6190572