Title of article :
A Systematic Approach for Malay Language Dialect Identification by Using CNN
Author/Authors :
sulaiman, mohd azman hanif universiti teknologi mara - school of electrical engineering, college of engineering, Shah Alam, Malaysia , abd aziz, nurhakimah universiti teknologi mara - school of electrical engineering, college of engineering, Shah Alam, Malaysia , zabidi, azlee university malaysia pahang (ump) - faculty of computing, Pahang, Malaysia , jantan, zuraidah university technology mara (uitm) - academy language of study (als), Shah Alam, Malaysia , mohd yassin, ihsan universiti teknologi mara - school of electrical engineering, college of engineering, Shah Alam, Malaysia , ali, megat syahirul amin megat universiti teknologi mara - school of electrical engineering, college of engineering, Shah Alam, Malaysia , eskandari, farzad alahmeh tabatabai university, tehran, iran
From page :
25
To page :
37
Abstract :
As Malaysia moves forward towards the Industrial Revolution (IR 4. 0), computer systems have become part of everyday life, leading to increased man-machine interactions. Verbal communication is a convenient means to interact with computers. Speech recognition systems need to be robust to cater for various languages and dialects in order to interact better with humans. Dialects within a spoken language present a challenge for computers require a speech recognition system to translate these verbal commands to computer understanding of the underlying meaning from spoken words. In this paper, works on Malay language dialect identification are presented using Convolution Neural Network (CNN) trained on Mel Frequency Cepstral Coefficient (MFCC) features. Data was collected from 12 native speakers. Each speaker was instructed to utter 10 carefully selected words to emphasize the dialect nuances of the eastern, northern and central (standard) Malay dialect. The MFCC features were then extracted from the recorded audio samples and converted to graphical form. The images were then used to train a custom CNN neural network to differentiate between the various spoken words and their dialects. Results demonstrate that CNN was able to effectively differentiate between the spoken words with excellent accuracy (between 85% and 100%).
Keywords :
Convolution Neural Network , Mel Frequency Cepstrum Coefficient , speech recognition , dialect recognition
Journal title :
journal of electrical and electronic systems research
Journal title :
journal of electrical and electronic systems research
Record number :
2705103
Link To Document :
بازگشت