Title :
Three steps of Neuron Network classification for EMG-based Thai tones speech recognition
Author :
Srisuwan, Niyawadee ; Phukpattaranont, Pornchai ; Limsakul, Chamnan
Author_Institution :
Dept. of Electr. Eng., Prince of Songkla Univ., Songkhla, Thailand
Abstract :
In order to overcome the problem existing in original speech recognition (e.g. noise interruption and private data loss), many researchers have investigated to deal with these problems. Electromyography (EMG) from the muscles producing speech was used to replace a voiced signal. Similarly, we aim to develop EMG speech recognition based on Thai language. Tone is the important characteristic of this language. Hence, Thai tone classification is the first work that was explored. This paper proposes the new technique that can classify five Thai tones for EMG-based Thai speech recognition. This method can overcome the limitation of our previous work that we can classify only two tones. EMG was captured from six positions of the strap muscles and facial muscles while a volunteer was uttering 21 Thai isolated words and five tones of each word (total 105 words). The 68 EMG features were calculated, and RES index was used to evaluate clustering capability of each feature. Top five features that have high value of RES index were selected. Neuron Network (NN) was used for tone classification. We found that Modify Mean Absolute Value 2nd type (MMAV2) is the best features. It yielded an accuracy rate of 56.2% for five Thai tones classification. However, it is not enough for our work. In order to improve the accuracy rate, the three steps of NN Classification was proposed. This technique is the series of three networks of NN classifier. Each network will classify different tones, and use distinct features. We obtained an accuracy rate of 80% for five Thai tones classification from this technique.
Keywords :
electromyography; feature extraction; medical signal processing; muscle; neurophysiology; signal classification; speech processing; speech recognition; EMG features; EMG-based Thai tone speech recognition; MMAV2; NN classification; RES index; Thai language; clustering capability; electromyography; facial muscles; modify mean absolute value 2nd type; muscle producing speech; neuron network classification; noise interruption; strap muscles; voiced signal; Accuracy; Artificial neural networks; Electromyography; Muscles; Neurons; Speech; Speech recognition; Electromyography; Myoelectric signal; Neuron Network; Speech recognition; Thai tone;
Conference_Titel :
Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2013 10th International Conference on
Conference_Location :
Krabi
Print_ISBN :
978-1-4799-0546-1
DOI :
10.1109/ECTICon.2013.6559639