Title :
Isolated Bangla word recognition and speaker detection by semantic modular time delay neural network (MTDNN)
Author :
Md. Yasin Ali Khan;S. M. Mostaq Hossain;Mohammed Moshiul Hoque
Author_Institution :
Department of Computer Science and Engineering, Chittagong University of Engineering and Technology, Chittagong-4349, Bangladesh
Abstract :
Speaker recognition is the identification of a person from the characteristics of his or her voice, while speech recognition concerns recognizing what the speaker is saying. This paper presents a framework that recognizes isolated Bangla words and the corresponding speaker using a proposed semantic modular time delay neural network (MTDNN). The underlying acoustic fuzziness of human utterances and fluctuations in the data due to environmental disturbances are managed by the well-known Fuzzy C-Means clustering technique. We use MFCC features for both Bangla word recognition and speaker detection. Experimental results with different individuals show that the proposed framework performs satisfactorily, with an average accuracy of 82.66%.
Keywords :
"Speech recognition","Speech","Mel frequency cepstral coefficient","Biological neural networks","Delay effects","Training"
Conference_Titel :
Computer and Information Technology (ICCIT), 2015 18th International Conference on
DOI :
10.1109/ICCITechn.2015.7488134