DocumentCode :
3776164
Title :
Isolated Bangla word recognition and speaker detection by semantic modular time delay neural network (MTDNN)
Author :
Md. Yasin Ali Khan;S. M. Mostaq Hossain;Mohammed Moshiul Hoque
Author_Institution :
Department of Computer Science and Engineering, Chittagong University of Engineering and Technology, Chittagong-4349, Bangladesh
fYear :
2015
Firstpage :
560
Lastpage :
565
Abstract :
Speaker recognition is the identification of a person from the characteristics of his or her voice, whereas speech recognition concerns recognizing what the speaker is saying. This paper presents a framework that recognizes isolated Bangla words and the corresponding speaker using a proposed semantic modular time delay neural network (MTDNN). The underlying acoustic fuzziness of human utterances and fluctuations in the data due to environmental disturbance are handled by the well-known Fuzzy C-Means clustering technique. MFCC features are used for both Bangla word recognition and speaker detection. Experimental results with different individuals show that the proposed framework performs satisfactorily, with an average accuracy of 82.66%.
Keywords :
"Speech recognition","Speech","Mel frequency cepstral coefficient","Biological neural networks","Delay effects","Training"
Publisher :
IEEE
Conference_Title :
Computer and Information Technology (ICCIT), 2015 18th International Conference on
Type :
conf
DOI :
10.1109/ICCITechn.2015.7488134
Filename :
7488134