Title : 
Speaker independent Sinhala speech recognition for voice dialling
         
        
            Author : 
Amarasingha, W.G.T.N. ; Gamini, D.D.A.
         
        
            Author_Institution : 
Dept. of Stat. & Comput. Sci., Univ. of Sri Jayewardenepura, Gangodawila, Sri Lanka
         
        
        
        
        
        
            Abstract : 
Speech is the most natural and the most powerful way of communication between humans. Speech recognition for voice dialling applications has already been developed for languages such as English, French and Japanese, etc. However, there is no evidence of existence of such applications for voice dialling in Sinhala language with speaker independent environment. The work described in this paper is based on an attempt to implement a speaker independent Sinhala speech recognizer and a voice dialling application which has been used to communicate with a VoIP application. The underlying concept of building the speech recognizer is Hidden Markov Model (HMM) and the system is developed using the Hidden Markov Model Toolkit (HTK). The first stages of building the speech recognizer involve the preparation of speech samples for training and the creation of the pronunciation dictionary which lists all the speech samples along with their phonetic representations. A noise reduction method has been applied at the front end of the voice dialling application to clean up the speech signal from the beginning. The middle stages comprise of employing a good feature extraction technique to enhance the speech recognition, and building and training the acoustic model to match a spoken digit to the observed input while the latter stages involve the creation of the language model to determine which digit has spoken. The results show that 87.37% of the digits are correctly recognized by the speech recognizer under quiet environment while 82.19% of the digits are correctly recognized in noisy environment.
         
        
            Keywords : 
Internet telephony; dictionaries; hidden Markov models; natural language processing; signal denoising; speech recognition; English; French; HMM; HTK; Japanese; VoIP application; acoustic model; feature extraction technique; hidden Markov model toolkit; noise reduction method; phonetic representations; pronunciation dictionary; speaker independent Sinhala speech recognition; speech recognizer; speech sample preparation; speech signal; voice dialling application; Acoustics; Databases; Feature extraction; Grammar; Laboratories; Speech recognition; Vectors; Acoustic model; Hidden Markov model; Language model; Noise reduction; Pronunciation dictionary; Speech recognition; Speech understanding;
         
        
        
        
            Conference_Titel : 
Advances in ICT for Emerging Regions (ICTer), 2012 International Conference on
         
        
            Conference_Location : 
Colombo
         
        
            Print_ISBN : 
978-1-4673-5529-2
         
        
        
            DOI : 
10.1109/ICTer.2012.6422064