مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker independent Sinhala speech recognition for voice dialling

DocumentCode :

3104739

Title :

Speaker independent Sinhala speech recognition for voice dialling

Author :

Amarasingha, W.G.T.N. ; Gamini, D.D.A.

Author_Institution :

Dept. of Stat. & Comput. Sci., Univ. of Sri Jayewardenepura, Gangodawila, Sri Lanka

fYear :

2012

fDate :

12-15 Dec. 2012

Firstpage :

Lastpage :

Abstract :

Speech is the most natural and the most powerful way of communication between humans. Speech recognition for voice dialling applications has already been developed for languages such as English, French and Japanese, etc. However, there is no evidence of existence of such applications for voice dialling in Sinhala language with speaker independent environment. The work described in this paper is based on an attempt to implement a speaker independent Sinhala speech recognizer and a voice dialling application which has been used to communicate with a VoIP application. The underlying concept of building the speech recognizer is Hidden Markov Model (HMM) and the system is developed using the Hidden Markov Model Toolkit (HTK). The first stages of building the speech recognizer involve the preparation of speech samples for training and the creation of the pronunciation dictionary which lists all the speech samples along with their phonetic representations. A noise reduction method has been applied at the front end of the voice dialling application to clean up the speech signal from the beginning. The middle stages comprise of employing a good feature extraction technique to enhance the speech recognition, and building and training the acoustic model to match a spoken digit to the observed input while the latter stages involve the creation of the language model to determine which digit has spoken. The results show that 87.37% of the digits are correctly recognized by the speech recognizer under quiet environment while 82.19% of the digits are correctly recognized in noisy environment.

Keywords :

Internet telephony; dictionaries; hidden Markov models; natural language processing; signal denoising; speech recognition; English; French; HMM; HTK; Japanese; VoIP application; acoustic model; feature extraction technique; hidden Markov model toolkit; noise reduction method; phonetic representations; pronunciation dictionary; speaker independent Sinhala speech recognition; speech recognizer; speech sample preparation; speech signal; voice dialling application; Acoustics; Databases; Feature extraction; Grammar; Laboratories; Speech recognition; Vectors; Acoustic model; Hidden Markov model; Language model; Noise reduction; Pronunciation dictionary; Speech recognition; Speech understanding;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Advances in ICT for Emerging Regions (ICTer), 2012 International Conference on

Conference_Location :

Colombo

Print_ISBN :

978-1-4673-5529-2

Type :

conf

DOI :

10.1109/ICTer.2012.6422064

Filename :

6422064

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3104739