DocumentCode :
2386972
Title :
Evaluating endpoint detection algorithms for isolated word from Malay parliamentary speech
Author :
Seman, Noraini ; Bakar, Z.A. ; Bakar, N.A. ; Mohamed, Haslizatul Fairuz ; Abdullah, Nur Atiqah Sia ; Ramakrisnan, Prasanna ; Ahmad, Sharifah Mumtazah Syed
Author_Institution :
Dept. of Comput. Sci., Univ. Teknol. MARA (UiTM), Shah Alam, Malaysia
fYear :
2010
fDate :
17-18 March 2010
Firstpage :
291
Lastpage :
296
Abstract :
This paper presents the endpoint detection approaches specifically for an isolated word uses Malay spoken speeches from Malaysian Parliamentary session. Currently, there are 7,995 vocabularies of utterances in the database collection and for the purpose of this study; the vocabulary is limited to ten words which are most frequently spoken selected from ten speakers. Endpoint detection, which aims to distinguish the speech and non-speech segments of digital speech signal, is considered as one of the key preprocessing steps in speech recognition system. Proper estimation of the start and end of the speech (versus silence or background noise) avoids the waste of speech recognition evaluations on preceding or ensuing silence. In this study, the endpoint detection and speech segmentation task is achieved by using the short-time energy (STE) and short-time zero crossing (STZC) measures and combination of both approaches. As a result, the Hidden Markov Model (HMM) recognizer derived the recognition accuracy rate of 91.4% for combination of both algorithms, if compared only 86.3% for STE and 82.1% for STZC rate alone. The experiments show that there are many problems arise where there are still misdetection of word boundaries for the words with weak fricative and nasal sounds. Other obstacles issues such as speaking styles or mood of speaking can also cause the recognition performance.
Keywords :
hidden Markov models; natural language processing; speech processing; speech recognition; word processing; Malay Parliamentary speech; digital speech signal; endpoint detection; hidden Markov model; isolated word; short time energy; short time zero crossing; speech recognition system; speech segmentation; vocabulary; Background noise; Databases; Detection algorithms; Energy measurement; Hidden Markov models; Mood; Speech analysis; Speech enhancement; Speech recognition; Vocabulary; endpoint detection; infinite impulse response; mel frequency cepstral coefficient; short-time energy; short-time zero crossing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Retrieval & Knowledge Management, (CAMP), 2010 International Conference on
Conference_Location :
Shah Alam, Selangor
Print_ISBN :
978-1-4244-5650-5
Type :
conf
DOI :
10.1109/INFRKM.2010.5466898
Filename :
5466898
Link To Document :
بازگشت