DocumentCode
431041
Title
An isolated speech endpoint detector using multiple speech features
Author
Ahmad, Abdul Manan ; Eng, Goh Kia ; Shaharoun, Awaluddin Mohamed ; Yeek, Tan Chiu ; Jarni, Muhamad Hafiz Bin
Author_Institution
Fac. of Comput. Sci. & Inf. Syst., Universiti Teknologi Malaysia, Johor, Malaysia
Volume
B
fYear
2004
fDate
21-24 Nov. 2004
Firstpage
403
Abstract
Energy and zero crossing rate of the speech signal have been the two most widely used features for detecting the endpoints of an utterance. This paper proposed a new approach for locating the endpoint for isolated speech, which significantly improve the endpoint detector performance. The proposed algorithm relies on multiple speech features: root mean square energy (rmse), zero crossing rate (zcr) and cepstral coefficient (cepstrum) where the Euclidean distance measure is adopted to accurately detect the endpoint of an isolated utterance. This algorithm offers better performance than conventional algorithm which using energy only. The vocabulary for the experiment includes English digit from 1 to 9. These experimental results were conducted by 360 utterances from a male speaker. Experimental results show that the accuracy of the algorithm is quite acceptable.
Keywords
cepstral analysis; feature extraction; mean square error methods; natural languages; signal detection; speaker recognition; vocabulary; English digit; Euclidean distance measure; RMSE; ZCR; cepstral coefficient; energy-zero crossing rate; isolated speech endpoint detector; male speaker; multiple speech feature; root mean square energy; vocabulary; Automatic speech recognition; Background noise; Cepstral analysis; Cepstrum; Computer science; Detectors; Frequency; Noise level; Speech enhancement; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2004. 2004 IEEE Region 10 Conference
Print_ISBN
0-7803-8560-8
Type
conf
DOI
10.1109/TENCON.2004.1414617
Filename
1414617
Link To Document