DocumentCode
119432
Title
Impact of acoustical voice activity detection on spontaneous filled pause classification
Author
Hamzah, Raseeda ; Jamil, Nursuriati ; Seman, Noraini ; Ardi, Norizah ; Doraisamy, Shyamala C.
Author_Institution
Comput. Sci. Dept., Univ. Teknol. MARA, Shah Alam, Malaysia
fYear
2014
fDate
26-28 Oct. 2014
Firstpage
1
Lastpage
6
Abstract
Filled pause detection is imperative for spontaneous speech recognition as it may degrade speech recognition rate. However, filled pause is commonly confused with elongation as they shared the same acoustical properties. Few attempts of classifying filled pause and elongation employed Hidden Markov model. Our proposed method of utilizing Neural Network as a classifier achieved 96% precision rate. We also proved that voice activity detection (VAD) affects the performance of speech recognition. Three acoustical-based VAD are compared and the best precision rate is achieved by incorporating volume and first-order difference features. Experiments are conducted using Malay language spontaneous speeches of Malaysia Parliamentary Debate sessions.
Keywords
hidden Markov models; neural nets; speech recognition; Malay language spontaneous speeches; Malaysia Parliamentary Debate sessions; acoustical properties; acoustical voice activity detection; acoustical-based VAD; first-order difference features; hidden Markov model; neural network; speech recognition rate; spontaneous filled pause classification; spontaneous speech recognition; Artificial neural networks; Feature extraction; Mel frequency cepstral coefficient; Neurons; Speech; Speech recognition; Training; elongations; filled pause; multi-layer perceptron neural network; voice activity detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Open Systems (ICOS), 2014 IEEE Conference on
Conference_Location
Subang
Print_ISBN
978-1-4799-6366-9
Type
conf
DOI
10.1109/ICOS.2014.7042400
Filename
7042400
Link To Document