DocumentCode
701811
Title
Experimental studies on effect of speaking mode on spoken term detection
Author
Rout, Kallola ; Reddy, Pappagari Raghavendra ; Sri Rama Murty, K.
Author_Institution
Dept. of Electr. Eng., Indian Inst. of Technol. Hyderabad, Hyderabad, India
fYear
2015
fDate
Feb. 27 2015-March 1 2015
Firstpage
1
Lastpage
6
Abstract
The objective of this paper is to study the effect of speaking mode on a spoken term detection (STD) system. Experiments are conducted with two types of query words: words recorded in isolation and words cut out from continuous speech. Phoneme durations in the query words vary greatly between these two modes, so the pattern matching stage, which handles temporal variations, plays a crucial role. Matching is done using subsequence dynamic time warping (DTW) on posterior features of the query and reference utterances, obtained from a trained multilayer perceptron (MLP). The difference in STD system performance for different phoneme groupings (45, 25, 15 and 6 classes) is also analyzed. The STD system is tested on Telugu broadcast news. A major difference in performance is observed between the recorded and cut-out types of query words: the system performs better with query words cut out from continuous speech than with words recorded in isolation. This performance difference can be attributed to the large temporal variations between the two modes.
Keywords
multilayer perceptrons; pattern matching; speech processing; DTW; STD system; Telugu broadcast news; dynamic time warping; multilayer perceptron; query posterior features; speaking mode effect; spoken term detection system; Feature extraction; Hidden Markov models; Indexes; Mel frequency cepstral coefficient; Speech; Training
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications (NCC), 2015 Twenty First National Conference on
Conference_Location
Mumbai
Type
conf
DOI
10.1109/NCC.2015.7084926
Filename
7084926