مرکز منطقه ای اطلاع رساني علوم و فناوري - Analysis of constraints on segmental DTW for the task of query-by-example spoken term detection

DocumentCode :

3765001

Title :

Analysis of constraints on segmental DTW for the task of query-by-example spoken term detection

Author :

Sri Harsha Dumpala;K N R K Raju Alluri;Suryakanth V. Gangashetty;Anil Kumar Vuppala

Author_Institution :

Speech and Vision Lab., International Institute of Information Technology, Hyderabad, India

fYear :

2015

Firstpage :

Lastpage :

Abstract :

Query-by-example spoken term detection (QbE-STD) refers to the task of determining the subsequence of a reference which matches with a query, where both the query and the reference are in audio format. Dynamic time warping (DTW) based techniques are explored to match the two sequences with different lengths in an unsupervised manner. In this paper, a completely unsupervised approach based on Segmental DTW (SDTW), a variant of DTW, is considered for the task of QbE-STD where both reference and query utterances are represented using a sequence of Gaussian posteriorgram vectors. SDTW using two different types of bands i.e., Sakoe-Chiba band and Itakura parallelogram is considered to compare the Gaussian posteriorgrams of the query and the reference sequence. The effect of varying different local constraints of the DTW algorithm on the performance of SDTW is also analyzed in this paper. Results obtained on MediaEval 2012 dataset indicate that SDTW using a band with variable speaking rate, as in Itakura parallelogram, performs better compared to that of using a band with fixed speaking rate, as in Sakoe-Chiba band, across all variations in local constraints.

Keywords :

"Speech","Feature extraction","Databases","Mel frequency cepstral coefficient","Information technology","Electronic mail"

Publisher :

ieee

Conference_Titel :

India Conference (INDICON), 2015 Annual IEEE

Electronic_ISBN :

2325-9418

Type :

conf

DOI :

10.1109/INDICON.2015.7443702

Filename :

7443702

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3765001