Title :
Zero resource graph-based confidence estimation for open vocabulary spoken term detection
Author :
Norouzian, Atta ; Rose, Rachel ; Ghalehjegh, Sina Hamidi ; Jansen, Anton
Author_Institution :
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
Abstract :
In this paper the use of acoustic similarity of speech intervals for generating improved confidence scores for spoken term detection (STD) is investigated. A procedure based on acoustic dotplots which requires no training data is deployed for discovering similar speech intervals. A graph based random walk algorithm incorporates acoustic similarity of hypothesized term occurrences for improving the corresponding confidence scores. The proposed approach is evaluated in an open vocabulary STD task defined on a lecture domain corpus. It is shown that updating the confidence scores in this fashion results in a significant increase in term detection performance of out of vocabulary search terms. A relative improvement of 12.9% in figure of merit was gained relative to that obtained from a baseline lattice based STD system.
Keywords :
acoustic signal detection; graph theory; speech recognition; vocabulary; acoustic dotplots; acoustic similarity; baseline lattice; confidence estimation; confidence scores; graph based random walk; lecture domain corpus; open vocabulary spoken term detection; speech intervals; vocabulary search terms; zero resource graph; Acoustic measurements; Acoustics; Feature extraction; Lattices; Speech; Vectors; Vocabulary; Dotplot; Open vocabulary spoken term detection; Random walk on directional graphs;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639282