Title :
Fast Spoken Term Detection using pre-retrieval results of syllable bigrams
Author :
Saito, Hiroshi ; Itoh, Yoshio ; Kojima, Keisuke ; Ishigame, Masaaki ; Tanaka, Kiyoshi ; Shi-wook Lee
Author_Institution :
Fac. of Software, Iwate Prefectural Univ., Iwate, Japan
Abstract :
We propose a method of the Spoken Term Detection (STD) based on a priori retrieval results in which plural syllables are used as query terms. In the proposed method, all N-syllable combinations such as syllable bigrams are searched for in spoken documents. In the first step of the method, the retrieval results are prepared a priori, where pre-retrieval results include candidates with scores matching those of each N-syllable sequence. Given a query, the syllable sequence of the query is divided into plural syllable sequences whose lengths are the same as those of the pre-retrieval results. In the second step, the candidate sections are filtered by using the scores of query´s syllable combinations. This reduction in the number of candidate sections for detailed matching leads to a large reduction of the retrieval time. In the third step, these candidates sections are rescored by performing detailed matching. Experimental results show that the proposed method reduces the retrieval time by 93% with a performance degradation of less than 2 points.
Keywords :
query processing; speech processing; N-syllable combination; N-syllable sequence; plural syllable sequence; preretrieval result; query term; spoken document; spoken term detection; syllable bigram; Acoustics; Conferences; Hidden Markov models; Matched filters; Speech; Speech recognition;
Conference_Titel :
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location :
Hollywood, CA
Print_ISBN :
978-1-4673-4863-8