Title :
Introduction of false detection control parameters in spoken term detection
Author :
Furuya, Yasubumi ; Natori, Satoshi ; Nishizaki, Hiromitsu ; Sekiguchi, Yuta
Author_Institution :
Dept. of Educ., Univ. of Yamanashi, Kofu, Japan
Abstract :
This paper describes spoken term detection (STD) with false detection control. Our STD method uses phoneme transition network (PTN) derived by multiple automatic speech recognizers (ASRs) as an index. An PTN is almost the same to a sub-word based confusion network (CN), which is derived from an output of an ASR. The PTN-based index we proposed is made of the outputs of multiple ASRs, which is known to be robust to certain recognition errors and the out-of-vocabulary problem. Our PTN was very effective at detecting query terms. However, the PTN generates a lot of false detections especially for short query terms. Therefore, we applied two false detection control parameters to the Dynamic Time Warping-based term detection engine. In addition, we changed the search parameters depending on length of a query term. Finally, the STD performance was better (0.785 of F-measure) than without any parameters (0.717).
Keywords :
query processing; speaker recognition; ASR; CN; PTN-based index; STD; automatic speech recognition; confusion network; dynamic time warping-based term detection; false detection control parameter; phoneme transition network; query term detection; search parameter; spoken term detection; Educational institutions; Engines; Hidden Markov models; Indexing; Speech; Speech recognition;
Conference_Titel :
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location :
Hollywood, CA
Print_ISBN :
978-1-4673-4863-8