Title :
Speech-Annotated Photo Retrieval Using Syllable-Transformed Patterns
Author :
Wu, Chung-Hsien ; Huang, Chien-Lin ; Lee, Wei-Chuan ; Lai, Yu-Sheng
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan
Abstract :
This study presents a novel indexing and retrieval scheme for digital photos with speech annotations based on syllable-transformed image-like patterns. Speech recognition error and out-of-vocabulary (OOV) problems generally result in incorrect indexing and degrade the retrieval performance. In this study, the recognized n -best candidates used to deal with recognition error problems are transformed into an image-like pattern using multidimensional scaling. A hybrid mechanism integrating syllables, characters, words, and image-like patterns is exploited for speech indexing and retrieval. Experiments show the hybrid indexing method integrating the syllable-transformed image-like patterns can achieve a better result compared to previous indexing methods.
Keywords :
image retrieval; speech recognition; digital photo; out-of-vocabulary problem; recognition error problem; speech indexing method; speech recognition error; speech-annotated photo retrieval; syllable-transformed image-like pattern; Digital cameras; Frequency; Image recognition; Image retrieval; Indexing; Lattices; Multidimensional systems; Pattern recognition; Speech analysis; Speech recognition; Multidimensional scaling; speech retrieval; syllable-transformed patterns;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2008.2008490