DocumentCode
3744845
Title
Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification
Author
Nagisa Sakamoto;Kazumasa Yamamoto;Seiichi Nakagawa
Author_Institution
Toyohashi University of Technology, Japan
fYear
2015
Firstpage
200
Lastpage
206
Abstract
This paper presents a Japanese spoken term detection method for spoken queries using a combination of word-based search and syllable-based N-gram search with in-vocabulary/out-of-vocabulary (IV/OOV) term classification. The N-gram index in a recognized syllable-based lattice for OOV terms, which assumes recognition errors such as substitution, insertion and deletion errors, incorporates a distance metric as a confidence score. To address spoken queries, we propose an automatic method for discriminating IV and OOV terms by using the confidence scores of spoken queries through large-vocabulary/syllable continuous speech recognition. Evaluation on an academic lecture presentation database with 44 hours of data shows that the combination of word search and syllable-based N-gram search yields significant improvement and outperforms the baseline syllable-based DTW approach.
Keywords
"Lattices","Arrays","Speech recognition","Acoustics","Speech","Indexing"
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on
Type
conf
DOI
10.1109/ASRU.2015.7404795
Filename
7404795
Link To Document