DocumentCode :
3744845
Title :
Combination of syllable based N-gram search and word search for spoken term detection through spoken queries and IV/OOV classification
Author :
Nagisa Sakamoto;Kazumasa Yamamoto;Seiichi Nakagawa
Author_Institution :
Toyohashi University of Technology, Japan
fYear :
2015
Firstpage :
200
Lastpage :
206
Abstract :
This paper presents a Japanese spoken term detection method for spoken queries using a combination of word-based search and syllable-based N-gram search with in-vocabulary/out-of-vocabulary (IV/OOV) term classification. The N-gram index in a recognized syllable-based lattice for OOV terms, which assumes recognition errors such as substitution, insertion and deletion errors, incorporates a distance metric as a confidence score. To address spoken queries, we propose an automatic method for discriminating IV and OOV terms by using the confidence scores of spoken queries through large-vocabulary/syllable continuous speech recognition. Evaluation on an academic lecture presentation database with 44 hours of data shows that the combination of word search and syllable-based N-gram search yields significant improvement and outperforms the baseline syllable-based DTW approach.
Keywords :
"Lattices","Arrays","Speech recognition","Acoustics","Speech","Indexing"
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on
Type :
conf
DOI :
10.1109/ASRU.2015.7404795
Filename :
7404795
Link To Document :
بازگشت