DocumentCode :
2770462
Title :
Improvements in phone based audio search via constrained match with high order confusion estimates
Author :
Chaudhari, Upendra V. ; Picheny, Michael
Author_Institution :
IBM T.J. Watson Res. Center, Yorktown Heights
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
665
Lastpage :
670
Abstract :
This paper investigates an approximate similarity measure for searching in phone based audio transcripts. The baseline method combines elements found in the literature to form an approach based on a phonetic confusion matrix that is used to determine the similarity of an audio document and a query, both of which are parsed into phone N-grams. Experimental results show comparable performance to other approaches in the literature. Extensions of the approach are developed based on a constrained form of the similarity measure that can take into consideration the system dependent errors that can occur. This is done by accounting for higher order confusions, namely of phone bi-grams and tri-grams. Results show improved performance across a variety of system configurations.
Keywords :
audio databases; matrix algebra; constrained match; high order confusion estimates; phone N-grams; phone based audio search; phone based audio transcripts; phone bi-grams; phonetic confusion matrix; tri-grams; Automatic speech recognition; Context modeling; Decoding; Indexing; Information retrieval; Lattices; Natural languages; Speech processing; Speech recognition; Vocabulary; Phone; approximate; indexing; search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430191
Filename :
4430191
Link To Document :
بازگشت