Title :
Effect of pronounciations on OOV queries in spoken term detection
Author :
Can, Dogan ; Cooper, Erica ; Sethy, Abhinav ; White, Chris ; Ramabhadran, Bhuvana ; Saraclar, Murat
Author_Institution :
Bogazici Univ., Istanbul
Abstract :
The spoken term detection (STD) task aims to return relevant segments from a spoken archive that contain the query terms whether or not they are in the system vocabulary. This paper focuses on pronunciation modeling for out-of-vocabulary (OOV) terms which frequently occur in STD queries. The STD system described in this paper indexes word-level and sub-word level lattices or confusion networks produced by an LVCSR system using weighted finite state transducers (WFST).We investigate the inclusion of n-best pronunciation variants for OOV terms (obtained from letter-to-sound rules) into the search and present the results obtained by indexing confusion networks as well as lattices. The following observations are worth mentioning: phone indexes generated from sub-words represent OOVs well and too many variants for the OOV terms degrade performance if pronunciations are not weighted.
Keywords :
query processing; speech recognition; out-of-vocabulary term; speech recognition; spoken term detection queries; weighted finite state transducers; Automatic speech recognition; Decoding; Dictionaries; Indexing; Information retrieval; Lattices; NIST; Speech recognition; Transducers; Vocabulary; Speech Indexing and Retrieval; Speech Recognition; Spoken Term Detection; Weighted Finite State Transducers;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960494