Title :
Featherweight phonetic keyword search for conversational speech
Author :
Kintzley, Keith ; Jansen, Anton ; Hermansky, Hynek
Author_Institution :
U.S. Naval Acad., Annapolis, MD, USA
Abstract :
The point process model (PPM) for keyword search is a phonetic event-driven approach that provides a whole-word focused alternative to fast lattice matching techniques. Recent efforts in PPMs have been focused on improved model estimation techniques and efficient search algorithms, but past evaluations have been limited to searching relatively easy scripted corpora for simple unigram queries, preventing comprehensive benchmarking against standard search methods. In this paper, we present techniques for score normalization and the processing of multi-word and out of training query terms as required by the 2006 NIST Spoken Term Detection (STD) evaluation, permitting the first comprehensive benchmark of PPM search technology against state-of-the-art word and phonetic-based search systems. We demonstrate PPM to be the fastest phonetic system while posting accuracies competitive with the best phonetic alternatives. Moreover, index construction time and size are better than any keyword search system entered in the NIST evaluation.
Keywords :
query processing; speech processing; NIST spoken term detection evaluation; PPM search technology; STD evaluation; conversational speech; fast lattice matching techniques; featherweight phonetic keyword search system; improved model estimation techniques; multiword processing; phonetic event-driven approach; point process model; score normalization; standard search methods; training query terms; unigram queries; Computational modeling; Hidden Markov models; Indexes; Lattices; Speech; Speech processing; Training; compact speech indexing; point process model; score normalization; spoken term detection;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6855130