Title :
Use of geographical meta-data in ASR language and acoustic models
Author :
Bocchieri, Enrico ; Caseiro, Diamantino
Author_Institution :
AT&T Res., Florham Park, NJ, USA
Abstract :
The query distribution, in the speech recognition applications of directory assistance (DA) and voice-search, depends on the customer´s location. This motivates the research on query models conditioned on the user location, here denoted as local models. We describe and test our methods for the estimation of local models with various degrees of spacial “granularity”, for the recognition of city-state (sub-task of DA) and for the recognition of business listings, spoken over iPhones in a nation-wide business-listing voice-search service. Our local language models improve the accuracy of city-state by 2.4% absolute (32% relative error reduction), and of voice-search by 2.2% (7% relative).
Keywords :
geographic information systems; meta data; query processing; speech recognition; ASR language; acoustic models; business listings recognition; city-state recognition; customer location; directory assistance; geographical metadata; iPhones; local language models; nation-wide business-listing voice-search service; query distribution; query models; spacial granularity; speech recognition applications; user location; Acoustic applications; Automatic speech recognition; Cities and towns; Hidden Markov models; Natural languages; Speech recognition; State estimation; Telephony; Testing; User interfaces; ASR; Local; acoustic; language; metadata; model;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495026