مرکز منطقه ای اطلاع رساني علوم و فناوري - Estimating document frequencies in a speech corpus

DocumentCode :

3485826

Title :

Estimating document frequencies in a speech corpus

Author :

Karakos, Damianos ; Dredze, Mark ; Church, Ken ; Jansen, Aren ; Khudanpur, Sanjeev

fYear :

2011

fDate :

11-15 Dec. 2011

Firstpage :

407

Lastpage :

412

Abstract :

Inverse Document Frequency (IDF) is an important quantity in many applications, including Information Retrieval. IDF is defined in terms of document frequency, df (w), the number of documents that mention w at least once. This quantity is relatively easy to compute over textual documents, but spoken documents are more challenging. This paper considers two baselines: (1) an estimate based on the 1-best ASR output and (2) an estimate based on expected term frequencies computed from the lattice. We improve over these baselines by taking advantage of repetition. Whatever the document is about is likely to be repeated, unlike ASR errors, which tend to be more random (Poisson). In addition, we find it helpful to consider an ensemble of language models. There is an opportunity for the ensemble to reduce noise, assuming that the errors across language models are relatively uncorrelated. The opportunity for improvement is larger when WER is high. This paper considers a pairing task application that could benefit from improved estimates of df. The pairing task inputs conversational sides from the English Fisher corpus and outputs estimates of which sides were from the same conversation. Better estimates of df lead to better performance on this task.

Keywords :

document handling; information retrieval; speech processing; 1-best ASR output; English Fisher corpus; information retrieval; inverse document frequency; language models; noise reduction; pairing task application; speech corpus; word error rate; Adaptation models; Computational modeling; Frequency estimation; Lattices; Speech; Training; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on

Conference_Location :

Waikoloa, HI

Print_ISBN :

978-1-4673-0365-1

Electronic_ISBN :

978-1-4673-0366-8

Type :

conf

DOI :

10.1109/ASRU.2011.6163966

Filename :

6163966

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3485826