DocumentCode :
2180638
Title :
Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition
Author :
Kurata, Gakuto ; Itoh, Nobuyasu ; Nishimura, Masafumi ; Sethy, Abhinav ; Ramabhadran, Bhuvana
Author_Institution :
IBM Res. - Tokyo, Yamato, Japan
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5572
Lastpage :
5575
Abstract :
Named Entity (NE) recognition from the results of Automatic Speech Recognition (ASR) is challenging because of ASR errors. To detect NEs, one of the options is to use a statistical NE model that is usually trained with ASR one-best results. In order to make NE recognition more robust to ASR errors, we propose using Word Confusion Networks (WCNs), sequences of bundled words, for both NE modeling and recognition by regarding the word bundles as units instead of the independent words. This is done by clustering similar word bundles that may originate from the same word. We trained the NE models with the maximum entropy principle and evaluated the performance using real-life call-center data. The results showed that by using the WCNs, the error of NE recognition was relatively reduced by up to 33.0%.
Keywords :
call centres; maximum entropy methods; speech recognition; ASR; NE recognition; WCN; automatic speech recognition; conversational telephone speech leveraging word confusion networks; maximum entropy principle; named entity recognition; real-life call-center data; word confusion network; Companies; Context; Speech recognition; Training; Conversational Telephone Speech; Maximum Entropy Model; Named Entity Recognition; Word Confusion Network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947622
Filename :
5947622
Link To Document :
بازگشت