Title :
Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition
Author :
Kurata, Gakuto ; Itoh, Nobuyasu ; Nishimura, Masafumi ; Sethy, Abhinav ; Ramabhadran, Bhuvana
Author_Institution :
IBM Res. - Tokyo, Yamato, Japan
Abstract :
Named Entity (NE) recognition from the results of Automatic Speech Recognition (ASR) is challenging because of ASR errors. To detect NEs, one of the options is to use a statistical NE model that is usually trained with ASR one-best results. In order to make NE recognition more robust to ASR errors, we propose using Word Confusion Networks (WCNs), sequences of bundled words, for both NE modeling and recognition by regarding the word bundles as units instead of the independent words. This is done by clustering similar word bundles that may originate from the same word. We trained the NE models with the maximum entropy principle and evaluated the performance using real-life call-center data. The results showed that by using the WCNs, the error of NE recognition was relatively reduced by up to 33.0%.
Keywords :
call centres; maximum entropy methods; speech recognition; ASR; NE recognition; WCN; automatic speech recognition; conversational telephone speech leveraging word confusion networks; maximum entropy principle; named entity recognition; real-life call-center data; word confusion network; Companies; Context; Speech recognition; Training; Conversational Telephone Speech; Maximum Entropy Model; Named Entity Recognition; Word Confusion Network;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947622