DocumentCode :
1155921
Title :
Graph-Based Partial Hypothesis Fusion for Pen-Aided Speech Input
Author :
Liu, Peng ; Soong, Frank K.
Author_Institution :
Microsoft Res. Asia, Beijing
Volume :
17
Issue :
3
fYear :
2009
fDate :
3/1/2009 12:00:00 AM
Firstpage :
478
Lastpage :
485
Abstract :
We study a specific partial hypothesis fusion problem in sequential data labeling. The problem arises in the multimodal applications where a decision is made by merging complete hypothesis from one input and partial hypothesis from the other. For example, in a pen-aided speech interface, appropriate pen input can provide partial but crucial information. We address the problem in a Bayesian framework, and reformulate the solution as a revised search in a representation. A dynamic programming algorithm is proposed to efficiently solve the partial hypothesis fusion via the graph. It is shown that the computational cost of the graph based partial hypothesis fusion is proportional to the size of the graph, which is highly feasible for a given compact graph. We apply the proposed algorithm to two real applications: an intelligent pen-based dictation error correction system and an automatic handwritten character completion with a speech ldquoshortcutrdquo. Experimental results show that the algorithm is effective in utilizing the partial information from one modality to enhance the bimodal interface performance.
Keywords :
Bayes methods; dynamic programming; error correction; graph theory; heuristic programming; natural language processing; speech recognition; user interfaces; Bayesian framework; bimodal interface; dynamic programming algorithm; error correction; partial hypothesis fusion; partial hypothesis fusion problem; pen-aided speech input; Bayesian methods; Computational efficiency; Computational intelligence; Dynamic programming; Error correction; Handwriting recognition; Heuristic algorithms; Labeling; Merging; Speech recognition; Handwriting recognition; multimodal interface; partial hypothesis fusion; speech recognition;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2009.2013409
Filename :
4782043
Link To Document :
بازگشت