Title :
Computer-assisted transcription of speech based on confusion network reordering
Author :
Laurent, Antoine ; Meignier, Sylvain ; Merlin, Teva ; Deléglise, Paul
Author_Institution :
Comput. Sci. Res. Center, Univ. du Maine, Le Mans, France
Abstract :
Large vocabulary automatic speech recognition (ASR) technologies perform well in known and controlled contexts. In less controlled conditions, however, human review is often necessary to check and correct the results of such systems so that the ASR output is understandable. We propose a method for computer-assisted transcription of speech, based on automatic reordering of confusion networks. The method is evaluated in terms of KSR (Keystroke Saving Rate) and WSR (Word Stroke Ratio). It significantly reduces the number of actions needed to correct ASR outputs. WSR computed before and after every network reordering shows a gain of about 17.7% (3.4 points).
Keywords :
speech recognition; ASR; KSR; WSR; computer-assisted transcription; confusion network reordering; human review; keystroke saving rate; large vocabulary automatic speech recognition technology; word stroke ratio; Computational modeling; Computers; Lattices; Manuals; Mathematical model; Speech; Automatic correction; Cache models; Confusion network
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947450