DocumentCode :
3625854
Title :
Language Modeling For Computer-Aided Transcription
Author :
Cagdas Kayra Akman;Murat Saraclar
Author_Institution :
Elektrik-Elektronik M?hendisli?i B?l?m?, Bo?azi?i ?niversitesi, Bebek, ?stanbul, T?rkiye. kayra.akman@boun.edu.tr
fYear :
2007
fDate :
6/1/2007 12:00:00 AM
Firstpage :
1
Lastpage :
4
Abstract :
Speech recognition and language processing systems require large amounts of transcribed speech corpora. Manual transcription is expensive and slow. Computers may do the same task faster but with more errors. Computer aided transcription is a compromise between these two methods. The output lattices of an ASR engine are manipulated to be used as language models in combination with a letter-based N-gram language model. The combined model is used as the language model of the open source Dasher application. The resulting application allows easy transcription of speech data thanks to the combination of both models at letter level. It is shown that the combined model performs better than both a letter-based N-gram model and models combined at sentence level.
Keywords :
"Application software","Intersymbol interference","Speech recognition","Natural languages","Speech processing","Computer errors","Lattices","Automatic speech recognition","Engines"
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications, 2007. SIU 2007. IEEE 15th
ISSN :
2165-0608
Print_ISBN :
1-4244-0719-2
Type :
conf
DOI :
10.1109/SIU.2007.4298566
Filename :
4298566
Link To Document :
بازگشت