Title :
Turkish Speech Recognition Software with Adaptable Language Model
Author :
Osman Buyuk;Ali Haznedaroglu;Levent M. Arslan
Author_Institution :
Elektrik ve Elektronik M?hendisligi B?l?m?, Bogazi?i ?niversitesi, 34342, Bebek, ?stanbul. osman.buyuk@sestek.com.tr
fDate :
6/1/2007 12:00:00 AM
Abstract :
Turkish speech recognition studies have been accelerated recently. With these efforts, not only available speech and text corpus which can be used in recognition experiments but also proposed new methods to improve accuracy has increased. Agglutinative nature of Turkish causes out of vocabulary (OOV) problem in Large Vocabulary Continuous Speech Recognition (LVCSR) tasks. In order to overcome OOV problem, usage of sub-word units has been proposed. In addition to LVCSR experiments, there have been some efforts to implement a speech recognizer in limited domains such as radiology. In this paper, we will present Turkish speech recognition software, which has been developed by utilizing recent studies. Both interface of software and recognition accuracies in two different test sets will be summarized. The performance of software has been evaluated using radiology and large vocabulary test sets. In order to solve OOV problem practically, we propose to adapt language models using frequent words or sentences. In recognition experiments, 90% and 44% word accuracies have been achieved in radiology and large vocabulary test sets respectively.
Keywords :
"Speech recognition","Natural languages","Vocabulary","Radiology","Performance evaluation","Software testing","Acceleration","Text recognition","Software performance"
Conference_Titel :
Signal Processing and Communications Applications, 2007. SIU 2007. IEEE 15th
Print_ISBN :
1-4244-0719-2
DOI :
10.1109/SIU.2007.4298561