DocumentCode :
2769797
Title :
Advances in Arabic broadcast news transcription at RWTH
Author :
Rybach, David ; Hahn, Stefan ; Gollan, Christian ; Schlüter, Ralf ; Ney, Hermann
Author_Institution :
RWTH Aachen Univ., Aachen
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
449
Lastpage :
454
Abstract :
This paper describes the RWTH speech recognition system for Arabic. Several design aspects of the system, including cross-adaptation, multiple system design and combination, are analyzed. We summarize the semi-automatic lexicon generation for Arabic using a statistical approach to grapheme-to-phoneme conversion and pronunciation statistics. Furthermore, a novel ASR-based audio segmentation algorithm is presented. Finally, we discuss practical approaches for parallelized acoustic training and memory efficient lattice rescoring. Systematic results are reported on recent GALE evaluation corpora.
Keywords :
audio signal processing; broadcasting; natural language processing; speech recognition; statistical analysis; Arabic broadcast news transcription; GALE evaluation corpora; RWTH speech recognition system; audio segmentation algorithm; global autonomous language exploitation; grapheme-to-phoneme conversion; memory efficient lattice rescoring; parallelized acoustic training; pronunciation statistics; semi automatic lexicon generation; Broadcasting; Cepstral analysis; Hidden Markov models; Humans; Lattices; Loudspeakers; Mel frequency cepstral coefficient; Natural languages; Neural networks; Speech recognition; Audio Segmentation; Cross-Adaptation; Speech Recognition; System Combination;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430154
Filename :
4430154
Link To Document :
بازگشت