DocumentCode :
417253
Title :
Lightly supervised and data-driven approaches to Mandarin broadcast news transcription
Author :
Chen, Berlin ; Kuo, Jen-Wei ; Tsai, Wen-Hung
Author_Institution :
Graduate Inst. of Comput. Sci. & Inf. Eng., Nat. Taiwan Normal Univ., Taipei, Taiwan
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
This paper investigates the use of several lightly supervised and data-driven approaches to Mandarin broadcast news transcription. First, with a consideration of the special structural properties of the Chinese language, a fast acoustic took-ahead technique for estimating the unexplored part of speech utterance was integrated into the lexical tree search to improve the search efficiency, in conjunction with the conventional language model look-ahead technique. Then, a verification-based method for automatic acoustic training data acquisition was developed to make use of the large amount of untranscribed speech data. Finally, two alternative strategies for language model adaptation were further studied for accurate language model estimation. With the above approaches, the system yielded an 11.94% character error rate on the Mandarin broadcast news collected in Taiwan.
Keywords :
adaptive estimation; error statistics; feature extraction; speech recognition; tree searching; Chinese language; Mandarin broadcast news transcription; Taiwan; acoustic took-ahead technique; automatic acoustic training data acquisition; character error rate; data-driven transcription; language model adaptation; language model estimation; language model look-ahead technique; lexical tree search; lightly supervised transcription; search efficiency; speech utterance; structural properties; verification-based method; Adaptation model; Cepstral analysis; Linear discriminant analysis; Multimedia communication; Natural languages; Prototypes; Radio broadcasting; Speech recognition; TV broadcasting; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326101
Filename :
1326101
Link To Document :
بازگشت