مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised Training for Mandarin Broadcast News and Conversation Transcription

DocumentCode :

2701692

Title :

Unsupervised Training for Mandarin Broadcast News and Conversation Transcription

Author :

Wang, Lingfeng ; Gales, Mark J.F. ; Woodland, Philip C.

Author_Institution :

Dept. of Eng., Cambridge Univ., UK

Volume :

fYear :

2007

fDate :

15-20 April 2007

Abstract :

A significant cost in obtaining acoustic training data is the generation of accurate transcriptions. For some sources close-caption data is available. This allows the use of lightly-supervised training techniques. However, for some sources and languages close-caption is not available. In these cases unsupervised training techniques must be used. This paper examines the use of unsupervised techniques for discriminative training. In unsupervised training automatic transcriptions from a recognition system are used for training. As these transcriptions may be errorful data selection may be useful. Two forms of selection are described, one to remove non-target language shows, the other to remove segments with low confidence. Experiments were carried out on a Mandarin transcriptions task. Two types of test data were considered, broadcast news (BN) and broadcast conversations (BC). Results show that the gains from unsupervised discriminative training are highly dependent on the accuracy of the automatic transcriptions.

Keywords :

learning (artificial intelligence); speech processing; speech recognition; Mandarin broadcast news; Mandarin transcriptions task; broadcast conversations; conversation transcription; discriminative training; errorful data selection; lightly-supervised training techniques; unsupervised training; Acoustic signal detection; Acoustical engineering; Costs; Data engineering; Lattices; Maximum likelihood estimation; Natural languages; Radio broadcasting; Speech recognition; TV broadcasting; Speech Recognition; unsupervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location :

Honolulu, HI

ISSN :

1520-6149

Print_ISBN :

1-4244-0727-3

Type :

conf

DOI :

10.1109/ICASSP.2007.366922

Filename :

4218110

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2701692