DocumentCode :
388378
Title :
Automatic labeling of speech
Author :
Spohrer, James C. ; Brown, Peter F. ; Roth, Robert
Author_Institution :
Verbex, Bedford, Mass, U.S.A.
Volume :
7
fYear :
1982
fDate :
30072
Firstpage :
1641
Lastpage :
1644
Abstract :
To evaluate the performance of a speech recognition system, large databases of labeled speech, including various speakers, noise conditions, and vocabularies, are necessary. This paper describes a method for automatically labeling speech data. In the past, speech has been labeled manually, typically by listening to and viewing waveforms through real-time, interactive computer I/O stations. This process is slow and tedious, and accounts for the shortage of large speech databases. The automatic labeling method reported here uses dynamic programming to align a script which is produced as the output of a recognition system, and a known script. The alignment gives a tentative labeling which can be refined by repeating the training, recognition, and alignment processes. The method was used to label a 50 speaker database of 140,000 digits.
Keywords :
Databases; Dynamic programming; Humans; Labeling; Speech analysis; Speech enhancement; Speech processing; Speech recognition; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
Type :
conf
DOI :
10.1109/ICASSP.1982.1171490
Filename :
1171490
Link To Document :
بازگشت