Title :
Manual Transcription of Conversational Speech at the Articulatory Feature Level
Author :
Livescu, Karen ; Bezman, A. ; Borges, M. ; Yung, L. ; Cetin, Omer ; Frankel, Jorg ; King, Simon ; Magimai-Doss ; Xuemin Xhi ; Lavoie, L.
Author_Institution :
MIT, MA, USA
Abstract :
We present an approach for the manual labeling of speech at the articulatory feature level, and a new set of labeled conversational speech collected using this approach. A detailed transcription, including overlapping or reduced gestures, is useful for studying the great pronunciation variability in conversational speech. It also facilitates the testing of feature classifiers, such as those used in articulatory approaches to automatic speech recognition. We describe an effort to transcribe a small set of utterances drawn from the Switchboard database using eight articulatory tiers. Two transcribers have labeled these utterances in a multi-pass strategy, allowing for correction of errors. We describe the data collection methods and analyze the data to determine how quickly and reliably this type of transcription can be done. Finally, we demonstrate one use of the new data set by testing a set of multilayer perceptron feature classifiers against both the manual labels and forced alignments.
Keywords :
feature extraction; multilayer perceptrons; speech processing; speech recognition; articulatory feature level; automatic speech recognition; conversational speech; data collection methods; manual transcription; multilayer perceptron feature classifiers; overlapping gestures; pronunciation variability; reduced gestures; Automatic speech recognition; Automatic testing; Data analysis; Educational institutions; Error correction; Labeling; Multilayer perceptrons; Spatial databases; Speech analysis; Speech recognition; Speech analysis; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367229