DocumentCode
2866993
Title
CTIMIT: a speech corpus for the cellular environment with applications to automatic speech recognition
Author
Brown, Kathy L. ; George, E. Bryan
Author_Institution
Signal Process. Center of Technol., Lockheed Sanders Avionics, Nashua, NH, USA
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
105
Abstract
This paper reports on techniques used in the generation of a continuous speech, multi-speaker, cellular bandwidth database and describes its application to automatic speech recognition in the cellular environment. CTIMIT (cellular TIMIT) has been generated by transmitting the TIMIT speech database over the cellular network. The CTIMIT database can have widespread applicability in the design and development of speech processing and speech recognition products for the cellular market. The paper describes the preliminary collection of the CTIMIT database and reports on several studies designed to test the utility of the database in a phoneme recognition task. Two HMM-based phoneme recognizers were trained using utterances drawn from the TIMIT database and the CTIMIT database, respectively. Each recognizer was then tested using the test utterances from CTIMIT. Phoneme recognition accuracy for the TIMIT-trained recognizer dropped 58% from its baseline performance on TIMIT test utterances. By comparison, phoneme recognition accuracy of the CTIMIT-trained recognizer increased 82% compared to that of the TIMIT-trained recognizer.
Keywords
cellular radio; hidden Markov models; radio networks; speech processing; speech recognition; CTIMIT; CTIMIT database; CTIMIT-trained recognizer; HMM-based phoneme recognizers; TIMIT database; TIMIT speech database transmission; TIMIT test utterances; TIMIT-trained recognizer; automatic speech recognition; cellular TIMIT; cellular environment; cellular market; cellular network; continuous speech database; multi-speaker cellular bandwidth database; phoneme recognition; phoneme recognition accuracy; speech corpus; speech processing products; speech recognition products; Automatic speech recognition; Bandwidth; Hidden Markov models; Land mobile radio cellular systems; Loudspeakers; Noise cancellation; Spatial databases; Speech enhancement; Speech processing; Speech recognition; Testing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479284
Filename
479284