Title :
CTIMIT: a speech corpus for the cellular environment with applications to automatic speech recognition
Author :
Brown, Kathy L. ; George, E. Bryan
Author_Institution :
Signal Process. Center of Technol., Lockheed Sanders Avionics, Nashua, NH, USA
Abstract :
This paper reports on techniques used in the generation of a continuous speech, multi-speaker, cellular bandwidth database and describes its application to automatic speech recognition in the cellular environment. CTIMIT (cellular TIMIT) has been generated by transmitting the TIMIT speech database over the cellular network. The CTIMIT database can have widespread applicability in the design and development of speech processing and speech recognition products for the cellular market. It describes the preliminary collection of the CTIMIT database and reports on several studies designed to test the utility of the database in a phoneme recognition task. Two HMM-based phoneme recognizers were trained using utterances drawn from the TIMIT database and the CTIMIT database, respectively. Each recognizer was then tested using the test utterances from CTIMIT. Phoneme recognition accuracy for the TIMIT-trained recognizer dropped 58% from its baseline performance on TIMIT test utterances. By comparison, phoneme recognition accuracy of the CTIMIT-trained recognizer increased 82% compared to that of the TIMIT-trained recognizer
Keywords :
cellular radio; hidden Markov models; radio networks; speech processing; speech recognition; CTIMIT; CTIMIT database; CTIMIT-trained recognizer; HMM-based phoneme recognizers; TIMIT database; TIMIT speech database transmission; TIMIT test utterances; TIMIT-trained recognizer; automatic speech recognition; cellular TIMIT; cellular environment; cellular market; cellular network; continuous speech database; multi-speaker cellular bandwidth database; phoneme recognition; phoneme recognition accuracy; speech corpus; speech processing products; speech recognition products; Automatic speech recognition; Bandwidth; Hidden Markov models; Land mobile radio cellular systems; Loudspeakers; Noise cancellation; Spatial databases; Speech enhancement; Speech processing; Speech recognition; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479284