CTIMIT: a speech corpus for the cellular environment with applications to automatic speech recognition

Author

Brown, Kathy L.; George, E. Bryan

Author_Institution

Signal Process. Center of Technol., Lockheed Sanders Avionics, Nashua, NH, USA

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

105

Abstract

This paper reports on techniques used in the generation of a continuous speech, multi-speaker, cellular bandwidth database and describes its application to automatic speech recognition in the cellular environment. CTIMIT (cellular TIMIT) has been generated by transmitting the TIMIT speech database over the cellular network. The CTIMIT database can have widespread applicability in the design and development of speech processing and speech recognition products for the cellular market. The paper describes the preliminary collection of the CTIMIT database and reports on several studies designed to test the utility of the database in a phoneme recognition task. Two HMM-based phoneme recognizers were trained using utterances drawn from the TIMIT database and the CTIMIT database, respectively. Each recognizer was then tested using the test utterances from CTIMIT. Phoneme recognition accuracy for the TIMIT-trained recognizer dropped 58% from its baseline performance on TIMIT test utterances. By comparison, phoneme recognition accuracy of the CTIMIT-trained recognizer increased 82% compared to that of the TIMIT-trained recognizer.

Keywords

cellular radio; hidden Markov models; radio networks; speech processing; speech recognition; CTIMIT; CTIMIT database; CTIMIT-trained recognizer; HMM-based phoneme recognizers; TIMIT database; TIMIT speech database transmission; TIMIT test utterances; TIMIT-trained recognizer; automatic speech recognition; cellular TIMIT; cellular environment; cellular market; cellular network; continuous speech database; multi-speaker cellular bandwidth database; phoneme recognition; phoneme recognition accuracy; speech corpus; speech processing products; speech recognition products; Automatic speech recognition; Bandwidth; Hidden Markov models; Land mobile radio cellular systems; Loudspeakers; Noise cancellation; Spatial databases; Speech enhancement; Speech processing; Speech recognition; Testing

fLanguage

English

Publisher

IEEE

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479284

Filename

479284