• DocumentCode
    2866993
  • Title
    CTIMIT: a speech corpus for the cellular environment with applications to automatic speech recognition
  • Author
    Brown, Kathy L. ; George, E. Bryan
  • Author_Institution
    Signal Process. Center of Technol., Lockheed Sanders Avionics, Nashua, NH, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    105
  • Abstract
    This paper reports on techniques used to generate a continuous-speech, multi-speaker, cellular-bandwidth database and describes its application to automatic speech recognition in the cellular environment. CTIMIT (cellular TIMIT) was generated by transmitting the TIMIT speech database over the cellular network. The CTIMIT database can have widespread applicability in the design and development of speech processing and speech recognition products for the cellular market. The paper describes the preliminary collection of the CTIMIT database and reports on several studies designed to test the utility of the database in a phoneme recognition task. Two HMM-based phoneme recognizers were trained using utterances drawn from the TIMIT database and the CTIMIT database, respectively. Each recognizer was then tested using the test utterances from CTIMIT. Phoneme recognition accuracy for the TIMIT-trained recognizer dropped 58% from its baseline performance on TIMIT test utterances. By comparison, phoneme recognition accuracy of the CTIMIT-trained recognizer increased 82% compared to that of the TIMIT-trained recognizer.
  • Keywords
    cellular radio; hidden Markov models; radio networks; speech processing; speech recognition; CTIMIT; CTIMIT database; CTIMIT-trained recognizer; HMM-based phoneme recognizers; TIMIT database; TIMIT speech database transmission; TIMIT test utterances; TIMIT-trained recognizer; automatic speech recognition; cellular TIMIT; cellular environment; cellular market; cellular network; continuous speech database; multi-speaker cellular bandwidth database; phoneme recognition; phoneme recognition accuracy; speech corpus; speech processing products; speech recognition products; Automatic speech recognition; Bandwidth; Hidden Markov models; Land mobile radio cellular systems; Loudspeakers; Noise cancellation; Spatial databases; Speech enhancement; Speech processing; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type
    conf
  • DOI
    10.1109/ICASSP.1995.479284
  • Filename
    479284