• DocumentCode
    3059699
  • Title

    How humans perform on a connected-digits data base

  • Author

    Pols, Louis C W

  • Author_Institution
    Institute for Perception TNO, Soesterberg, The Netherlands
  • Volume
    7
  • fYear
    1982
  • fDate
    30072
  • Firstpage
    867
  • Lastpage
    870
  • Abstract
    Participating members of the international NATO Research Study Group RSG-10 on Speech Processing are presently using a data base of connected digits, spoken in different languages, to facilitate comparison of (connected) word recognition systems in the various countries. In order to be able to refer "system" results to human performance, we executed a listening experiment with a representative subset of the same recordings of connected digits. Four Dutch subjects listened to connected 3-to-5 digit groups, as well as to isolated digits, spoken in English and in Dutch. The English material was spoken by 4 native and 6 nonnative speakers of English. Apart from an undisturbed condition, subjects also identified the digit sequences in two noise conditions with speech-to-noise ratios of -3 and -9 dB. At SNR = -3 dB the listeners still do an excellent job. There is substantial speaker variation, but no systematic effect of language, sex, or native vs non-native speakers. Subjects showed a prolonged learning effect, and were especially sensitive to tempo under more difficult (noisy) listening conditions.
  • Keywords
    Automatic speech recognition; Error analysis; Feedback; Humans; Natural languages; Signal to noise ratio; Speech analysis; Speech recognition; System testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1982.1171874
  • Filename
    1171874