• DocumentCode
    312047
  • Title

    Automatic transcription of general audio data: preliminary analyses

  • Author

    Spina, Michelle S. ; Zue, Victor W.

  • Author_Institution
    Lab. for Comput. Sci., MIT, Cambridge, MA, USA
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    594
  • Abstract
    The task of automatically transcribing general audio data is very different from the transcription task typically required of current automatic speech recognition systems. The general goal of this work is to quantify the difficult issues posed by such data, thus leading to an understanding of how a speech recognition system may have to be altered to accommodate the added complexities. Specifically, we describe some preliminary analyses and experiments we have conducted on data collected from a radio news program. We found that using relatively straightforward acoustic measurements and classification techniques, we were able to achieve better than 80% classification accuracy for seven salient sound classes present in the data, and nearly 94% classification accuracy for a speech/non-speech decision. In addition, lexical analysis revealed that while the vocabulary size of a single broadcast is moderate, it grows exponentially as more shows are added
  • Keywords
    acoustic variables measurement; audio acoustics; pattern classification; radio broadcasting; speech recognition; telecommunication computing; vocabulary; acoustic measurements; added complexities; automatic speech recognition systems; automatic transcription; classification accuracy; general audio data; lexical analysis; radio broadcast; radio news programme; radio shows; salient sound classes; speech/nonspeech decision; vocabulary size; Acoustic measurements; Automatic speech recognition; Computer science; Data analysis; Laboratories; Loudspeakers; Natural languages; Radio broadcasting; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607431
  • Filename
    607431