• DocumentCode
    542216
  • Title

    Porting an audio partitioner across domains

  • Author

    Cettolo, Mauro

  • Author_Institution
    ITC-irst, Centro per la Ricerca Scientifica e Tecnologica, I-38010 Povo di Trento - Italy
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    Partitioning an audio stream means to segment It In acoustically homogeneous chunks, classify segments into acoustic classes, and cluster speech segments. The process represents the earliest stage of automatic transcription stations, since it allows to filter out portions of the audio not containing speech and to improve recognition accuracy through the use of condition-dependent acoustic models and adaptation techniques. Hence, when transcription systems are applied to new domains, the process of porting involves the partitioner module too. In this work, the porting of the partitioner of the ITC-irst broadcast news transcription system to the domain of historical films is described in detail and experimentally evaluated. Moreover, a new technique that makes the porting easier for the automatic estimation of the working point of the BIC-based segmentation algorithm is introduced.
  • Keywords
    Argon; Computational modeling; Manuals; Niobium; Silicon carbide; Three dimensional displays; Wideband;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743714
  • Filename
    5743714