• DocumentCode
    353639
  • Title
    Employing heterogeneous information in a multi-stream framework
  • Author
    Christensen, Heidi ; Lindberg, Borge ; Andersen, Ove
  • Author_Institution
    Center for PersonKommunikation, Aalborg Univ., Denmark
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1571
  • Abstract
    A multi-stream speech recogniser is based on the combination of multiple feature streams, each containing complementary information. In the past, multi-stream research has typically focused on systems that use a single feature extraction method. This heritage from conventional speech recognisers is an unnecessary restriction, and both psychoacoustic and phonetic knowledge strongly motivate the use of heterogeneous features. In this paper we investigate how heterogeneous processing can be used in two different multi-stream configurations: first, a system where each stream handles a different frequency region of the speech (a multi-band recogniser) and, second, a multi-stream recogniser where each stream handles the full frequency region. For each type of system we compare the performance using both homogeneous and heterogeneous processing. We demonstrate that the use of heterogeneous information significantly improves clean speech recognition performance, motivating us to continue exploring more specifically designed stream processing.
  • Keywords
    feature extraction; speech recognition; clean speech recognition; heterogeneous features; heterogeneous information; multi-band recogniser; multi-stream framework; multi-stream speech recogniser; multiple feature stream; performance; Automatic speech recognition; Decoding; Feature extraction; Frequency; Hidden Markov models; Process design; Psychology; Signal processing; Speech processing; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type
    conf
  • DOI
    10.1109/ICASSP.2000.861977
  • Filename
    861977