• DocumentCode
    3330205
  • Title

    Modeling sub-band correlation for noise-robust speech recognition

  • Author

    McAuley, James ; Ming, Ji ; Hanna, Philip ; Stewart, Darryl

  • Author_Institution
    Sch. of Comput. Sci., Queen´´s Univ., Belfast, UK
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    The paper investigates the effect of modeling sub-band correlation for noisy speech recognition. Sub-band data streams are assumed to be independent in many sub-band based speech recognition systems. However, the structure and operation of the human vocal tract suggests this assumption is unrealistic. A novel method is proposed to incorporate correlation into sub-band speech feature streams. All possible combinations of sub-bands are created and each combination is treated as a single frequency band by calculating a single feature vector for it. The resulting feature vectors capture information about every band in the combination as well as the dependency across the bands. Experiments conducted on the TIDigits database demonstrate significantly improved robustness in comparison to an independent sub-band system in the presence of both stationary and non-stationary noise.
  • Keywords
    acoustic noise; cepstral analysis; channel bank filters; correlation methods; discrete cosine transforms; random noise; speech recognition; DCT; TIDigits database; discrete cosine transform; feature vector; filter-bank cepstral analysis; human vocal tract; noise-robust speech recognition; nonstationary noise; speech feature streams; stationary noise; sub-band correlation; Cepstral analysis; Computer science; Discrete cosine transforms; Feature extraction; Frequency; Human voice; Noise robustness; Spatial databases; Speaker recognition; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326161
  • Filename
    1326161