• DocumentCode
    1135508
  • Title

    Subband Correlation and Robust Speech Recognition

  • Author

    McAuley, James ; Ming, Ji ; Stewart, Darryl ; Hanna, Philip

  • Author_Institution
    Sch. of Comput. Sci., Queen´´s Univ. of Belfast, UK
  • Volume
    13
  • Issue
    5
  • fYear
    2005
  • Firstpage
    956
  • Lastpage
    964
  • Abstract
    This paper investigates the effect of modeling subband correlation for noisy speech recognition. Subband feature streams are assumed to be independent in many subband-based speech recognition systems. However, speech recognition experimental results suggest this assumption is unrealistic. In this paper, a method is proposed to incorporate correlation into subband speech feature streams. In the proposed method, all possible combinations of subbands are created and each combination is treated as a single frequency-band by calculating a single feature vector for it. The resulting feature vectors, therefore, capture information about every band in the combination, as well as the dependency across the bands. Although using the new features results in a higher computational complexity, our experimental results show that they effectively capture the correlation between the subbands while making minimal assumptions about the structure of the correlation. Experiments are conducted on the TIDigits database. The results demonstrate improved accuracy for clean speech recognition and improved robustness in the presence of both stationary and nonstationary band-selective noise, in comparison to a system assuming subband independence.
  • Keywords
    computational complexity; correlation methods; speech recognition; TIDigits database; computational complexity; noisy speech recognition; nonstationary band-selective noise; single feature vector; single frequency-band; subband correlation; Cepstral analysis; Computational complexity; Data mining; Degradation; Feature extraction; Frequency; Hidden Markov models; Noise robustness; Spatial databases; Speech recognition; Correlation; noise robustness; speech recognition; subband;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2005.851952
  • Filename
    1495477