• DocumentCode
    2177516
  • Title

    Linear Predictive perceptual filtering for Acoustic Vector Sensors: Exploiting directional recordings for high quality speech enhancement

  • Author

    Shujau, M. ; Ritz, C.H. ; Burnett, I.S.

  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5068
  • Lastpage
    5071
  • Abstract
    This paper investigates the performance of a new technique for speech enhancement which combines Linear Predictive (LP) spectrum-based perceptual filtering to the recordings obtained from an Acoustic Vector Sensor (AVS). The technique takes advantage of the directional polar responses of the AVS to obtain a significantly more accurate representation of the LP spectrum of a target speech signal in the presence of noise when compared to single channel, omni-directional recordings. Comparisons between the speech quality obtained from the proposed technique and existing beamforming-based speech enhancement techniques for the AVS are made through Perceptual Evaluation of Speech Quality (PESQ) tests and Mean Opinion Score (MOS) listening tests. Results show significant improvements in PESQ and MOS scores of 0.2 and 1.6, respectively, for the proposed enhancement technique. Being based on a miniature microphone array, the approach is particular suitable for hands free communication applications in mobile telephony.
  • Keywords
    array signal processing; filtering theory; microphone arrays; mobile radio; radiotelephony; sensor arrays; speech enhancement; AVS; LP spectrum-based perceptual filtering; MOS listening test; PESQ test; acoustic vector sensor; beamforming-based speech enhancement technique; directional polar response; directional recording; linear predictive spectrum-based perceptual filtering; mean opinion score listening test; miniature microphone array; mobile telephony; omnidirectional recording; perceptual evaluation of speech quality test; speech quality; target speech signal; Arrays; Microphones; Sensors; Signal to noise ratio; Speech; Speech enhancement; Acoustic Vector Sensors; Linear Prediction; Speech Coding; Speech Enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947496
  • Filename
    5947496