• DocumentCode
    764946
  • Title

    A masking-threshold-adapted weighting filter for excitation search

  • Author

    Chang, Wen-Whei ; Wang, Chin-Tun

  • Author_Institution
    Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
  • Volume
    4
  • Issue
    2
  • fYear
    1996
  • fDate
    3/1/1996 12:00:00 AM
  • Firstpage
    124
  • Lastpage
    132
  • Abstract
    Most LPC-based audio coders improve the reproduction quality by using predictor coefficients to embody perceptual masking in noise spectral shaping. Since the predictor coefficients were originally derived to characterize sound production models, they cannot precisely describe the human ear´s nonlinear responses to frequency and loudness. We report on new approaches to exploiting the masking threshold in the design of a perceptual noise-weighting filter for excitation searches. To track the nonstationary evolution of a masking threshold, an autoregressive spectral analysis with finite order has been shown to be capable of providing sufficient accuracy. In seeking a faster response, an artificial neural network was also trained to extract autoregressive modeling parameters of the masking threshold from typical audio signals via mapping. Furthermore, we propose the concept of sinusoidal excitation representation to better track the intrinsic characteristics of prediction error signals. Simulation results indicate that the combined use of a multisinusoid excitation model and a masking-threshold-adapted weighting filter allows the implementation of an LPC-based audio coder that delivers near transparent quality at the rate of 96 kb/s
  • Keywords
    adaptive filters; adaptive signal processing; audio coding; autoregressive processes; filtering theory; hearing; linear predictive coding; multilayer perceptrons; search problems; signal representation; spectral analysis; 96 kbit/s; LPC based audio coders; artificial neural network; audio signals; autoregressive modeling parameters; autoregressive spectral analysis; excitation search; frequency; human ear; loudness; masking threshold adapted weighting filter; multisinusoid excitation model; noise spectral shaping; nonlinear responses; nonstationary evolution; perceptual masking; perceptual noise weighting filter; prediction error signals; predictor coefficients; reproduction quality; simulation results; sinusoidal excitation representation; sound production models; Acoustic noise; Artificial neural networks; Filters; Frequency; Humans; Masking threshold; Noise shaping; Predictive models; Production; Spectral analysis;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.486062
  • Filename
    486062