• DocumentCode
    1322010
  • Title

    Audio Inpainting

  • Author

    Adler, Amir ; Emiya, Valentin ; Jafari, Maria G. ; Elad, Michael ; Gribonval, Rémi ; Plumbley, Mark D.

  • Author_Institution
    Comput. Sci. Dept., Technion - Israel Inst. of Technol., Haifa, Israel
  • Volume
    20
  • Issue
    3
  • fYear
    2012
  • fDate
    3/1/2012 12:00:00 AM
  • Firstpage
    922
  • Lastpage
    932
  • Abstract
    We propose the audio inpainting framework that recovers portions of audio data distorted due to impairments such as impulsive noise, clipping, and packet loss. In this framework, the distorted data are treated as missing and their location is assumed to be known. The signal is decomposed into overlapping time-domain frames and the restoration problem is then formulated as an inverse problem per audio frame. Sparse representation modeling is employed per frame, and each inverse problem is solved using the Orthogonal Matching Pursuit algorithm together with a discrete cosine or a Gabor dictionary. The Signal-to-Noise Ratio performance of this algorithm is shown to be comparable or better than state-of-the-art methods when blocks of samples of variable durations are missing. We also demonstrate that the size of the block of missing samples, rather than the overall number of missing samples, is a crucial parameter for high quality signal restoration. We further introduce a constrained Matching Pursuit approach for the special case of audio declipping that exploits the sign pattern of clipped audio samples and their maximal absolute value, as well as allowing the user to specify the maximum amplitude of the signal. This approach is shown to outperform state-of-the-art and commercially available methods for audio declipping in terms of Signal-to-Noise Ratio.
  • Keywords
    audio signal processing; dictionaries; impulse noise; iterative methods; signal restoration; Gabor dictionary; audio data; audio declipping; audio inpainting; constrained matching pursuit; discrete cosine; impulsive noise; orthogonal matching pursuit algorithm; packet loss; restoration problem; sign pattern; signal-to-noise ratio; sparse representation modeling; time-domain frame; Distortion; Image restoration; Matching pursuit algorithms; Reliability; Speech; Time domain analysis; Time frequency analysis; Clipping; inpainting; matching pursuit (MP); sparse representation (SR);
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2168211
  • Filename
    6020748