• DocumentCode
    69508
  • Title

    Musical Instrument Recognition in Polyphonic Audio Using Missing Feature Approach

  • Author

    Giannoulis, Dimitrios ; Klapuri, Anssi

  • Author_Institution
    Sch. of Electron. Eng. & Comput. Sci., Queen Mary Univ. of London, London, UK
  • Volume
    21
  • Issue
    9
  • fYear
    2013
  • fDate
    Sept. 2013
  • Firstpage
    1805
  • Lastpage
    1817
  • Abstract
    A method is described for musical instrument recognition in polyphonic audio signals where several sound sources are active at the same time. The proposed method is based on local spectral features and missing-feature techniques. A novel mask estimation algorithm is described that identifies spectral regions that contain reliable information for each sound source, and bounded marginalization is then used to treat the feature vector elements that are determined to be unreliable. The mask estimation technique is based on the assumption that the spectral envelopes of musical sounds tend to be slowly-varying as a function of log-frequency and unreliable spectral components can therefore be detected as positive deviations from an estimated smooth spectral envelope. A computationally efficient algorithm is proposed for marginalizing the mask in the classification process. In simulations, the proposed method clearly outperforms reference methods for mixture signals. The proposed mask estimation technique leads to a recognition accuracy that is approximately half-way between a trivial all-one mask (all features are assumed reliable) and an ideal “oracle” mask.
  • Keywords
    audio signal processing; feature extraction; musical instruments; bounded marginalization; feature vector elements; local spectral features; log-frequency function; mask estimation technique; missing feature approach; mixture signals; musical instrument recognition; musical sounds; polyphonic audio signals; positive deviations; sound source; spectral envelopes; spectral regions; unreliable spectral components; Estimation; Instruments; Interference; Music; Reliability; Speech; Vectors; Audio signal processing; bounded marginalization; harmonic sound; missing data techniques; musical instrument recognition;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2248720
  • Filename
    6470658