• DocumentCode
    134258
  • Title

    Spectral patch based sparse coding for acoustic event detection

  • Author

    Xugang Lu ; Yu Tsao ; Peng Shen ; Hori, Chiori

  • Author_Institution
    Nat. Inst. of Inf. & Commun. Technol., Koganei, Japan
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    317
  • Lastpage
    320
  • Abstract
    In most algorithms for acoustic event detection (AED), frame based acoustic representations are used in acoustic modeling. Due to lack of context information in feature representation, large model confusions may occur during modeling. We have proposed a feature learning and representation algorithm to explore context information from temporal-frequency patches of signal for AED. With the algorithm, a sparse feature was extracted based on an acoustic dictionary composed of a bag of spectral patches. In our previous algorithm, the feature was obtained based on a definition of Euclidian distance between input signal and acoustic dictionary. In this study, we formulate the sparse feature extraction as l1 regularization in signal reconstruction. The sparsity of the representation is efficiently controlled via varying a regularization parameter. A support vector machine (SVM) classifier was built on the extracted sparse feature for AED. Our experimental results showed that the spectral patch based sparse representation effectively improved the performance by incorporating temporal-frequency context information in modeling.
  • Keywords
    acoustic signal detection; signal reconstruction; speech processing; support vector machines; Euclidian distance; SVM classifier; acoustic dictionary; acoustic event detection; feature learning; feature representation; frame based acoustic representation; regularization parameter; signal reconstruction; sparse coding; sparse feature extraction; spectral patch; support vector machine; temporal-frequency context information; temporal-frequency patches; Acoustics; Dictionaries; Encoding; Event detection; Feature extraction; Hidden Markov models; Support vector machines; Acoustic event detection; sparse coding; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936651
  • Filename
    6936651