Monophonic sound source separation by non-negative sparse autoencoders

Author

Zen, Kartinah ; Suzuki, M. ; Sato, Hikaru ; Oyama, Shinya ; Kurihara, Masazumi

Author_Institution

Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan

fYear

2014

fDate

5-8 Oct. 2014

Firstpage

3623

Lastpage

3626

Abstract

Monophonic sound source separation is an essential subject on the fields where sound, such as voice, music and noise, is dealt with. In particular, unsupervised approaches to this problem have high versatility in comparison with supervised approaches. Non-negative matrix factorization is the most frequently used algorithm for the monophonic sound source separation without prior knowledge. This is also applied to various applications, including data clustering, face recognition, gene expression classification. However, non-negative matrix factorization cannot be efficiently used in online learning. In order to solve this difficulty, the non-negative sparse autoencoder was proposed in the literature. Although several successful applications have been reported, this is not yet applied to the monophonic sound source separation. This paper shows that the non-negative sparse autoencoder can perform the monophonic sound source separation without prior knowledge in online learning.

Keywords

audio signal processing; matrix decomposition; source separation; unsupervised learning; monophonic sound source separation; nonnegative matrix factorization; nonnegative sparse autoencoders; online learning; unsupervised learning; Artificial neural networks; IP networks; Image reconstruction; Neurons; Source separation; Sparse matrices; Vectors; NMF; monophonic sound source separation; non-negative sparse autoencoder; online learning; unsupervised learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on

Conference_Location

San Diego, CA

Type

conf

DOI

10.1109/SMC.2014.6974492

Filename

6974492