Automatic Transcription for Music with Two Timbres from Monaural Sound Source

Author

Wang, Yuh-Shyang ; Hu, Ting-Yao ; Jeng, Shyh-Kang

Author_Institution

Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan

fYear

2010

fDate

13-15 Dec. 2010

Firstpage

314

Lastpage

317

Abstract

A new approach to automatic music transcription for music with two timbres from a monaural sound source is proposed in this paper. The system is mainly composed of two parts, a fundamental frequency detector and a timbre discriminator. In the fundamental frequency detector, the short time Fourier transform (STFT) and a peak detection algorithm are used. By combining the relative magnitude of the fundamental frequency and its harmonics, a characteristic timbre vector is computed. The timbre discrimination is done by the classification of the timbre vectors with the support vector machine (SVM). Particularly, by designing the training and classification procedure in SVM, the mixed-timbre signal can also be classified in this system. Using a small database of polyphonic music of two timbres as the testing input, a 73% hit rate is achieved by this system.

Keywords

Fourier transforms; audio signal processing; multimedia computing; music; pattern classification; support vector machines; automatic music transcription; frequency detector; mixed timbre signal; monaural sound source; multimedia content analysis; peak detection; polyphonic music database; short time Fourier transform; support vector machine; timbre discriminator; timbre vector; applications; multimedia content analysis;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia (ISM), 2010 IEEE International Symposium on

Conference_Location

Taichung

Print_ISBN

978-1-4244-8672-4

Electronic_ISBN

978-0-7695-4217-1

Type

conf

DOI

10.1109/ISM.2010.54

Filename

5693859