• DocumentCode
    2204681
  • Title

    Automatic Transcription for Music with Two Timbres from Monaural Sound Source

  • Author

    Wang, Yuh-Shyang ; Hu, Ting-Yao ; Jeng, Shyh-Kang

  • Author_Institution
    Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2010
  • fDate
    13-15 Dec. 2010
  • Firstpage
    314
  • Lastpage
    317
  • Abstract
    A new approach to automatic music transcription for music with two timbres from a monaural sound source is proposed in this paper. The system is mainly composed of two parts, a fundamental frequency detector and a timbre discriminator. In the fundamental frequency detector, the short time Fourier transform (STFT) and a peak detection algorithm are used. By combining the relative magnitude of the fundamental frequency and its harmonics, a characteristic timbre vector is computed. The timbre discrimination is done by the classification of the timbre vectors with the support vector machine (SVM). Particularly, by designing the training and classification procedure in SVM, the mixed-timbre signal can also be classified in this system. Using a small database of polyphonic music of two timbres as the testing input, a 73% hit rate is achieved by this system.
  • Keywords
    Fourier transforms; audio signal processing; multimedia computing; music; pattern classification; support vector machines; automatic music transcription; frequency detector; mixed timbre signal; monaural sound source; multimedia content analysis; peak detection; polyphonic music database; short time Fourier transform; support vector machine; timbre discriminator; timbre vector; applications; multimedia content analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia (ISM), 2010 IEEE International Symposium on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-1-4244-8672-4
  • Electronic_ISBN
    978-0-7695-4217-1
  • Type

    conf

  • DOI
    10.1109/ISM.2010.54
  • Filename
    5693859