• DocumentCode
    2555669
  • Title

    An Efficient Voice Transcription Scheme for Music Retrieval

  • Author

    Han, Byeong-jun ; Rho, Seungmin ; Hwang, Eenjun

  • Author_Institution
    Korea Univ., Seoul
  • fYear
    2007
  • fDate
    26-28 April 2007
  • Firstpage
    366
  • Lastpage
    371
  • Abstract
    In this paper, we propose a new scheme for automatically transcribing sung or hummed queries into a sequence of pitch and duration pairs for efficient music retrieval. More specifically, we present two novel methods: WAE (windowed average energy) for note segmentation, and a dynamic threshold method for ADF onsets for onset/offset detection in acoustic signals. The former improves on previous energy-based approaches such as AE by defining small but coherent windows with local and global threshold values. The latter improves on the traditional global/local threshold method. Through various experiments on our prototype music retrieval system, we show the effectiveness of the proposed scheme.
  • Keywords
    acoustic signal processing; information retrieval; music; acoustic signal; dynamic threshold method; efficient voice transcription scheme; hummed queries; music retrieval; note segmentation; onset/offset detection; windowed average energy; Acoustic signal detection; Educational institutions; Frequency; Information technology; Music information retrieval; Prototypes; Query processing; Signal detection; Speech; Time domain analysis;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Ubiquitous Engineering, 2007. MUE '07. International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    0-7695-2777-9
  • Type

    conf

  • DOI
    10.1109/MUE.2007.72
  • Filename
    4197301