• DocumentCode
    1860090
  • Title
    A multimodal approach to music transcription
  • Author
    Paleari, Marco ; Huet, Benoit ; Schutz, Antony ; Slock, Dirk
  • Author_Institution
    Multimedia Dept., Eurecom Inst., Sophia Antipolis
  • fYear
    2008
  • fDate
    12-15 Oct. 2008
  • Firstpage
    93
  • Lastpage
    96
  • Abstract
    Music transcription refers to the extraction of a human-readable and interpretable description from a recording of a music performance. Automatic music transcription remains a challenging research problem when dealing with polyphonic sounds or when removing certain constraints. Some instruments, such as guitars and violins, add ambiguity to the problem because the same note can be played at different positions. When dealing with guitar music, tablatures are often preferred to the usual music score, as they present information in a more accessible way. Here, we address this issue with a system which uses the visual modality to support traditional audio transcription techniques. The system is composed of four modules which have been implemented and evaluated: a system which tracks the position of the fretboard on a video stream, a system which automatically detects the position of the guitar on the first fret to initialize the first system, a system which detects the position of the hand on the guitar, and finally a system which fuses the visual and audio information to extract a tablature. Results show that this kind of multimodal approach can easily disambiguate 89% of notes in a deterministic way.
  • Keywords
    music; musical instruments; video signal processing; audio transcription; fretboard tracking; guitar music tablature; multimodal approach; music score; music transcription; polyphonic sounds; video cameras; visual modality; Data mining; Fingers; Fuses; Humans; Information analysis; Instruments; Multiple signal classification; Music; Streaming media; Timbre; Guitar; Multimodal; Music Transcription; Tablature;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image Processing, 2008. ICIP 2008. 15th IEEE International Conference on
  • Conference_Location
    San Diego, CA
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4244-1765-0
  • Electronic_ISBN
    1522-4880
  • Type
    conf
  • DOI
    10.1109/ICIP.2008.4711699
  • Filename
    4711699