Title : 
Probabilistic model of two-dimensional rhythm tree structure representation for automatic transcription of polyphonic MIDI signals
         
        
            Author : 
Tsuchiya, Masahiro ; Ochiai, K. ; Kameoka, Hirokazu ; Sagayama, Shigeki
         
        
            Author_Institution : 
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Hongo, Japan
         
        
        
            fDate : 
Oct. 29 2013-Nov. 1 2013
         
        
        
        
            Abstract : 
This paper proposes a Bayesian approach for automatic music transcription of polyphonic MIDI signals based on generative modeling of onset occurrences of musical notes. Automatic music transcription involves two subproblems that are interdependent of each other: rhythm recognition and tempo estimation. When we listen to music, we are able to recognize its rhythm and tempo (or beat location) fairly easily even though there is ambiguity in determining the individual note values and tempo. This may be made possible through our empirical knowledge about rhythm patterns and tempo variations that possibly occur in music. To automate the process of recognizing the rhythm and tempo of music, we propose modeling the generative process of a MIDI signal of polyphonic music by combining the sub-process by which a musically natural tempo curve is generated and the sub-process by which a set of note onset positions is generated based on a 2-dimensional rhythm tree structure representation of music, and develop a parameter inference algorithm for the proposed model. We show some of the transcription results obtained with the present method.
         
        
            Keywords : 
Bayes methods; music; pattern recognition; signal representation; tree data structures; 2-dimensional rhythm tree structure representation; Bayesian approach; automatic music transcription; beat location; generative onset occurrence modeling; generative process modeling; musical notes; musically natural tempo curve; note onset positions; parameter inference algorithm; polyphonic MIDI signals; rhythm patterns; rhythm recognition; tempo estimation; tempo variations; Estimation; Hidden Markov models; Multiple signal classification; Production; Rhythm; Speech recognition; Vocabulary;
         
        
        
        
            Conference_Titel : 
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
         
        
            Conference_Location : 
Kaohsiung
         
        
        
            DOI : 
10.1109/APSIPA.2013.6694308