|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/961
|
| Title: | Hidden Markov Model for Automatic Transcription of MIDI Signals |
| Authors: | Takeda, Haruto Saito, Naoki Otsuki, Tomoshi Nakai, Mitsuru Shimodaira, Hiroshi Sagayama, Shigeki |
| Issue Date: | Dec-2002 |
| Citation: | In Multimedia Signal Processing, 2002 IEEE Workshop on, 9-11 Dec. 2002 Page(s):428 - 431 |
| Publisher: | IEEE |
| Abstract: | This paper describes a Hidden Markov Model (HMM)-based method of automatic transcription of MIDI (Musical Instrument Digital Interface) signals of performed music. The problem is formulated as recognition of a given sequence of fluctuating note durations to find the most likely intended note sequence utilizing the modern continuous speech recognition technique. Combining a stochastic model of deviating note durations and a stochastic grammar representing possible sequences of notes, the maximum likelihood estimate of the note sequence is searched in terms of Viterbi algorithm. The same principle is successfully applied to a joint problem of bar line allocation, time measure recognition, and tempo estimation. Finally, durations of consecutive n notes are combined to form a "rhythm vector" representing tempo-free relative durations of the notes and treated in the same framework. Significant improvements compared with conventional "quantization" techniques are shown. |
| URI: | http://ieeexplore.ieee.org/servlet/opac?punumber=8561 http://hdl.handle.net/1842/961 |
| Appears in Collections: | CSTR publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|