|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/2006
|
| Title: | Automatic Head Motion Prediction from Speech Data |
| Authors: | Hofer, Gregor Shimodaira, Hiroshi |
| Issue Date: | 2007 |
| Citation: | Gregor Hofer and Hiroshi Shimodaira. Automatic head motion prediction from speech data. In Proc. Interspeech 2007, Antwerp, Belgium, 2007. |
| Abstract: | In this paper we present a novel approach to generate a
sequence of head motion units given some speech. The
modelling approach is based on the notion that head motion
can be divided into a number of short homogeneous units that
can be modelled individually. The system is based on Hidden
Markov Models (HMM), which are trained on motion units
and act as a sequence generator. They can be evaluated by
an accuracy measure. A database of motion capture data was
collected and manually annotated for head motion and is used
to train the models. It was found that the model is good at
distinguishing high activity regions from regions with less
activity with accuracies around 75 percent. Furthermore the
model is able to distinguish different head motion patterns
based on speech features somewhat reliably, with accuracies
reaching almost 70 percent. |
| Keywords: | speech technology |
| URI: | http://hdl.handle.net/1842/2006 |
| Appears in Collections: | CSTR publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|