Home

Caltech
Center for Neuromorphic Systems Engineering

Home
Research
News
People

[back]

Primitives for Human Motion: a Dynamical Approach
D. Del Vecchio, R.M. Murray, P. Perona

Abstract. Using tools from d dynamical systems theory and systems identification theory we develop the study of primitives for human motion which we refer to as movemes. We introduce basic definitions of dynamical independence of LTI systems and segmentability of signals and we develop classification and segmentation algorithms for two dimensional motions. We test our ideas on data sampled from four human subjects who were engaged in a simple real­life activity including two movemes. Our experiments show that we are able to distinguish between the two movemes and recognize them even when they take place in an activity containing more than one moveme.

Introduction and Motivation: Building systems that can detect and recognize hu­man actions and activities is an important goal of modern engineering. Applications range from human­machine interfaces, to security to entertainment. The first fundamental problem in achieving this goal is one of representation. Our point of view is that human activity should be decomposed into its building blocks which belong to an ``alphabet'' of elementary actions that the machine knows. We refer to these primitives of motion as movemes. This word first came up in the work by (Bregler and Malik, 1997). Their approach

Figure 1

D oes not include an input and therefore is only applicable to periodic or stereotypical motions, such as walking or running where the motion is always the same. (Goncalves et al., 1998) also proposed to divide human motion into elementary trajectories called movemes. They dealt with the problem in a phenomenological and non­causal way: each moveme was parameterized by goal and style parameters. We attempt here to define movemes in terms of causal dynamical systems; this way a moveme could be parameterized by a small set of dynamical parameters and by an input which drives the overall dynamics. Our aim is to build an ``alphabet of movemes'' which one can compose to represent and describe human motion similar to the way phonemes are used in speech. Two more problems we address are the ones of segmentation and classification: can a continuous trajectory of the human body be decomposed automatically into its component movemes? We validate our ideas by analyzing the mouse trajectories generated by computer users as they ``pointand­click'' (we call this the reach moveme) and trace straight lines (we call this the draw moveme).


APPROACH: represent movemes as causal dynamical systems








Figure 2

Dynamically independent sets of dynamical systems are roughly defined in Fig.2 : they are systems parameterized by parameters which lie in separated subsets of the parameter space (see figure). Then the elements MR and MD of the set {MR,MD} of mutually dynamically independent systems are said to be movemes. The trajectories in space that we can measure are then represented by a sequence in time of moveme's outputs y(t), see Fig.3.

We say in this case that the signal y(t) is segmentable since it can be decomposed into two moveme outputs S1(t) and S2(t)









Figure 3

Do movemes exist in practice?
In other words can we observe human actions whose underling dynamics are characterized by parameters that in parameter space separate as in Fig.2? It is clear that if this were the case there would exist human activities which can be classified and recognized on the basis of their dynamics. On the basis of the computer experiments we carried out the answer seems to be yes, movemes exist. In particular we found two actions which can be defined as movemes: reach moveme and draw moveme. The sequences of experiments are represented in Fig.4. In the draw sequence the user had to redraw a randomly appearing line, while in the reach sequence the user had to reach three randomly appearing points


Figure 4

Figure 5

The resulting parameters plotted in parameter space are represented in Fig.5. From such figures we find the two sets CR and CD in parameter space that we defined in Fig.2. Such sets are linearly separable with small errors (the training error is about 4%). At this point we can claim that the two sets CR and CD parameterize two dynamical model sets that we previously called MR and MD which according to the definition are dynamically independent and therefore they form a set {MR,MD} of movemes. We say also that the set {MR,MD} constitute an alphabet of movemes. Thanks to the mutual positions of the sets CR and CD we can solve a classification problem, that is: given a signal y(t) (that is a reach or a draw action) determine, on the basis of our alphabet, what is the moveme y(t) is more likely to come from. The results of the classification problem are reported in the second column of the table in Fig.5, and they are satisfactory. Segmentation problem. Once the classification problem has been solved, we want to solve the problem of recognizing a reach or a draw action even when it takes place in a composed activity:

Effective segmentation point and segmentation point which results from the segmentation algorithm


top