nips nips2008 nips2008-247 nips2008-247-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Silvia Chiappa, Jens Kober, Jan R. Peters
Abstract: Motor primitives or motion templates have become an important concept for both modeling human motor control as well as generating robot behaviors using imitation learning. Recent impressive results range from humanoid robot movement generation to timing models of human motions. The automatic generation of skill libraries containing multiple motion templates is an important step in robot learning. Such a skill learning system needs to cluster similar movements together and represent each resulting motion template as a generative model which is subsequently used for the execution of the behavior by a robot system. In this paper, we show how human trajectories captured as multi-dimensional time-series can be clustered using Bayesian mixtures of linear Gaussian state-space models based on the similarity of their dynamics. The appropriate number of templates is automatically determined by enforcing a parsimonious parametrization. As the resulting model is intractable, we introduce a novel approximation method based on variational Bayes, which is especially designed to enable the use of efficient inference algorithms. On recorded human Balero movements, this method is not only capable of finding reasonable motion templates but also yields a generative model which works well in the execution of this complex task on a simulated anthropomorphic SARCOS arm.
[1] T. Flash and B. Hochner. Motor primitives in vertebrates and invertebrates. Current Opinion in Neurobiology, 15(6):660–666, 2005.
[2] B. Williams, M. Toussaint, and A. Storkey. Modelling motion primitives and their timing in biologically executed movements. In Advances in Neural Information Processing Systems 20, pages 1609–1616, 2008.
[3] A. Ijspeert, J. Nakanishi, and S. Schaal. Learning attractor landscapes for learning motor primitives. In Advances in Neural Information Processing Systems 15, pages 1547–1554, 2003.
[4] S. Calinon, F. Guenter, and A. Billard. On learning, representing and generalizing a task in a humanoid robot. IEEE Transactions on Systems, Man and Cybernetics, Part B, 37(2):286–298, 2007.
[5] J. Durbin and S. J. Koopman. Time Series Analysis by State Space Methods. Oxford Univ. Press, 2001.
[6] Y. Xiong and D-Y. Yeung. Mixtures of ARMA models for model-based time series clustering. In Proceedings of the IEEE International Conference on Data Mining, pages 717–720, 2002.
[7] C. Li and G. Biswas. A Bayesian approach to temporal data clustering using hidden Markov models. In Proceedings of the International Conference on Machine Learning, pages 543–550, 2000.
[8] J. Kober, B. Mohler and J. Peters. Learning perceptual coupling for motor primitives. International Conference on Intelligent Robots and Systems, pages 834–839, 2008.
[9] S. Fogel, J. Jacob, and C. Smith. Increased sleep spindle activity following simple motor procedural learning in humans. Actas de Fisiologia, 7(123), 2001.
[10] D. J. C. MacKay. Information Theory, Inference and Learning Algorithms. Cambridge Univ. Press, 2003.
[11] D. Wipf and J. Palmer and B. Rao. Perspectives on Sparse Bayesian Learning. In Advances in Neural Information Processing Systems 16, 2004.
[12] S. Chiappa and D. Barber. Dirichlet Mixtures of Bayesian Linear Gaussian State-Space Models: a Variational Approach. Technical Report no. 161, MPI for Biological Cybernetics, Tübingen, Germany, 2007.
[13] K. Kurihara, M. Welling, and Y. W. Teh. Collapsed variational Dirichlet process mixture models. In Proceedings of the International Joint Conference on Artificial Intelligence, pages 2796–2801, 2007.
[14] D. Barber and S. Chiappa. Unified inference for variational Bayesian linear Gaussian statespace models. In Advances in Neural Information Processing Systems 19, pages 81–88, 2007.
[15] H. Miyamoto and S. Schaal and F. Gandolfo and Y. Koike and R. Osu and E. Nakano and Y. Wada and M. Kawato. A Kendama learning robot based on bi-directional theory. Neural Networks, 9(8): 1281–1302, 1996 8