nips nips2005 nips2005-120 nips2005-120-reference knowledge-graph by maker-knowledge-mining

120 nips-2005-Learning vehicular dynamics, with application to modeling helicopters

Source: pdf

Author: Pieter Abbeel, Varun Ganapathi, Andrew Y. Ng

Abstract: We consider the problem of modeling a helicopter’s dynamics based on state-action trajectories collected from it. The contribution of this paper is two-fold. First, we consider the linear models such as learned by CIFER (the industry standard in helicopter identiﬁcation), and show that the linear parameterization makes certain properties of dynamical systems, such as inertia, fundamentally difﬁcult to capture. We propose an alternative, acceleration based, parameterization that does not suffer from this deﬁciency, and that can be learned as efﬁciently from data. Second, a Markov decision process model of a helicopter’s dynamics would explicitly model only the one-step transitions, but we are often interested in a model’s predictive performance over longer timescales. In this paper, we present an efﬁcient algorithm for (approximately) minimizing the prediction error over long time scales. We present empirical results on two different helicopters. Although this work was motivated by the problem of modeling helicopters, the ideas presented here are general, and can be applied to modeling large classes of vehicular dynamics. 1

reference text

[1] P. Abbeel and A. Y. Ng. Learning ﬁrst order Markov models for control. In NIPS 18, 2005.

[2] J. Bagnell and J. Schneider. Autonomous helicopter control using reinforcement learning policy search methods. In International Conference on Robotics and Automation. IEEE, 2001.

[3] V. Gavrilets, I. Martinos, B. Mettler, and E. Feron. Control logic for automated aerobatic ﬂight of miniature helicopter. In AIAA Guidance, Navigation and Control Conference, 2002.

[4] V. Gavrilets, I. Martinos, B. Mettler, and E. Feron. Flight test and simulation results for an autonomous aerobatic helicopter. In AIAA/IEEE Digital Avionics Systems Conference, 2002.

[5] J. Leishman. Principles of Helicopter Aerodynamics. Cambridge University Press, 2000.

[6] B. Mettler, M. Tischler, and T. Kanade. System identiﬁcation of small-size unmanned helicopter dynamics. In American Helicopter Society, 55th Forum, 1999.

[7] Andrew Y. Ng, Adam Coates, Mark Diel, Varun Ganapathi, Jamie Schulte, Ben Tse, Eric Berger, and Eric Liang. Autonomous inverted helicopter ﬂight via reinforcement learning. In International Symposium on Experimental Robotics, 2004.

[8] Andrew Y. Ng, H. Jin Kim, Michael Jordan, and Shankar Sastry. Autnonomous helicopter ﬂight via reinforcement learning. In NIPS 16, 2004.

[9] Jonathan M. Roberts, Peter I. Corke, and Gregg Buskey. Low-cost ﬂight control system for a small autonomous helicopter. In IEEE Int’l Conf. on Robotics and Automation, 2003.

[10] J. Seddon. Basic Helicopter Aerodynamics. AIAA Education Series. America Institute of Aeronautics and Astronautics, 1990.

[11] M.B. Tischler and M.G. Cauffman. Frequency response method for rotorcraft system identiﬁcation: Flight application to BO-105 couple rotor/fuselage dynamics. Journal of the American Helicopter Society, 1992.