nips nips2005 nips2005-32 nips2005-32-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Alan L. Yuille
Abstract: We show that linear generalizations of Rescorla-Wagner can perform Maximum Likelihood estimation of the parameters of all generative models for causal reasoning. Our approach involves augmenting variables to deal with conjunctions of causes, similar to the agumented model of Rescorla. Our results involve genericity assumptions on the distributions of causes. If these assumptions are violated, for example for the Cheng causal power theory, then we show that a linear Rescorla-Wagner can estimate the parameters of the model up to a nonlinear transformtion. Moreover, a nonlinear Rescorla-Wagner is able to estimate the parameters directly to within arbitrary accuracy. Previous results can be used to determine convergence and to estimate convergence rates. 1
[1]. R.A. Rescorla and A.R. Wagner. “A Theory of Pavlovian Conditioning”. In A.H. Black andW.F. Prokasy, eds. Classical Conditioning II: Current Research and Theory. New York. Appleton-Century-Crofts, pp 64-99. 1972.
[2] R.A. Rescorla. Journal of Comparative and Physiological Psychology. 79, 307. 1972.
[3]. B. A. Spellman. “Conditioning Causality”. In D.R. Shanks, K.J. Holyoak, and D.L. Medin, (eds). Causal Learning: The Psychology of Learning and Motivation, Vol. 34. San Diego, California. Academic Press. pp 167-206. 1996.
[4]. P. Cheng. “From Covariance to Causation: A Causal Power Theory”. Psychological Review, 104, pp 367-405. 1997.
[5]. M. Buehner and P. Cheng. “Causal Induction: The power PC theory versus the Rescorla-Wagner theory”. In Proceedings of the 19th Annual Conference of the Cognitive Science Society”. 1997.
[6]. J.B. Tenenbaum and T.L. Griffiths. “Structure Learning in Human Causal Induction”. Advances in Neural Information Processing Systems 12. MIT Press. 2001.
[7]. D. Danks, T.L. Griffiths, J.B. Tenenbaum. “Dynamical Causal Learning”. Advances in Neural Information Processing Systems 14. 2003.
[8] A.C. Courville, N.D. Dew, and D.S. Touretsky. “Similarity and discrimination in classical conditioning”. NIPS. 2004.
[9]. D. Danks. “Equilibria of the Rescorla-Wagner Model”. Journal of Mathematical Psychology. Vol. 47, pp 109-121. 2003.
[10] A.L. Yuille. “The Rescorla-Wagner algorithm and Maximum Likelihood estimation of causal parameters”. NIPS. 2004.
[11]. P. Dayan and S. Kakade. “Explaining away in weight space”. In Advances in Neural Information Processing Systems 13. 2001.
[12] B. Widrow and M.E. Hoff. “Adapting Switching Circuits”. 1960 IRE WESCON Conv. Record., Part 4, pp 96-104. 1960.
[13] A.G. Barto and R.S. Sutton. “Time-derivative Models of Pavlovian Conditioning”. In Learning and Computational Neuroscience: Foundations of Adaptive Networks. M. Gabriel and J. Moore (eds.). pp 497-537. MIT Press. Cambridge, MA. 1990.