nips nips2000 nips2000-49 nips2000-49-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Peter Dayan, Sham Kakade
Abstract: Explaining away has mostly been considered in terms of inference of states in belief networks. We show how it can also arise in a Bayesian context in inference about the weights governing relationships such as those between stimuli and reinforcers in conditioning experiments such as bacA, 'Ward blocking. We show how explaining away in weight space can be accounted for using an extension of a Kalman filter model; provide a new approximate way of looking at the Kalman gain matrix as a whitener for the correlation matrix of the observation process; suggest a network implementation of this whitener using an architecture due to Goodall; and show that the resulting model exhibits backward blocking.
Atick, JJ & Redlich, AN (1993) Convergent algorithm for sensory receptive field development. Neural Computation 5:45-60. Goodall, MC (1960) Performance of stochastic net. Nature 185:557-558. Jordan, MI, editor (1998) Learning in Graphical Models. Dordrecht: Kluwer. Kakade, S & Dayan, P (2000) Acquisition in autoshaping. In SA Solla, TK Leen & K-R Muller, editors, Advances in Neural Information Processing Systems, 12. Miller, RR & Matute, H (1996) . Biological significance in forward and backward blocking: Resolution of a discrepancy between animal conditioning and human causal judgment. Journal of Experimental Psychology: General 125:370-386. Rescorla, RA & Wagner, AR (1972) A theory of Pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement. In AH Black & WF Prokasy, editors , Classical Conditioning II: Current Research and Theory. New York:Aleton-Century-Crofts, 64-69. Shanks, DR (1985). Forward and backward blocking in human contingency judgement. Quarterly Journal of Experimental Psychology: Comparative & Physiological P5ychology 37:1-21. Sutton, RS (1992). Gain adaptation beats least squares? In Proceedings of the 7th Yale Workshop on Adaptive and Learning Systems. Wagner, AR & Brandon, SE (1989). Evolution of a structured connectionist model of Pavlovian conditioning (AESOP). In SB Klein & RR Mowrer, editors, Contemporary Learning Theories. Hillsdale, NJ: Erlbaum, 149-189. Widrow, B & Stearns, SD (1985) Adaptive Signal Processing. Englewood Cliffs, NJ:Prentice-Hall.