nips nips2010 nips2010-268 nips2010-268-reference knowledge-graph by maker-knowledge-mining

268 nips-2010-The Neural Costs of Optimal Control


Source: pdf

Author: Samuel Gershman, Robert Wilson

Abstract: Optimal control entails combining probabilities and utilities. However, for most practical problems, probability densities can be represented only approximately. Choosing an approximation requires balancing the benefits of an accurate approximation against the costs of computing it. We propose a variational framework for achieving this balance and apply it to the problem of how a neural population code should optimally represent a distribution under resource constraints. The essence of our analysis is the conjecture that population codes are organized to maximize a lower bound on the log expected utility. This theory can account for a plethora of experimental data, including the reward-modulation of sensory receptive fields, GABAergic effects on saccadic movements, and risk aversion in decisions under uncertainty. 1


reference text

[1] C.H. Anderson and D.C. Van Essen. Neurobiological computational systems. Computational intelligence imitating life, pages 213–222, 1994.

[2] J.S. Anderson, I. Lampl, D.C. Gillespie, and D. Ferster. The contribution of noise to contrast invariance of orientation tuning in cat visual cortex. Science, 290(5498):1968, 2000.

[3] R.L. De Valois, E. William Yund, and N. Hepler. The orientation and direction selectivity of cells in macaque visual cortex. Vision Research, 22(5):531–544, 1982.

[4] K. Friston. The free-energy principle: a unified brain theory? Nature Reviews Neuroscience, 11(2):127–138, 2010. 8

[5] T. Furmston and D. Barber. Variational methods for reinforcement learning. Proceedings of the Thirteenth Conference on Artificial Intelligence and Statistics (AISTATS), 2010.

[6] S.J. Gershman, E. Vul, and J.B. Tenenbaum. Perceptual multistability as Markov Chain Monte Carlo inference. In Y. Bengio, D. Schuurmans, J. Lafferty, C. K. I. Williams, and A. Culotta, editors, Advances in Neural Information Processing Systems 22, pages 611–619. 2009.

[7] P.E. Gold. Role of glucose in regulating the brain and cognition. American Journal of Clinical Nutrition, 61:987S–995S, 1995.

[8] E.T. Jaynes. On the rationale of maximum-entropy methods. Proceedings of the IEEE, 70(9):939–952, 1982.

[9] M.I. Jordan, Z. Ghahramani, T.S. Jaakkola, and L.K. Saul. An introduction to variational methods for graphical models. Machine learning, 37(2):183–233, 1999.

[10] W.B. Levy and R.A. Baxter. Energy efficient neural codes. Neural Computation, 8(3):531–543, 1996.

[11] W.J. Ma, J.M. Beck, P.E. Latham, and A. Pouget. Bayesian inference with probabilistic population codes. Nature Neuroscience, 9(11):1432–1438, 2006.

[12] C.K. Machens, T. Gollisch, O. Kolesnikova, and A.V.M. Herz. Testing the efficiency of sensory coding with optimal stimulus ensembles. Neuron, 47(3):447–456, 2005.

[13] RJ McCrimmon, IJ Deary, BJP Huntly, KJ MacLeod, and BM Frier. Visual information processing during controlled hypoglycaemia in humans. Brain, 119(4):1277, 1996.

[14] R.M. McPeek and E.L. Keller. Deficits in saccade target selection after inactivation of superior colliculus. Nature neuroscience, 7(7):757–763, 2004.

[15] P.R. Montague and B. King-Casas. Efficient statistics, common currencies and the problem of reward-harvesting. Trends in cognitive sciences, 11(12):514–519, 2007.

[16] R.P.N. Rao. Bayesian computation in recurrent neural circuits. Neural Computation, 16(1):1– 38, 2004.

[17] M. Sahani. A biologically plausible algorithm for reinforcement-shaped representational learning. Advances in Neural Information Processing, 16, 2004.

[18] L.J. Savage. The Foundations of Statistics. Dover, 1972.

[19] G. Sclar and RD Freeman. Orientation selectivity in the cat’s striate cortex is invariant with stimulus contrast. Experimental Brain Research, 46(3):457–461, 1982.

[20] J.T. Serences. Value-based modulations in human visual cortex. Neuron, 60(6):1169–1181, 2008.

[21] L. Shi, N.H. Feldman, and T.L. Griffiths. Performing Bayesian inference with exemplar models. In Proceedings of the 30th annual conference of the cognitive science society, pages 745–750, 2008.

[22] Lei Shi and Thomas Griffiths. Neural implementation of hierarchical bayesian inference by importance sampling. In Y. Bengio, D. Schuurmans, J. Lafferty, C. K. I. Williams, and A. Culotta, editors, Advances in Neural Information Processing Systems 22, pages 1669–1677. 2009.

[23] M.G. Shuler and M.F. Bear. Reward timing in the primary visual cortex. Science, 311(5767):1606, 2006.

[24] H.A. Simon. Models of Bounded Rationality. MIT Press, 1982.

[25] A. Tversky and D. Kahneman. Advances in prospect theory: cumulative representation of uncertainty. Journal of Risk and uncertainty, 5(4):297–323, 1992.

[26] E. Vul, N.D. Goodman, T.L. Griffiths, and J.B. Tenenbaum. One and done? Optimal decisions from very few samples. In Proceedings of the 31st Annual Meeting of the Cognitive Science Society, Amseterdam, the Netherlands, 2009.

[27] R.C. Wilson and L.H. Finkel. A neural implementation of the kalman filter. In Y. Bengio, D. Schuurmans, J. Lafferty, C. K. I. Williams, and A. Culotta, editors, Advances in Neural Information Processing Systems 22, pages 2062–2070. 2009.

[28] R.S. Zemel, P. Dayan, and A. Pouget. Probabilistic interpretation of population codes. Neural Computation, 10(2):403–430, 1998. 9