Abstract: The proposal that cortical activity in the visual cortex is optimized for sparse neural activity is one of the most established ideas in computational neuroscience. However, direct experimental evidence for optimal sparse coding remains inconclusive, mostly due to the lack of reference values on which to judge the measured sparseness. Here we analyze neural responses to natural movies in the primary visual cortex of ferrets at different stages of development and of rats while awake and under different levels of anesthesia. In contrast with prediction from a sparse coding model, our data shows that population and lifetime sparseness decrease with visual experience, and increase from the awake to anesthetized state. These results suggest that the representation in the primary visual cortex is not actively optimized to maximize sparseness.

1 No evidence for active sparsification in the visual cortex Pietro Berkes, Benjamin L. [sent-1, score-0.39]

2 White, and József Fiser Volen Center for Complex Systems Brandeis University, Waltham, MA 02454 Abstract The proposal that cortical activity in the visual cortex is optimized for sparse neural activity is one of the most established ideas in computational neuroscience. [sent-2, score-0.784]

3 However, direct experimental evidence for optimal sparse coding remains inconclusive, mostly due to the lack of reference values on which to judge the measured sparseness. [sent-3, score-0.329]

4 Here we analyze neural responses to natural movies in the primary visual cortex of ferrets at different stages of development and of rats while awake and under different levels of anesthesia. [sent-4, score-1.044]

5 In contrast with prediction from a sparse coding model, our data shows that population and lifetime sparseness decrease with visual experience, and increase from the awake to anesthetized state. [sent-5, score-1.786]

6 These results suggest that the representation in the primary visual cortex is not actively optimized to maximize sparseness. [sent-6, score-0.429]

7 Computational models that optimize the sparseness of the responses of hidden units to natural images have been shown to reproduce the basic features of the receptive fields (RFs) of simple cells in V1 [3, 4, 5]. [sent-11, score-0.823]

8 Moreover, manipulation of the statistics of the environment of developing animals leads to changes in the RF structure that can be predicted by sparse coding models [6]. [sent-12, score-0.421]

9 Electrophysiological studies performed in primary visual cortex agree in reporting high sparseness values for neural activity [7, 8, 9, 10, 11, 12]. [sent-14, score-1.198]

10 However, it is contested whether the high degree of sparseness is due to a neural representation which is optimally sparse, or is an epiphenomenon due to neural selectivity [10, 12]. [sent-15, score-0.77]

11 This controversy is mostly due to a lack of reference measurement with which to judge the sparseness of the neural representation in relative, rather than absolute terms. [sent-16, score-0.63]

12 Another problem is that most of these studies have been performed on anesthetized animals [7, 9, 10, 11, 12], even though the effect of anesthesia might bias sparseness measurements (cf. [sent-17, score-1.007]

13 We compare this data with theoretical predictions: 1) sparseness should increase with visual experience, and thus with age, as the visual system adapts to the statistics of the visual environment; 2) sparseness should be maximal in the “working regime” of the animal, i. [sent-13, score-1.75]

14 In both cases, the neural data shows a trend opposite to the one expected in a sparse coding system, suggesting that the visual system is not actively optimizing the sparseness of its representation. [sent-24, score-1.204]

15 The paper is organized as follows: We first introduce and discuss the lifetime and population sparseness measures we will be using throughout the paper. [sent-25, score-0.979]

16 Next, we present the classical, linear sparse coding model of natural images, and derive an equivalent, stochastic neural network, whose output firing rates correspond to Monte Carlo samples from the posterior distribution of visual elements given an image. [sent-26, score-0.638]

17 In the rest of the paper, we make use of this neural architecture in order to predict changes in sparseness over development and under anesthesia, and compare these predictions with electrophysiological recordings. [sent-27, score-0.773]

18 2 Lifetime and population sparseness The diverse benefits of sparseness mentioned in the introduction rely on different aspects of the neural code, which are captured to a different extent by two sparseness measures, referred to as lifetime and population sparseness. [sent-28, score-2.267]

19 Lifetime sparseness measures the distribution of the response of an individual cell to a set of stimuli, and is thus related to the cell’s selectivity. [sent-29, score-0.61]

20 These requirements of efficient coding are based upon the instantaneous population activity to stimuli and need to take into consideration the population sparseness of neural response. [sent-32, score-1.252]

21 Average lifetime and population sparseness are identical if the units are statistically independent, in which case the distribution is called ergodic [10, 14]. [sent-33, score-1.015]

22 Here we will use three measures of sparseness, two quantifying population sparseness, and one lifetime sparseness. [sent-36, score-0.412]

23 Moreover, in neural recordings we discard bins with no neural activity, as population TR is undefined in this case. [sent-43, score-0.336]
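
The reason silent bins must be discarded can be seen in a minimal sketch of a population TR computation. The formula below is an assumption: it is the commonly used Treves-Rolls form, TR = 1 - (mean |r|)^2 / mean(r^2), which may differ in normalization from the definition used in the paper. The function names `treves_rolls` and `population_tr` are hypothetical.

```python
def treves_rolls(rates):
    """Treves-Rolls sparseness of one vector of firing rates.

    Assumed form: 1 - (mean |r|)^2 / mean(r^2).
    Returns None when all rates are zero, because the ratio is then 0/0.
    """
    n = len(rates)
    mean_abs = sum(abs(r) for r in rates) / n
    mean_sq = sum(r * r for r in rates) / n
    if mean_sq == 0.0:
        return None  # undefined for a bin with no activity
    return 1.0 - mean_abs ** 2 / mean_sq


def population_tr(bins):
    """Average TR across time bins, skipping bins with no activity."""
    values = [treves_rolls(b) for b in bins]
    values = [v for v in values if v is not None]
    if not values:
        raise ValueError("all bins are silent; population TR is undefined")
    return sum(values) / len(values)


# Three time bins of four units each: one sparse, one dense, one silent.
bins = [[0, 0, 0, 4], [1, 1, 1, 1], [0, 0, 0, 0]]
print(population_tr(bins))
```

In this toy input the first bin has a single active unit (high sparseness), the second has uniform activity (zero sparseness), and the third is skipped.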

24 This seems to be adequate for our purposes, as the arguments for sparseness involve metabolic costs and coding arguments like redundancy reduction that are sensitive to overall firing rates. [sent-46, score-0.775]

25 Previous studies have shown that alternative measures of population and lifetime sparseness are highly correlated, therefore our choice does not affect the final results [15, 10]. [sent-47, score-1.007]

26 Here we set the sparse prior distribution to a Student-t distribution with α degrees of freedom, $p(x_k) = \frac{1}{Z}\left(1 + \frac{1}{\alpha}\left(\frac{x_k}{\lambda}\right)^2\right)^{-\frac{\alpha+1}{2}}$, (5) with λ chosen such that the distribution has unit variance. [sent-26, score-0.277]

27 This is a common prior for sparse coding models [3], and its analytical form allows the development of efficient inference and learning algorithms [16, 17]. [sent-59, score-0.371]
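
As a hedged illustration of how λ can be chosen for unit variance: a Student-t variable with α degrees of freedom scaled by λ has variance λ²α/(α−2) for α > 2, so λ = sqrt((α−2)/α). The sketch below assumes this standard result and the standard Student-t normalizer; the function names are hypothetical and not taken from the paper.

```python
import math


def unit_variance_scale(alpha):
    """Scale lambda that gives the scaled Student-t prior unit variance.

    Uses Var = lambda^2 * alpha / (alpha - 2), which requires alpha > 2.
    """
    if alpha <= 2.0:
        raise ValueError("the variance is finite only for alpha > 2")
    return math.sqrt((alpha - 2.0) / alpha)


def normalizer(alpha, lam):
    """Normalizing constant Z of the scaled Student-t density."""
    return (lam * math.sqrt(alpha * math.pi)
            * math.gamma(alpha / 2.0) / math.gamma((alpha + 1.0) / 2.0))


def student_t_pdf(x, alpha):
    """Unit-variance Student-t prior density evaluated at x."""
    lam = unit_variance_scale(alpha)
    z = normalizer(alpha, lam)
    return (1.0 + (x / lam) ** 2 / alpha) ** (-(alpha + 1.0) / 2.0) / z


print(unit_variance_scale(5.0), student_t_pdf(0.0, 5.0))
```

Smaller values of α give heavier tails and a sharper peak at zero, which is what makes the prior favor sparse responses.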

28 Figure 2: Neural implementation of Gibbs sampling in a sparse coding model. [sent-28, score-0.329]

29 4 Sampling, sparse coding neural network In order to gain some intuition about the neural operations that may underlie inference in this model, we derive an equivalent neural network architecture. [sent-81, score-0.604]

30 It has been suggested that neural activity is best interpreted as samples from the posterior probability of an internal, probabilistic model of the sensory input. [sent-82, score-0.302]

31 This assumption is consistent with many experimental observations, including high trial-by-trial variability and spontaneous activity in awake animals [18, 19, 20]. [sent-83, score-0.446]

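32 Expanding the exponent, eliminating the terms that do not depend on $x_k$, and noting that $R_{kk} = -1$, since the generative weights have unit norm, we get $p(x_k \mid x_{i \neq k}, y) \propto \exp\left(\frac{1}{\sigma_y^2}\Big(\sum_i G_{ik} y_i\Big) x_k + \frac{1}{\sigma_y^2}\Big(\sum_{j \neq k} R_{jk} x_j\Big) x_k - \frac{1}{2\sigma_y^2} x_k^2 + f(x_k)\right)$. [sent-32, score-0.38]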
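
The expansion can be sketched step by step as follows. This is a reconstruction under stated assumptions rather than the paper's own derivation: the image noise is taken to be Gaussian with variance $\sigma_y^2$, $f(x_k)$ is taken to be the log prior of unit $k$, and the recurrent weights are taken to be $R = -G^\top G$, which is consistent with $R_{kk} = -1$ for unit-norm generative weights.

```latex
\begin{align}
\log p(x \mid y)
  &= -\frac{1}{2\sigma_y^2}\,\lVert y - Gx \rVert^2 + \sum_k f(x_k) + \text{const} \\
  &= \frac{1}{\sigma_y^2}\, y^\top G x
     + \frac{1}{2\sigma_y^2}\, x^\top R\, x
     + \sum_k f(x_k) + \text{const},
     \qquad R = -G^\top G . \\
\intertext{Keeping only the terms that depend on $x_k$, and using the symmetry of $R$:}
\log p(x_k \mid x_{i \neq k}, y)
  &= \frac{1}{\sigma_y^2}\Big(\sum_i G_{ik}\, y_i\Big) x_k
     + \frac{1}{\sigma_y^2}\Big(\sum_{j \neq k} R_{jk}\, x_j\Big) x_k
     + \frac{R_{kk}}{2\sigma_y^2}\, x_k^2
     + f(x_k) + \text{const} .
\end{align}
```

Substituting $R_{kk} = -1$ turns the quadratic term into $-\frac{1}{2\sigma_y^2} x_k^2$, which gives the conditional distribution stated above.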

33 5 Active sparsification over learning A simple, intuitive prediction for a system that optimizes for sparseness is that the sparseness of its representation should increase over learning. [sent-94, score-1.199]

34 Since a sparse coding system, including our model, might not directly maximize our measures of sparseness, TR and AS, we verify this intuition by analyzing the model’s representation of images at various stages of learning. [sent-95, score-0.429]

35 (A) The lines indicate the average sparseness over units and samples. [sent-100, score-0.646]

36 Since the three measures have very different values, we report the change in sparseness in percent of the first iteration. [sent-102, score-0.61]

37 Colored text: absolute values of sparseness at the end of learning. [sent-103, score-0.567]

38 (B) The lines indicate the average sparseness for different animals. [sent-104, score-0.567]

39 As anticipated, both population and lifetime sparseness increase monotonically. [sent-111, score-0.965]

40 Having confirmed our intuition with the sparse coding model, we turn to data from electrophysiological recordings. [sent-112, score-0.401]

41 We analyzed multi-unit recordings from arrays of 16 electrodes implanted in the primary visual cortex of 15 ferrets at various stages of development, from eye opening at postnatal day 29 or 30 (P29-30) to adulthood at P151 (see Suppl Mat for experimental details). [sent-113, score-0.62]

42 Over this maturation period, the visual system of ferrets adapts to the statistics of the environment [22, 23]. [sent-114, score-0.382]

43 For each animal, neural activity was recorded and collected in 10 ms bins for 15 sessions of 100 seconds each (for a total of 25 minutes), during which the animal was shown scenes from a movie. [sent-115, score-0.323]

44 We find that all three measures of sparseness decrease significantly with age. [sent-44, score-0.664]

45 Thus, during a period when the cortex actively adapts to the visual environment, the representation in primary visual cortex becomes less sparse, suggesting that the optimization of sparseness is not a primary objective for learning in the visual system. [sent-117, score-1.593]

46 The decrease in population sparseness seems to be due to an increase in the dependencies between neurons: Fig. [sent-118, score-0.814]

47 3C shows the Kullback-Leibler divergence between the joint distribution P of neural activity in 2 ms bins and the same distribution, factorized to eliminate neural dependencies, i. [sent-47, score-0.328]
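
A minimal sketch of such a dependency measure is given below. It assumes that activity in each bin is binarized into a word of 0/1 entries (one per channel), that the joint distribution is estimated by counting words, and that the divergence is reported in bits; none of these details are confirmed by the text above, and the function name is hypothetical.

```python
import math
from collections import Counter


def kl_joint_vs_factorized(patterns):
    """KL divergence (in bits) between the empirical joint distribution of
    binary activity words and the product of its single-unit marginals."""
    n_samples = len(patterns)
    n_units = len(patterns[0])
    joint = Counter(tuple(p) for p in patterns)
    # Marginal probability that each unit is active in a bin.
    p_on = [sum(p[i] for p in patterns) / n_samples for i in range(n_units)]
    kl = 0.0
    for word, count in joint.items():
        p = count / n_samples
        q = 1.0
        for i, bit in enumerate(word):
            q *= p_on[i] if bit else 1.0 - p_on[i]
        kl += p * math.log2(p / q)
    return kl


independent = [(0, 0), (0, 1), (1, 0), (1, 1)]
coupled = [(0, 0), (1, 1)]
print(kl_joint_vs_factorized(independent), kl_joint_vs_factorized(coupled))
```

The divergence is zero when the units are independent and grows as their activity becomes more coupled. With real recordings the number of possible words grows exponentially with the number of channels, so a plain counting estimate like this one is biased for small samples.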

48 6 Active sparsification and anesthesia The sparse coding neural network architecture of Fig. [sent-128, score-0.75]

49 2 makes explicit that an optimal sparse coding representation requires a process of active sparsification: In general, because of input noise and the overcompleteness of the representation, there are multiple possible combinations of visual elements that could account for a given image. [sent-129, score-0.621]

50 A) Percent change in sparseness as the recurrent connections are weakened for various values of α. [sent-159, score-0.758]

51 Colored text: absolute values of sparseness at the end of learning. [sent-161, score-0.567]

52 B) Average sparseness measures for V1 responses at various levels of anesthesia. [sent-162, score-0.737]

53 9, it is clear that the recurrent connections are necessary in order to keep the activity of the neurons on the solution line, while the stochastic activation function makes sparse neural responses more likely. [sent-167, score-0.712]

54 In a system that optimizes sparseness, disrupting the active sparsification process will lead to lower lifetime and population sparseness. [sent-169, score-0.523]

55 For example, if we reduce the strength of the recurrent connections in the neural network architecture (Eq. [sent-170, score-0.29]

56 9) by a factor α, $p(x_k \mid x_{i \neq k}, y) \propto \exp\left(\frac{1}{\sigma_y^2}\Big(\sum_i G_{ik} y_i\Big) x_k + \frac{1}{\sigma_y^2}\,\alpha\Big(\sum_{j \neq k} R_{jk} x_j\Big) x_k - \frac{1}{2\sigma_y^2} x_k^2 + f(x_k)\right)$, (10) the neurons become more decoupled, and try to separately account for the input, as illustrated in Fig. [sent-56, score-0.267]
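
The effect of the scaling factor can be sketched with a small Gibbs update for a single unit. This is an illustrative sketch, not the authors' implementation: it assumes $R_{jk} = -(G^\top G)_{jk}$, draws from the conditional on a discrete grid of candidate values (an approximation), and uses the hypothetical names `gain` for the recurrent scaling factor and `log_prior` for $f$.

```python
import math
import random


def recurrent_weight(G, j, k):
    """R_jk = -(G^T G)_jk; equals -1 for j == k when columns have unit norm."""
    return -sum(row[j] * row[k] for row in G)


def conditional_log_density(xk, k, x, y, G, sigma_y, gain, log_prior):
    """Unnormalized log conditional of unit k, with recurrent input scaled by gain."""
    feedforward = sum(G[i][k] * y[i] for i in range(len(y)))
    recurrent = sum(recurrent_weight(G, j, k) * x[j]
                    for j in range(len(x)) if j != k)
    quadratic = (feedforward + gain * recurrent) * xk - 0.5 * xk * xk
    return quadratic / sigma_y ** 2 + log_prior(xk)


def sample_unit(k, x, y, G, sigma_y, gain, log_prior, grid, rng):
    """Draw a new value for unit k from its conditional, restricted to grid."""
    log_p = [conditional_log_density(v, k, x, y, G, sigma_y, gain, log_prior)
             for v in grid]
    peak = max(log_p)  # subtract the maximum for numerical stability
    weights = [math.exp(v - peak) for v in log_p]
    return rng.choices(grid, weights=weights, k=1)[0]


# Two pixels, two units with overlapping unit-norm generative weights.
G = [[1.0, 0.6], [0.0, 0.8]]
grid = [i * 0.1 for i in range(-50, 51)]
rng = random.Random(0)
x_new = sample_unit(0, [0.0, 2.0], [1.0, 0.0], G, 1.0, 1.0,
                    lambda v: 0.0, grid, rng)
print(x_new)
```

With `gain = 1` the active second unit inhibits the first unit through the negative recurrent weight; with `gain = 0` that competition disappears and the first unit responds to the feed-forward input alone.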

57 The decoupling will result in a reduction of population sparseness, as multiple neurons become active to explain the same input. [sent-173, score-0.327]

58 Also, lifetime sparseness will decrease, as the lack of competition between units means that individual units will be active more often. [sent-174, score-1.072]

59 We analyzed the parameters of the sparse coding model at the end of learning, and substituted the Gibbs sampling posterior distribution of Eq. [sent-177, score-0.37]

60 As predicted, decreasing α leads to a decrease in all sparseness measures. [sent-180, score-0.621]

61 We argue that a similar disruption of the active sparsification process can be obtained in electrophysiological experiments by comparing neural responses at different levels of isoflurane anesthesia. [sent-181, score-0.344]

62 Figure 6: Neuronal response to a 3. [sent-62, score-0.482]

63 In general, the evoked, feed-forward responses of V1 neurons under anesthesia are thought to remain largely intact: Despite a decrease in average firing rate, the selectivity of neurons to orientation, frequency, and direction of motion has been shown to be very similar in awake and anesthetized animals [24, 25, 26]. [sent-63, score-0.569]

64 On the other hand, anesthesia disrupts contextual effects like figure-ground modulation [26] and pattern motion [27], which are known to be mediated by top-down and recurrent connections. [sent-188, score-0.388]

65 Other studies have shown that, at low concentrations, isoflurane anesthesia leaves the visual input to the cortex mostly intact, while the intracortical recurrent and top-down signals are disrupted [28, 29]. [sent-189, score-0.724]

66 Thus, if the representation in the visual cortex is optimally sparse, disrupting the active sparsification by anesthesia should decrease sparseness. [sent-190, score-0.766]

67 We analyzed multi-unit neural activity from bundles of 16 electrodes implanted in primary visual cortex of 3 adult Long-Evans rats (5-11 units per recording session, for a total of 39 units). [sent-191, score-0.796]

68 Recordings were made in the awake state and under four levels of anesthesia, from very light to deep (corresponding to concentrations of isoflurane between 0. [sent-68, score-0.317]

69 In order to confirm that the effect of the anesthetic does not prevent visual information from reaching the cortex, we presented the animals with a full-field periodic stimulus (flashing) at 3. [sent-69, score-0.269]

70 8 Hz, and defined the amplitude of the noise, due to spontaneous activity and neural variability, as the average amplitude between 1 and 3. [sent-200, score-0.329]

71 The amplitude of the evoked signal decreases with increasing isoflurane concentration, due to a decrease in overall firing rate; however, the background noise is also suppressed with anesthesia, so that overall the signal-to-noise ratio does not decrease significantly with anesthesia (Fig. [sent-202, score-0.47]

72 All three sparseness measures increase significantly with increasing concentration of isoflurane (footnote 2; Fig. [sent-72, score-0.639]

73 Contrary to what is expected in a sparse-coding system, the data suggests that the contribution of lateral and top-down connections in the awake state leads to a less sparse code. [sent-209, score-0.375]

74 7 Discussion We examined multi-electrode recordings from primary visual cortex of ferrets over development, and of rats at different levels of anesthesia. [sent-210, score-0.638]

75 We found that, contrary to predictions based on theoretical considerations regarding optimal sparse coding systems, sparseness decreases with visual experience, and increases with increasing concentration of anesthetic. [sent-211, score-1.07]

76 Footnote 2: Lifetime sparseness, TR: ANOVA with different anesthesia groups, P < 0. [sent-76, score-0.286]

77 01; multiple comparison tests with Tukey-Kramer correction show that the mean of the awake group is different from the mean of all other groups with P < 0. [sent-77, score-0.262]

78 01; multiple comparison shows the mean of the awake group is different from that of the light, medium, and deep anesthesia groups, P < 0. [sent-78, score-0.54]

79 01, multiple comparison shows the mean of the awake group is different from that of the light, medium, and deep anesthesia groups, P < 0. [sent-79, score-0.54]

80 These data suggest that the high sparseness levels that have been reported in previous accounts of sparseness in the visual cortex [7, 8, 9, 10, 11, 12], and which are otherwise consistent with our measurements (Fig. [sent-80, score-1.484]

81 3B, 5), are most likely a side effect of the high selectivity of neurons, or an overestimation due to the effect of anesthesia (Fig. [sent-220, score-0.363]

82 5; with the exception of [8], where sparseness was measured on awake animals), but do not indicate an active optimization of sparse responses (cf. [sent-221, score-1.056]

83 Our measurements of sparseness from neural data are based on multi-unit recording. [sent-223, score-0.63]

84 By collecting spikes from multiple cells, we are in fact reporting a lower bound of the true sparseness values. [sent-224, score-0.595]

85 While a precise measurement of the absolute value of these quantities would require single-unit measurement, our conclusions are based on relative comparisons of sparseness under different conditions, and are thus not affected. [sent-225, score-0.567]

86 Our theoretical predictions were verified with a common sparse coding model [3]. [sent-226, score-0.329]

87 Despite these specific choices, we expect the model results to be general to the entire class of sparse coding models. [sent-228, score-0.329]

88 Alternatively, one could assume a deterministic neural architecture, with a network dynamic that would drive the activity of the units to values that maximize the image probability [3, 30, 31]. [sent-230, score-0.331]

89 Although our analysis found no evidence for active sparsification in the primary visual cortex, ideas derived from and closely related to the sparse coding principle are likely to remain important for our understanding of visual processing. [sent-233, score-0.845]

90 Efficient coding remains a most plausible functional account of coding in more peripheral parts of the sensory pathway, and particularly in the retina, from where raw visual input has to be sent through the bottleneck formed by the optic nerve without significant loss of information [32, 33]. [sent-234, score-0.642]

91 Independent component filters of natural images compared with simple cells in primary visual cortex. [sent-267, score-0.318]

92 Responses of neurons in primary and inferior temporal visual cortices to natural scenes. [sent-291, score-0.402]

93 Sparse coding and decorrelation in primary visual cortex during natural vision. [sent-298, score-0.633]

94 Selectivity and sparseness in the responses of striate complex cells. [sent-315, score-0.7]

95 Heterogeneity in the responses of adjacent neurons to natural stimuli in cat striate cortex. [sent-323, score-0.275]

96 The sparseness of neuronal responses in ferret primary visual cortex. [sent-331, score-0.955]

97 Development of orientation selectivity in ferret visual cortex and effects of deprivation. [sent-399, score-0.455]

98 The contribution of sensory experience to the maturation of orientation selectivity in ferret visual cortex. [sent-407, score-0.44]

99 Figure-ground activity in primary visual cortex is suppressed by anesthesia. [sent-432, score-0.54]

100 Sparse coding via thresholding and local competition in neural circuits. [sent-470, score-0.301]

