nips nips2010 nips2010-157 nips2010-157-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Dan Goodman, Romain Brette
Abstract: To localise the source of a sound, we use location-specific properties of the signals received at the two ears caused by the asymmetric filtering of the original sound by our head and pinnae, the head-related transfer functions (HRTFs). These HRTFs change throughout an organism’s lifetime, during development for example, and so the required neural circuitry cannot be entirely hardwired. Since HRTFs are not directly accessible from perceptual experience, they can only be inferred from filtered sounds. We present a spiking neural network model of sound localisation based on extracting location-specific synchrony patterns, and a simple supervised algorithm to learn the mapping between synchrony patterns and locations from a set of example sounds, with no previous knowledge of HRTFs. After learning, our model was able to accurately localise new sounds in both azimuth and elevation, including the difficult task of distinguishing sounds coming from the front and back.
Keywords: Auditory Perception & Modeling (Primary); Computational Neural Models, Neuroscience, Supervised Learning (Secondary)
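The idea in the abstract, learning which synchrony pattern goes with which location from labelled example sounds, can be illustrated with a toy sketch. This is not the authors' model: it assumes each location is reduced to a pure interaural delay (real HRTFs are direction-dependent filters), it replaces spiking coincidence detectors with a correlation score, and all names (`LOCATION_DELAYS`, `DETECTOR_DELAYS`, `binaural_sound`, `detector_responses`, `train`, `localise`) are hypothetical. The learner only ever sees the sounds and their location labels, never the delays themselves.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumption: each location is modelled as a pure interaural delay (in samples).
LOCATION_DELAYS = {"left": -4, "centre": 0, "right": 4}
# Internal delays of a toy bank of coincidence detectors (hypothetical values).
DETECTOR_DELAYS = np.arange(-6, 7)


def binaural_sound(delay, n=2000):
    """Return (left, right) noise signals where right(t) = left(t - delay)."""
    source = rng.standard_normal(n + 20)
    left = source[10:10 + n]
    right = source[10 - delay:10 - delay + n]
    return left, right


def detector_responses(left, right):
    """Synchrony proxy: correlate left with right advanced by each internal delay."""
    n = len(left)
    m = int(DETECTOR_DELAYS.max())
    core = left[m:n - m]
    responses = []
    for d in DETECTOR_DELAYS:
        shifted = right[m + d:n - m + d]  # right(t + d)
        responses.append(np.dot(core, shifted) / (n - 2 * m))
    return np.array(responses)


def train(n_examples=5):
    """For each labelled location, remember the detector that responds most on average."""
    assemblies = {}
    for loc, delay in LOCATION_DELAYS.items():
        mean_resp = np.mean(
            [detector_responses(*binaural_sound(delay)) for _ in range(n_examples)],
            axis=0,
        )
        assemblies[loc] = int(np.argmax(mean_resp))
    return assemblies


def localise(left, right, assemblies):
    """Pick the location whose learned detector responds most to a new sound."""
    resp = detector_responses(left, right)
    return max(assemblies, key=lambda loc: resp[assemblies[loc]])


assemblies = train()
for loc, delay in LOCATION_DELAYS.items():
    print(loc, "->", localise(*binaural_sound(delay), assemblies))
```

In this sketch each location's "assembly" is a single detector; a fuller version would, under the same assumptions, keep one detector per frequency channel and sum their responses.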
References:
Algazi, V. R., C. Avendano, and R. O. Duda (2001, March). Elevation localization and head-related transfer function analysis at low frequencies. The Journal of the Acoustical Society of America 109(3), 1110–1122.
Brand, A., O. Behrend, T. Marquardt, D. McAlpine, and B. Grothe (2002). Precise inhibition is essential for microsecond interaural time difference coding. Nature 417(6888), 543.
Colburn, H. S. (1973, December). Theory of binaural interaction based on auditory-nerve data. I. General strategy and preliminary results on interaural discrimination. The Journal of the Acoustical Society of America 54(6), 1458–1470.
Davison, A. P. and Y. Frégnac (2006, May). Learning cross-modal spatial transformations through spike timing-dependent plasticity. J. Neurosci. 26(21), 5604–5615.
Gaik, W. (1993, July). Combined evaluation of interaural time and intensity differences: Psychoacoustic results and computer modeling. The Journal of the Acoustical Society of America 94(1), 98–110.
Gerstner, W., R. Kempter, J. L. van Hemmen, and H. Wagner (1996). A neuronal learning rule for submillisecond temporal coding. Nature 383(6595), 76.
Glasberg, B. R. and B. C. Moore (1990, August). Derivation of auditory filter shapes from notched-noise data. Hearing Research 47(1-2), 103–138. PMID: 2228789.
Goodman, D. F. M. and R. Brette (2009). The Brian simulator. Frontiers in Neuroscience 3(2), 192–197.
Goodman, D. F. M. and R. Brette (in press). Spike-timing-based computation in sound localization. PLoS Comp. Biol.
Harper, N. S. and D. McAlpine (2004). Optimal neural population coding of an auditory spatial cue. Nature 430(7000), 682–686.
Hofman, P. M., J. G. Van Riswick, and A. J. Van Opstal (1998). Relearning sound localization with new ears. Nat Neurosci 1(5), 417–421.
Jeffress, L. A. (1948, February). A place theory of sound localization. Journal of Comparative and Physiological Psychology 41(1), 35–9. PMID: 18904764.
Joris, P. and T. C. T. Yin (2007, February). A matter of time: internal delays in binaural processing. Trends in Neurosciences 30(2), 70–8. PMID: 17188761.
Joris, P. X., B. Van de Sande, D. H. Louage, and M. van der Heijden (2006). Binaural and cochlear disparities. Proceedings of the National Academy of Sciences 103(34), 12917.
Lindemann, W. (1986, December). Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals. The Journal of the Acoustical Society of America 80(6), 1608–1622.
Litovsky, R. Y., H. S. Colburn, W. A. Yost, and S. J. Guzman (1999, October). The precedence effect. The Journal of the Acoustical Society of America 106(4), 1633–1654.
Liu, J., H. Erwin, S. Wermter, and M. Elsaid (2008). A biologically inspired spiking neural network for sound localisation by the inferior colliculus. In Artificial Neural Networks - ICANN 2008, pp. 396–405.
Lorenzi, C., F. Berthommier, F. Apoux, and N. Bacri (1999, October). Effects of envelope expansion on speech recognition. Hearing Research 136(1-2), 131–138.
Macdonald, J. A. (2008, June). A localization algorithm based on head-related transfer functions. The Journal of the Acoustical Society of America 123(6), 4290–4296. PMID: 18537380.
Reed, M. C. and J. J. Blum (1990, September). A model for the computation and encoding of azimuthal information by the lateral superior olive. The Journal of the Acoustical Society of America 88(3), 1442–1453. PMID: 2229677.
Song, S. and L. F. Abbott (2001, October). Cortical development and remapping through spike timing-dependent plasticity. Neuron 32(2), 339–350.
Wagner, H., A. Asadollahi, P. Bremen, F. Endler, K. Vonderschen, and M. von Campenhausen (2007). Distribution of interaural time difference in the barn owl’s inferior colliculus in the low- and high-frequency ranges. J. Neurosci. 27(15), 4191–4200.
Witten, I. B., E. I. Knudsen, and H. Sompolinsky (2008, August). A Hebbian learning rule mediates asymmetric plasticity in aligning sensory representations. J Neurophysiol 100(2), 1067–1079.
Yin, T. C. and J. C. Chan (1990). Interaural time sensitivity in medial superior olive of cat. J Neurophysiol 64(2), 465–488.
Zahorik, P., P. Bangayan, V. Sundareswaran, K. Wang, and C. Tam (2006, July). Perceptual recalibration in human sound localization: Learning to remediate front-back reversals. The Journal of the Acoustical Society of America 120(1), 343–359.
Zhou, Y., L. H. Carney, and H. S. Colburn (2005, March). A model for interaural time difference sensitivity in the medial superior olive: Interaction of excitatory and inhibitory synaptic inputs, channel dynamics, and cellular morphology. J. Neurosci. 25(12), 3046–3058.