nips nips2011 nips2011-225 nips2011-225-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Richard Turner, Maneesh Sahani
Abstract: A number of recent scientific and engineering problems require signals to be decomposed into a product of a slowly varying positive envelope and a quickly varying carrier whose instantaneous frequency also varies slowly over time. Although signal processing provides algorithms for so-called amplitude- and frequencydemodulation (AFD), there are well known problems with all of the existing methods. Motivated by the fact that AFD is ill-posed, we approach the problem using probabilistic inference. The new approach, called probabilistic amplitude and frequency demodulation (PAFD), models instantaneous frequency using an auto-regressive generalization of the von Mises distribution, and the envelopes using Gaussian auto-regressive dynamics with a positivity constraint. A novel form of expectation propagation is used for inference. We demonstrate that although PAFD is computationally demanding, it outperforms previous approaches on synthetic and real signals in clean, noisy and missing data settings. 1
[1] P. J. Loughlin and B. Tacer. On the amplitude- and frequency-modulation decomposition of signals. The Journal of the Acoustical Society of America, 100(3):1594–1601, 1996.
[2] J. L. Flanagan. Parametric coding of speech spectra. The Journal of the Acoustical Society of America, 68:412–419, 1980.
[3] P. Clark and L.E. Atlas. Time-frequency coherent modulation filtering of nonstationary signals. Signal Processing, IEEE Transactions on, 57(11):4323 –4332, nov. 2009.
[4] J. L. Flanagan and R. M. Golden. Phase vocoder. Bell System Technical Journal, pages 1493–1509, 1966.
[5] S. M. Schimmel. Theory of Modulation Frequency Analysis and Modulation Filtering, with Applications to Hearing Devices. PhD thesis, University of Washington, 2007.
[6] L. E. Atlas and C. Janssen. Coherent modulation spectral filtering for single-channel music source separation. In Proceedings of the IEEE Conference on Acoustics Speech and Signal Processing, 2005.
[7] Z. M. Smith, B. Delgutte, and A. J. Oxenham. Chimaeric sounds reveal dichotomies in auditory perception. Nature, 416(6876):87–90, 2002.
[8] J. Dugundji. Envelopes and pre-envelopes of real waveforms. IEEE Transactions on Information Theory, 4:53–57, 1958.
[9] O. Ghitza. On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception. The Journal of the Acoustical Society of America, 110(3):1628–1640, 2001.
[10] F. G. Zeng, K. Nie, S. Liu, G. Stickney, E. Del Rio, Y. Y. Kong, and H. Chen. On the dichotomy in auditory perception between temporal envelope and fine structure cues (L). The Journal of the Acoustical Society of America, 116(3):1351–1354, 2004.
[11] D. Vakman. On the analytic signal, the Teager-Kaiser energy algorithm, and other methods for defining amplitude and frequency. IEEE Journal of Signal Processing, 44(4):791–797, 1996.
[12] G. Girolami and D. Vakman. Instantaneous frequency estimation and measurement: a quasi-local method. Measurement Science and Technology, 13(6):909–917, 2002.
[13] Y. Qi, T. P. Minka, and R. W. Picard. Bayesian spectrum estimation of unevenly sampled nonstationary data. In International Conference on Acoustics, Speech, and Signal Processing, 2002.
[14] A. T. Cemgil and S. J. Godsill. Probabilistic Phase Vocoder and its application to Interpolation of Missing Values in Audio Signals. In 13th European Signal Processing Conference, Antalya/Turkey, 2005.
[15] C. Bishop. Pattern Recognition and Machine Learning. Springer, 2006.
[16] R. Gatto and S. R. Jammalamadaka. The generalized von mises distribution. Statistical Methodology, 4:341–353, 2007.
[17] G. Sell and M. Slaney. Solving demodulation as an optimization problem. IEEE Transactions on Audio, Speech and Language Processing, 18:2051–2066, November 2010.
[18] R. E. Turner and M. Sahani. Probabilistic amplitude demodulation. In Independent Component Analysis and Signal Separation, pages 544–551, 2007.
[19] R. E. Turner and M. Sahani. Statistical inference for single- and multi-band probabilistic amplitude demodulation. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 5466–5469, 2010.
[20] R. E. Turner and M. Sahani. Demodulation as probabilistic inference. IEEE Transactions on Audio, Speech and Language Processing, 2011.
[21] J. Breckling. The analysis of directional time series: Application to wind speed and direction. SpringerVerlag, 1989.
[22] J. P. Cunningham. Algorithms for Understanding Motor Cortical Processing and Neural Prosthetic Systems. PhD thesis, Stanford University, Department of Electrical Engineering, (Stanford, California, USA, 2009.
[23] T. Minka. A family of algorithms for approximate Bayesian inference. PhD thesis, MIT Media Lab, 2001.
[24] T. Heskes and O. Zoeter. Expectation propagation for approximate inference in dynamic bayesian networks. In A. Darwiche and N. Friedman, pages 216–233. Morgan Kaufmann Publishers, 2002. 9