nips nips2001 nips2001-168 nips2001-168-reference knowledge-graph by maker-knowledge-mining

168 nips-2001-Sequential Noise Compensation by Sequential Monte Carlo Method


Source: pdf

Author: K. Yao, S. Nakamura

Abstract: We present a sequential Monte Carlo method applied to additive noise compensation for robust speech recognition in time-varying noise. The method generates a set of samples according to the prior distribution given by clean speech models and noise prior evolved from previous estimation. An explicit model representing noise effects on speech features is used, so that an extended Kalman filter is constructed for each sample, generating the updated continuous state estimate as the estimation of the noise parameter, and prediction likelihood for weighting each sample. Minimum mean square error (MMSE) inference of the time-varying noise parameter is carried out over these samples by fusion the estimation of samples according to their weights. A residual resampling selection step and a Metropolis-Hastings smoothing step are used to improve calculation efficiency. Experiments were conducted on speech recognition in simulated non-stationary noises, where noise power changed artificially, and highly non-stationary Machinegun noise. In all the experiments carried out, we observed that the method can have significant recognition performance improvement, over that achieved by noise compensation with stationary noise assumption. 1


reference text

[1] A. Varga and R.K. Moore, “Hidden markov model decomposition of speech and noise,” in ICASSP, 1990, pp. 845–848.

[2] N.S. Kim, “Nonstationary environment compensation based on sequential estimation,” IEEE Signal Processing Letters, vol. 5, no. 3, March 1998.

[3] K. Yao, K. K. Paliwal, and S. Nakamura, “Sequential noise compensation by a sequential kullback proximal algorithm,” in EUROSPEECH, 2001, pp. 1139–1142, extended paper submitted for publication.

[4] K. Yao, B. E. Shi, S. Nakamura, and Z. Cao, “Residual noise compensation by a sequential em algorithm for robust speech recognition in nonstationary noise,” in ICSLP, 2000, vol. 1, pp. 770–773.

[5] B. Frey, L. Deng, A. Acero, and T. Kristjansson, “Algonquin: Iterating laplace’s method to remove multiple types of acoustic distortion for robust speech recognition,” in EUROSPEECH, 2001, pp. 901–904.

[6] J. S. Liu and R. Chen, “Sequential monte carlo methods for dynamic systems,” J. Am. Stat. Assoc, vol. 93, pp. 1032–1044, 1998.

[7] W. K. Hastings, “Monte carlo sampling methods using markov chains and their applications,” Biometrika, vol. 57, pp. 97–109, 1970.