nips nips2002 nips2002-31 nips2002-31-reference knowledge-graph by maker-knowledge-mining

31 nips-2002-Application of Variational Bayesian Approach to Speech Recognition


Source: pdf

Author: Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda

Abstract: In this paper, we propose a Bayesian framework, which constructs shared-state triphone HMMs based on a variational Bayesian approach, and recognizes speech based on the Bayesian prediction classification; variational Bayesian estimation and clustering for speech recognition (VBEC). An appropriate model structure with high recognition performance can be found within a VBEC framework. Unlike conventional methods, including BIC or MDL criterion based on the maximum likelihood approach, the proposed model selection is valid in principle, even when there are insufficient amounts of data, because it does not use an asymptotic assumption. In isolated word recognition experiments, we show the advantage of VBEC over conventional methods, especially when dealing with small amounts of data.


reference text

[1] H. Attias, “A Variational Bayesian Framework for Graphical Models,” NIPS12, MIT Press, (2000).

[2] W. Chou and W. Reichl, “Decision Tree State Tying Based on Penalized Bayesian Information Criterion,” Proc. ICASSP’99, vol. 1, pp. 345-348, (1999).

[3] Z. Ghahramani and M. J. Beal, “Variational Inference for Bayesian Mixtures of Factor Analyzers,” NIPS12, MIT Press, (2000).

[4] J. J. Odell, “The Use of Context in Large Vocabulary Speech Recognition,” PhD thesis, Cambridge University, (1995).

[5] K. Shinoda and T. Watanabe, “Acoustic Modeling Based on the MDL Principle for Speech Recognition,” Proc. EuroSpeech’97, vol. 1, pp. 99-102, (1997).

[6] N. Ueda and Z. Ghahramani, “Bayesian Model Search for Mixture Models Based on Optimizing Variational Bounds,” Neural Networks, vol. 15, pp. 1223-1241, (2002).

[7] S. Watanabe et. al., “Constructing Shared-State Hidden Markov Models Based on a Bayesian Approach,” Proc. ICSLP’02, vol. 4, pp. 2669-2672, (2002).

[8] S. Waterhouse et. al., “Bayesian Methods for Mixture of Experts,” NIPS8, MIT Press, (1995).