91 acl-2010-Domain Adaptation of Maximum Entropy Language Models


Source: pdf

Author: Tanel Alumae ; Mikko Kurimo

Abstract: We investigate a recently proposed Bayesian adaptation method for building style-adapted maximum entropy language models for speech recognition, given a large corpus of written language data and a small corpus of speech transcripts. Experiments show that the method consistently outperforms linear interpolation, which is typically used in such cases.


reference text

Ciprian Chelba and Alex Acero. 2006. Adaptation of maximum entropy capitalizer: Little data can help a lot. Computer Speech & Language, 20(4):382–399, October.

S. F. Chen and R. Rosenfeld. 2000. A survey of smoothing techniques for ME models. IEEE Transactions on Speech and Audio Processing, 8(1):37–50.

S. F. Chen. 2009a. Performance prediction for exponential language models. In Proceedings of HLT-NAACL, pages 450–458, Boulder, Colorado.

Stanley F. Chen. 2009b. Shrinking exponential language models. In Proceedings of HLT-NAACL, pages 468–476, Boulder, Colorado.

H. Daume III. 2007. Frustratingly easy domain adaptation. In Proceedings of ACL, pages 256–263.

P. Deléglise, Y. Estève, S. Meignier, and T. Merlin. 2005. The LIUM speech transcription system: a CMU Sphinx III-based system for French broadcast news. In Proceedings of Interspeech, Lisboa, Portugal.

J. R. Finkel and Ch. Manning. 2009. Hierarchical Bayesian domain adaptation. In Proceedings of HLT-NAACL, pages 602–610, Boulder, Colorado.

J. Goodman. 2001. Classes for fast maximum entropy training. In Proceedings of ICASSP, Utah, USA.

H.-J. Kaalep and T. Vaino. 2001. Complete morphological analysis in the linguist’s toolbox. In Congressus Nonus Internationalis Fenno-Ugristarum Pars V, pages 9–16, Tartu, Estonia.

R. Kneser and H. Ney. 1993. Improved clustering techniques for class-based statistical language modelling. In Proceedings of the European Conference on Speech Communication and Technology, pages 973–976.

R. Rosenfeld. 1996. A maximum entropy approach to adaptive statistical language modeling. Computer, Speech and Language, 10:187–228.

J. Wu and S. Khudanpur. 2002. Building a topic-dependent maximum entropy model for very large corpora. In Proceedings of ICASSP, Orlando, Florida, USA.