acl acl2010 acl2010-162 acl2010-162-reference knowledge-graph by maker-knowledge-mining

162 acl-2010-Learning Common Grammar from Multilingual Corpus


Source: pdf

Author: Tomoharu Iwata ; Daichi Mochihashi ; Hiroshi Sawada

Abstract: We propose a corpus-based probabilistic framework to extract hidden common syntax across languages from non-parallel multilingual corpora in an unsupervised fashion. For this purpose, we assume a generative model for multilingual corpora, where each sentence is generated from a language dependent probabilistic contextfree grammar (PCFG), and these PCFGs are generated from a prior grammar that is common across languages. We also develop a variational method for efficient inference. Experiments on a non-parallel multilingual corpus of eleven languages demonstrate the feasibility of the proposed method.


reference text

Phil Blunsom, Trevor Cohn, and Miles Osborne. 2009. Bayesian synchronous grammar induction. In D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, editors, Advances in Neural Information Processing Systems 21, pages 161–168. Alexandre Bouchard-C oˆt´ e, Percy Liang, Thomas Griffiths, and Dan Klein. 2008. A probabilistic approach to language change. In J.C. Platt, D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information Processing Systems 20, pages 169–176, Cambridge, MA. MIT Press. Glenn Carroll and Eugene Charniak. 1992. Two experiments on learning probabilistic dependency grammars from corpora. In Working Notes of the Workshop Statistically- Based NLP Techniques, pages 1–13. AAAI. David Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. In ACL ’05: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pages 263–270, Morristown, NJ, USA. Association for Computational Linguistics. Norm Chomsky. 1965. Aspects of the Theory of Syntax. MIT Press. Shay B. Cohen and Noah A. Smith. 2009. Shared logistic normal distributions for soft parameter tying in unsupervised grammar induction. In NAACL ’09: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 74–82, Morristown, NJ, USA. Association for Computational Linguistics. Hal Daum e´ III. 2009. Bayesian multitask learning with latent hierarchies. In Proceedings of the Twenty-Fifth Annual Conference on Uncertainty in Artificial Intelligence (UAI-09), pages 135–142, Corvallis, Oregon. AUAI Press. Jason Eisner. 2003. Learning non-isomorphic tree mappings for machine translation. In ACL ’03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, pages 205–208, Morristown, NJ, USA. Association for Computational Linguistics. Russell D. Gray and Quentin D. Atkinson. 2003. Languagetree divergence times support the Anatolian theory of Indo-European November. origin. Nature, 426(6965):435–439, Mark Johnson, Thomas Griffiths, and Sharon Goldwater. 2007. Bayesian inference for PCFGs via Markov chain Monte Carlo. In Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference, pages 139–146, Rochester, New York, April. Association for Computational Linguistics. J. F. C. Kingman. 1982. The coalescent. Stochastic Processes and their Applications, 13:235–248. Dan Klein and Christopher D. Manning. 2002. A generative constituent-context model for improved grammar induction. In ACL ’02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pages 128–135, Morristown, NJ, USA. Association for Computational Linguistics. Dan Klein and Christopher D. Manning. 2004. Corpusbased induction of syntactic structure: models of dependency and constituency. In ACL ’04: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, page 478, Morristown, NJ, USA. Association for Computational Linguistics. Philipp Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. In Proceedings of the 10th Machine Translation Summit, pages 79–86. Kenichi Kurihara and Taisuke Sato. 2004. An applica- tion of the variational Bayesian approach to probabilistic context-free grammars. In International Joint Conference on Natural Language Processing Workshop Beyond Shallow Analysis. K. Lari and S.J. Young. 1990. The estimation of stochastic context-free grammars using the inside-outside algorithm. Computer Speech and Language, 4:35–56. Percy Liang, Slav Petrov, Michael I. Jordan, and Dan Klein. 2007. The infinite PCFG using hierarchical dirichlet processes. In EMNLP ’07: Proceedings of the Empirical Methods on Natural Language Processing, pages 688– 697. I. Dan Melamed. 2003. Multitext grammars and synchronous parsers. In NAACL ’03: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, pages 79–86, Morristown, NJ, USA. Association for Computational Linguistics. Thomas Minka. 2000. Estimating a Dirichlet distribution. Technical report, M.I.T. Michael P. Oakes. 2000. Computer estimation of vocabulary in a protolanguage from word lists in four daughter languages. Journal of Quantitative Linguistics, 7(3):233– 243. Steven Pinker. 1994. The Language Instinct: How the Mind Creates Language. HarperCollins, New York. Benjamin Snyder, Tahira Naseem, and Regina Barzilay. 2009. Unsupervised multilingual grammar induction. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pages 73–81, Suntec, Singapore, August. Association for Computational Linguistics. Andreas Stolcke and Stephen M. Omohundro. 1994. Inducing probabilistic grammars by Bayesian model merging. In ICGI ’94: Proceedings of the Second International Colloquium on Grammatical Inference and Applications, pages 106–1 18, London, UK. Springer-Verlag. Dekai Wu. 1997. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Comput. Linguist., 23(3):377–403. Kai Yu, Volker Tresp, and Anton Schwaighofer. 2005. Learning gaussian processes from multiple tasks. In ICML ’05: Proceedings of the 22nd International Conference on Machine Learning, pages 1012–1019, New York, NY, USA. ACM. 188