116 emnlp-2010-Using Universal Linguistic Knowledge to Guide Grammar Induction

Source: pdf

Author: Tahira Naseem ; Harr Chen ; Regina Barzilay ; Mark Johnson

Abstract: We present an approach to grammar induction that utilizes syntactic universals to improve dependency parsing across a range of languages. Our method uses a single set of manually-specified language-independent rules that identify syntactic dependencies between pairs of syntactic categories that commonly occur across languages. During inference of the probabilistic model, we use posterior expectation constraints to require that a minimum proportion of the dependencies we infer be instances of these rules. We also automatically refine the syntactic categories given in our coarsely tagged input. Across six languages our approach outperforms state-of-the-art unsupervised methods by a significant margin.
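The constraint described in the abstract can be illustrated with a minimal sketch. It only checks whether the expected proportion of rule-matching dependencies meets a threshold; it does not perform the constrained inference itself. The rule set, the category names, the function names, and the threshold values below are hypothetical and are not taken from the paper.

```python
from typing import Dict, Set, Tuple

# A dependency type is a (head category, dependent category) pair.
Edge = Tuple[str, str]

# Hypothetical rule set, for illustration only (not the paper's rules).
EXAMPLE_RULES: Set[Edge] = {
    ("VERB", "NOUN"),
    ("NOUN", "ADJ"),
    ("NOUN", "DET"),
}


def expected_rule_proportion(expected_counts: Dict[Edge, float],
                             rules: Set[Edge]) -> float:
    """Return the share of expected dependency mass covered by the rules.

    expected_counts maps each (head, dependent) category pair to its
    expected count under some posterior over dependency trees.
    """
    total = sum(expected_counts.values())
    if total == 0:
        return 0.0
    matched = sum(count for edge, count in expected_counts.items()
                  if edge in rules)
    return matched / total


def satisfies_constraint(expected_counts: Dict[Edge, float],
                         rules: Set[Edge],
                         min_proportion: float) -> bool:
    """Check that at least min_proportion of dependencies match a rule."""
    return expected_rule_proportion(expected_counts, rules) >= min_proportion


# Assumed expected counts for a toy posterior.
toy_counts: Dict[Edge, float] = {
    ("VERB", "NOUN"): 3.0,
    ("NOUN", "ADJ"): 1.0,
    ("ADJ", "VERB"): 1.0,
}
```

Calling `satisfies_constraint(toy_counts, EXAMPLE_RULES, 0.75)` tests the toy posterior against an assumed threshold of 75%. In the approach the abstract describes, a constraint of this form is imposed during inference rather than checked afterwards.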

reference text

Mark C. Baker. 2001. The Atoms of Language: The Mind’s Hidden Rules of Grammar. Basic Books.
Emily M. Bender. 2009. Linguistically naïve != language independent: Why NLP needs linguistic typology. In Proceedings of the EACL 2009 Workshop on the Interaction between Linguistics and Computational Linguistics: Virtuous, Vicious or Vacuous?, pages 26–32.
Taylor Berg-Kirkpatrick and Dan Klein. 2010. Phylogenetic grammar induction. In Proceedings of ACL, pages 1288–1297.
Christopher M. Bishop. 2006. Pattern Recognition and Machine Learning. Information Science and Statistics. Springer.
Sabine Buchholz and Erwin Marsi. 2006. CoNLL-X shared task on multilingual dependency parsing. In Proceedings of CoNLL, pages 149–164.
David Burkett and Dan Klein. 2008. Two languages are better than one (for syntactic parsing). In Proceedings of EMNLP, pages 877–886.
Andrew Carnie. 2002. Syntax: A Generative Introduction (Introducing Linguistics). Blackwell Publishing.
Ming-Wei Chang, Lev Ratinov, and Dan Roth. 2007. Guiding semi-supervision with constraint-driven learning. In Proceedings of ACL, pages 280–287.
Shay B. Cohen and Noah A. Smith. 2009a. Shared logistic normal distributions for soft parameter tying in unsupervised grammar induction. In Proceedings of NAACL/HLT, pages 74–82.
Shay B. Cohen and Noah A. Smith. 2009b. Variational inference for grammar induction with prior knowledge. In Proceedings of ACL/IJCNLP 2009 Conference Short Papers, pages 1–4.
Michael Collins. 1999. Head-driven statistical models for natural language parsing. Ph.D. thesis, University of Pennsylvania.
Hal Daumé III and Lyle Campbell. 2007. A Bayesian model for discovering typological implications. In Proceedings of ACL, pages 65–72.
Gregory Druck, Gideon Mann, and Andrew McCallum. 2009. Semi-supervised learning of dependency parsers using generalized expectation criteria. In Proceedings of ACL/IJCNLP, pages 360–368.
Thomas S. Ferguson. 1973. A Bayesian analysis of some nonparametric problems. Annals of Statistics, 1(2):209–230.
Jenny Rose Finkel, Trond Grenager, and Christopher D. Manning. 2007. The infinite tree. In Proceedings of ACL, pages 272–279.
Kuzman Ganchev, Jennifer Gillenwater, and Ben Taskar. 2009. Dependency grammar induction via bitext projection constraints. In Proceedings of ACL/IJCNLP, pages 369–377.
Kuzman Ganchev, João Graça, Jennifer Gillenwater, and Ben Taskar. 2010. Posterior regularization for structured latent variable models. Journal of Machine Learning Research, 11:2001–2049.
João Graça, Kuzman Ganchev, Ben Taskar, and Fernando Pereira. 2009. Posterior vs. parameter sparsity in latent variable models. In Advances in NIPS, pages 664–672.
João Graça, Kuzman Ganchev, and Ben Taskar. 2007. Expectation maximization and posterior constraints. In Advances in NIPS, pages 569–576.
Aria Haghighi and Dan Klein. 2006. Prototype-driven grammar induction. In Proceedings of ACL, pages 881–888.
William P. Headden III, Mark Johnson, and David McClosky. 2009. Improving unsupervised dependency parsing with richer contexts and smoothing. In Proceedings of NAACL/HLT, pages 101–109.
Dan Klein and Christopher Manning. 2004. Corpus-based induction of syntactic structure: Models of dependency and constituency. In Proceedings of ACL, pages 478–485.
Jonas Kuhn. 2004. Experiments in parallel-text based grammar induction. In Proceedings of ACL, pages 470–477.
Percy Liang, Slav Petrov, Michael Jordan, and Dan Klein. 2007. The infinite PCFG using hierarchical Dirichlet processes. In Proceedings of EMNLP/CoNLL, pages 688–697.
Percy Liang, Michael I. Jordan, and Dan Klein. 2009a. Learning from measurements in exponential families. In Proceedings of ICML, pages 641–648.
Percy Liang, Michael I. Jordan, and Dan Klein. 2009b. Probabilistic grammars and hierarchical Dirichlet processes. The Handbook of Applied Bayesian Analysis.
Mitchell P. Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. 1993. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313–330.
Frederick J. Newmeyer. 2005. Possible and Probable Languages: A Generative Perspective on Linguistic Typology. Oxford University Press.
Slav Petrov and Dan Klein. 2007. Learning and inference for hierarchically split PCFGs. In Proceedings of AAAI, pages 1663–1666.
Benjamin Snyder, Tahira Naseem, and Regina Barzilay. 2009. Unsupervised multilingual grammar induction. In Proceedings of ACL/IJCNLP, pages 73–81.
Lydia White. 2003. Second Language Acquisition and Universal Grammar. Cambridge University Press.