acl acl2013 acl2013-70 acl2013-70-reference knowledge-graph by maker-knowledge-mining

70 acl-2013-Bilingually-Guided Monolingual Dependency Grammar Induction

Source: pdf

Author: Kai Liu ; Yajuan Lu ; Wenbin Jiang ; Qun Liu

Abstract: This paper describes a novel strategy for automatic induction of a monolingual dependency grammar under the guidance of bilingually-projected dependency. By moderately leveraging the dependency information projected from the parsed counterpart language, and simultaneously mining the underlying syntactic structure of the language considered, it effectively integrates the advantages of bilingual projection and unsupervised induction, so as to induce a monolingual grammar much better than previous models only using bilingual projection or unsupervised induction. We induced dependency gram- mar for five different languages under the guidance of dependency information projected from the parsed English translation, experiments show that the bilinguallyguided method achieves a significant improvement of 28.5% over the unsupervised baseline and 3.0% over the best projection baseline on average.

reference text

H. Alshawi. 1996. Head automata for speech translation. In Proc. of ICSLP. James K Baker. 1979. Trainable grammars for speech recognition. The Journal of the Acoustical Society of America, 65:S132. T. Berg-Kirkpatrick, A. Bouchard-C oˆt´ e, J. DeNero, and D. Klein. 2010. Painless unsupervised learning with features. In HLT: NAACL, pages 582–590. Rens Bod. 2006. An all-subtrees approach to unsupervised parsing. In Proc. of the 21st ICCL and the 44th ACL, pages 865–872. S. Buchholz and E. Marsi. 2006. Conll-x shared task on multilingual dependency parsing. In Proc. of the 2002 Conference on EMNLP. Proc. CoNLL. Eugene Charniak and Mark Johnson. 2005. Coarseto-fine n-best parsing and maxent discriminative reranking. In Proc. of the 43rd ACL, pages 173–180, Ann Arbor, Michigan, June. W. Chen, J. Kazama, and K. Torisawa. 2010. Bitext dependency parsing with bilingual subtree constraints. In Proc. of ACL, pages 21–29. S.B. Cohen, D. Das, and N.A. Smith. 2011. Unsupervised structure prediction with non-parallel multilingual guidance. In Proc. of the Conference on EMNLP, pages 50–61. Michael Collins. 2002. Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In Proc. of the 2002 Conference on EMNLP, pages 1–8, July. Michael Collins. 2003. Head-driven statistical models for natural language parsing. In Computational Linguistics. D. Das and S. Petrov. 2011. Unsupervised part-ofspeech tagging with bilingual graph-based projections. In Proc. of ACL. K. Ganchev, J. Gillenwater, and B. Taskar. 2009. De- pendency grammar induction via bitext projection constraints. In Proc. of IJCNLP of the AFNLP: Volume 1-Volume 1, pages 369–377. R. Hwa, P. Resnik, A. Weinberg, and O. Kolak. 2002. Evaluating translational correspondence using annotation projection. In Proc. of ACL, pages 392–399. R. Hwa, M. Osborne, A. Sarkar, and M. Steedman. 2003. Corrected co-training for statistical parsers. In ICML-03 Workshop on the Continuum from Labeled to Unlabeled Data in Machine Learning and Data Mining, Washington DC. R. Hwa, P. Resnik, A. Weinberg, C. Cabezas, and O. Kolak. 2005. Bootstrapping parsers via syntactic projection across parallel texts. Natural language engineering, 11(3):3 11–325. W. Jiang and Q. Liu. 2010. Dependency parsing and projection based on word-pair classification. In Proc. of ACL, pages 12–20. D. Klein and C.D. Manning. 2004. Corpus-based induction of syntactic structure: Models of dependency and constituency. In Proc. of ACL, page 478. Terry Koo and Michael Collins. 2010. Efficient thirdorder dependency parsers. In Proc. of the 48th ACL, pages 1–1 1, July. T. Koo, X. Carreras, and M. Collins. 2008. Simple semi-supervised dependency parsing. pages 595– 603. R. McDonald and F. Pereira. 2006. Online learning of approximate dependency parsing algorithms. In Proc. of the 11th Conf. of EACL. R. McDonald, K. Crammer, and F. Pereira. 2005a. Online large-margin training of dependency parsers. In Proc. of ACL, pages 91–98. R. McDonald, F. Pereira, K. Ribarov, and J. Haji cˇ. 2005b. Non-projective dependency parsing using spanning tree algorithms. In Proc. of EMNLP, pages 523–530. R. McDonald, K. Lerman, and F. Pereira. 2006. Multilingual dependency analysis with a two-stage discriminative parser. In Proc. of CoNLL, pages 216– 220. 1071 R. McDonald, S. Petrov, and K. Hall. 2011. Multisource transfer of delexicalized dependency parsers. In Proc. of EMNLP, pages 62–72. ACL. T. Naseem, B. Snyder, J. Eisenstein, and R. Barzilay. 2009. Multilingual part-of-speech tagging: Two unsupervised approaches. Journal of Artificial Intelligence Research, 36(1):341–385. Tahira Naseem, Regina Barzilay, and Amir Globerson. 2012. Selective sharing for multilingual dependency parsing. In Proc. of the 50th ACL, pages 629–637, July. J. Nivre, J. Hall, J. Nilsson, G. Eryi g˜it, and S. Marinov. 2006. Labeled pseudo-projective dependency parsing with support vector machines. In Proc. of CoNLL, pages 221–225. J. Nivre, J. Hall, J. Nilsson, A. Chanev, G. Eryigit, S. K ¨ubler, S. Marinov, and E. Marsi. 2007. Maltparser: A language-independent system for datadriven dependency parsing. Natural Language Engineering, 13(02):95–135. Slav Petrov, Leon Barrett, Romain Thibaux, and Dan Klein. 2006. Learning accurate, compact, and interpretable tree annotation. In Proc. of the 21st ICCL & 44th ACL, pages 433–440, July. A. Sarkar. 2001. Applying co-training methods to statistical parsing. In Proc. of NAACL, pages 1–8. L. Shen, G. Satta, and A. Joshi. 2007. Guided learning for bidirectional sequence classification. In Annual Meeting-, volume 45, page 760. N.A. Smith and J. Eisner. 2005. Contrastive estimation: Training log-linear models on unlabeled data. In Proc. of ACL, pages 354–362. D.A. Smith and J. Eisner. 2009. Parser adaptation and projection with quasi-synchronous grammar features. In Proc. of EMNLP: Volume 2-Volume 2, pages 822–83 1. B. Snyder, T. Naseem, and R. Barzilay. 2009. Unsupervised multilingual grammar induction. In Proc. ofIJCNLP ofthe AFNLP: Volume 1-Volume 1, pages 73–81. Anders Søgaard. 2011. Data point selection for crosslanguage adaptation of dependency parsers. In Proc. of the 49th ACL: HLT, pages 682–686. Valentin I. Spitkovsky, Hiyan Alshawi, and Daniel Jurafsky. 2010. From baby steps to leapfrog: How “less is more” in unsupervised dependency parsing. In HLT: NAACL, pages 751–759, June. O. T¨ ackstr o¨m, R. McDonald, and J. Uszkoreit. 2012. Cross-lingual word clusters for direct transfer of linguistic structure. William, M. Johnson, and D. McClosky. 2009. Improving unsupervised dependency parsing with richer contexts and smoothing. In Proc. of NAACL, pages 101–109. D. Yarowsky, G. Ngai, and R. Wicentowski. 2001 . Inducing multilingual text analysis tools via robust projection across aligned corpora. In Proc. of HLT, pages 1–8. Daniel Zeman and Philip Resnik. 2008. Crosslanguage parser adaptation between related languages. In Proc. of the IJCNLP-08. Proc. CoNLL. Ciyou Zhu, Richard H Byrd, Peihuang Lu, and Jorge Nocedal. 1997. Algorithm 778: L-bfgs-b: Fortran subroutines for large-scale bound-constrained optimization. ACM Transactions on Mathematical Software (TOMS), 23(4):550–560. 1072