acl acl2011 acl2011-192 acl2011-192-reference knowledge-graph by maker-knowledge-mining

192 acl-2011-Language-Independent Parsing with Empty Elements

Source: pdf

Author: Shu Cai ; David Chiang ; Yoav Goldberg

Abstract: We present a simple, language-independent method for integrating recovery of empty elements into syntactic parsing. This method outperforms the best published method we are aware of on English and a recently published method on Chinese.

reference text

E. Black, S. Abney, D. Flickinger, C. Gdaniec, R. Grishman, P. Harrison, D. Hindle, R. Ingria, F. Jelinek, J. Klavans, M. Liberman, M. Marcus, S . Roukos, B . Santorini, and T. Strzalkowski. 1991 . A procedure for quantitatively comparing the syntactic coverage of English grammars. In Proc. DARPA Speech and Natural Language Workshop. Richard Campbell. 2004. Using linguistic principles to recover empty categories. In Proc. ACL. J.-C. Chappelier, M. Rajman, R. Aragu¨es, and A. Rozenknop. 1999. Lattice parsing for speech recognition. In Proc. Traitement Automatique du Langage Naturel (TALN). Tagyoung Chung and Daniel Gildea. 2010. Effects of empty categories on machine translation. In Proc. EMNLP. Pe´ter Dienes and Amit Dubey. 2003. Antecedent recovery: Experiments with a trace tagger. In Proc. EMNLP. Ryan Gabbard, Seth Kulick, and Mitchell Marcus. 2006. Fully parsing the Penn Treebank. In Proc. NAACL HLT. Yoav Goldberg and Michael Elhadad. 2011 . Joint Hebrew segmentation and parsing using a PCFG-LA lattice parser. In Proc. of ACL. Yoav Goldberg and Reut Tsarfaty. 2008. A single generative model for joint morphological segmentation and syntactic parsing. In Proc. of ACL. Spence Green and Christopher D. Manning. 2010. Better Arabic parsing: Baselines, evaluations, and analysis. In Proc of COLING-2010. Keith B . Hall. 2005 . Best-first word-lattice parsing: techniques for integrated syntactic language modeling. Ph.D. thesis, Brown University, Providence, RI, USA. Mark Johnson. 2002. A simple pattern-matching algorithm for recovering empty nodes and their antecedents. In Proc. ACL. Mitchell P. Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. 1993. Building a large annotated corpus of English: the Penn Treebank. Computational Linguistics, 19:3 13–330. 216 Slav Petrov, Leon Barrett, Romain Thibaux, and Dan Klein. 2006. Learning accurate, compact, and interpretable tree annotation. In Proc. COLING-ACL. Brian Roark, Mary Harper, Eugene Charniak, Bonnie Dorr, Mark Johnson, Jeremy G. Kahn, Yang Liu, Mari Ostendorf, John Hale, Anna Krasnyanskaya, Matthew Lease, Izhak Shafran, Matthew Snover, Robin Stewart, and Lisa Yung. 2006. SParseval: Evaluation metrics for parsing speech. In Proc. LREC. Helmut Schmid. 2006. Trace prediction and recovery with unlexicalized PCFGs and slash features. In Proc. COLING-ACL. Nianwen Xue, Fei Xia, Fu-dong Chiou, and Martha Palmer. 2005. The Penn Chinese TreeBank: Phrase structure annotation of a large corpus. Natural Language Engineering, 11(2):207–238. Yaqin Yang and Nianwen Xue. 2010. Chasing the ghost: recovering empty categories in the Chinese Treebank. In Proc. COLING.