acl acl2013 acl2013-144 acl2013-144-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Matt Post; Shane Bergsma
Abstract: Syntactic features are useful for many text classification tasks. Among these, tree kernels (Collins and Duffy, 2001) have been perhaps the most robust and effective syntactic tool, appealing for their empirical success, but also because they do not require an answer to the difficult question of which tree features to use for a given task. We compare tree kernels to different explicit sets of tree features on five diverse tasks, and find that explicit features often perform as well as tree kernels on accuracy and always in orders of magnitude less time, and with smaller models. Since explicit features are easy to generate and use (with publicly available tools), we suggest they should always be included as baseline comparisons in tree kernel method evaluations.
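To make the contrast concrete, here is a minimal sketch of one possible explicit tree feature set: counts of depth-one CFG productions read off a bracketed parse. The abstract does not say which feature sets the paper evaluates, so this choice is an assumption, and the helper names `parse_tree` and `cfg_rule_features` are hypothetical.

```python
from collections import Counter


def parse_tree(s):
    """Parse a Penn Treebank-style bracketed string into (label, children) tuples.

    Hypothetical helper for illustration; assumes well-formed input.
    """
    tokens = s.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read():
        nonlocal pos
        assert tokens[pos] == "("
        pos += 1
        label = tokens[pos]
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(read())       # nested constituent
            else:
                children.append(tokens[pos])  # a word (leaf)
                pos += 1
        pos += 1  # consume ")"
        return (label, children)

    return read()


def cfg_rule_features(tree):
    """Count depth-one productions such as 'S -> NP VP'.

    Lexical rules (preterminal -> word) are skipped so the features stay
    purely syntactic. This is an illustrative assumption, not the paper's
    stated feature definition.
    """
    counts = Counter()

    def visit(node):
        label, children = node
        if any(isinstance(c, tuple) for c in children):
            rhs = [c[0] if isinstance(c, tuple) else c for c in children]
            counts[f"{label} -> {' '.join(rhs)}"] += 1
        for c in children:
            if isinstance(c, tuple):
                visit(c)

    visit(tree)
    return counts


features = cfg_rule_features(
    parse_tree("(S (NP (DT the) (NN dog)) (VP (VBZ barks)))")
)
```

Sparse counts like these can be passed directly to a linear classifier. A tree kernel in the style of Collins and Duffy (2001) instead compares pairs of trees through the subtrees they share, without ever listing those subtrees as explicit features.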
References:
Harald Baayen, Hans Van Halteren, and Fiona Tweedie. 1996. Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution. Literary and Linguistic Computing, 11(3):121.
Shane Bergsma, Matt Post, and David Yarowsky. 2012. Stylometric analysis of scientific articles. In Proc. of NAACL-HLT, pages 327–337, Montréal, Canada, June. Association for Computational Linguistics.
Rens Bod. 1993. Using an annotated corpus as a stochastic grammar. In Proc. of ACL, Columbus, Ohio, USA, June.
Eugene Charniak and Mark Johnson. 2005. Coarse-to-fine n-best parsing and MaxEnt discriminative reranking. In Proc. of ACL, pages 173–180, Ann Arbor, Michigan, USA, June.
Colin Cherry and Chris Quirk. 2008. Discriminative, syntactic language modeling through latent SVMs. In Proc. of AMTA, Waikiki, Hawaii, USA, October.
Michael Collins and Nigel Duffy. 2001. Convolution kernels for natural language. In Proc. of NIPS.
Michael Collins and Nigel Duffy. 2002. New ranking algorithms for parsing and tagging: kernels over discrete structures, and the voted perceptron. In Proc. of ACL, pages 173–180, Philadelphia, Pennsylvania, USA, July.
Aron Culotta and Jeffrey Sorensen. 2004. Dependency tree kernels for relation extraction. In Proc. of ACL, pages 423–429.
Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. 2008. LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research, 9:1871–1874.
Jennifer Foster and Øistein E. Andersen. 2009. GenERRate: Generating errors for use in grammatical error detection. In Proceedings of the fourth workshop on innovative use of NLP for building educational applications, pages 82–90.
Sylviane Granger, Estelle Dagneaux, Fanny Meunier, and Magali Paquot. 2009. The International Corpus of Learner English. Version 2. Handbook and CD-Rom.
Moshe Koppel, Shlomo Argamon, and Anat Rachel Shimoni. 2003. Automatically categorizing written texts by author gender. Literary and Linguistic Computing, 17(4):401–412.
Xin Li and Dan Roth. 2002. Learning question classifiers. In Proc. of COLING, pages 1–7.
Ding Liu and Daniel Gildea. 2005. Syntactic features for evaluation of machine translation. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, pages 25–32.
Mitchell P. Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini. 1993. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):330.
Alessandro Moschitti, Silvia Quarteroni, Roberto Basili, and Suresh Manandhar. 2007. Exploiting syntactic and shallow semantic kernels for question answer classification. In Proc. of ACL, pages 776–783, Prague, Czech Republic, June.
Alessandro Moschitti. 2004. A study on convolution kernels for shallow semantic parsing. In Proc. of ACL.
Alessandro Moschitti. 2006. Making tree kernels practical for natural language learning. In Proc. of EACL, volume 6, pages 113–120.
Frederick Mosteller and David L. Wallace. 1984. Applied Bayesian and Classical Inference: The Case of the Federalist Papers. Springer-Verlag.
Daisuke Okanohara and Jun’ichi Tsujii. 2007. A discriminative language model with pseudo-negative samples. In Proc. of ACL, Prague, Czech Republic, June.
Slav Petrov, Leon Barrett, Romain Thibaux, and Dan Klein. 2006. Learning accurate, compact, and interpretable tree annotation. In Proc. of ACL, Sydney, Australia, July.
Xuan-Hieu Phan. 2006. CRFTagger: CRF English POS Tagger. crftagger.sourceforge.net.
Daniele Pighin and Alessandro Moschitti. 2009. Reverse engineering of tree kernel feature spaces. In Proc. of EMNLP, pages 111–120, Singapore, August.
Matt Post and Daniel Gildea. 2009. Bayesian learning of a tree substitution grammar. In Proc. of ACL (short paper track), Suntec, Singapore, August.
Matt Post. 2011. Judging grammaticality with tree substitution grammar derivations. In Proc. of ACL, Portland, Oregon, USA, June.
Dragomir R. Radev, Pradeep Muthukrishnan, and Vahed Qazvinian. 2009. The ACL anthology network corpus. In Proc. of ACL Workshop on Natural Language Processing and Information Retrieval for Digital Libraries, pages 54–61.
Remko Scha. 1990. Taaltheorie en taaltechnologie; competence en performance. In R. de Kort and G.L.J. Leerdam, editors, Computertoepassingen in de neerlandistiek, pages 7–22, Almere, the Netherlands. De Vereniging.
Aliaksei Severyn and Alessandro Moschitti. 2010. Large-scale support vector learning with structural kernels. In Proc. of ECML/PKDD, pages 229–244.
Libin Shen and Aravind K. Joshi. 2003. An SVM-based voting algorithm with application to parse reranking. In Proc. of CoNLL, pages 9–16.
Jun Suzuki, Hideki Isozaki, and Eisaku Maeda. 2004. Convolution kernels with feature selection for natural language processing tasks. In Proc. of ACL, pages 119–126.
Benjamin Swanson and Eugene Charniak. 2012. Native language detection with tree substitution grammars. In Proc. of ACL (short papers), pages 193–197, Jeju Island, Korea, July.
Joel Tetreault, Daniel Blanchard, Aoife Cahill, and Martin Chodorow. 2012. Native tongues, lost and found: Resources and empirical evaluations in native language identification. In Proc. of COLING, pages 2585–2602, Mumbai, India, December.
Laura Mayfield Tomokiyo and Rosie Jones. 2001. You’re not from ’round here, are you? Naive Bayes detection of non-native utterances. In Proc. of NAACL.
Vladimir N. Vapnik. 1998. Statistical Learning Theory. John Wiley & Sons.
Zhuang Wang, Koby Crammer, and Slobodan Vucetic. 2010. Multi-class pegasos on a budget. In ICML, pages 1143–1150.
Sze-Meng Jojo Wong and Mark Dras. 2010. Parser features for sentence grammaticality classification. In Proceedings of the Australasian Language Technology Association Workshop, Melbourne, Australia, December.
Sze-Meng Jojo Wong and Mark Dras. 2011. Exploiting parse structures for native language identification. In Proc. of EMNLP, pages 1600–1610, Edinburgh, Scotland, UK, July.
Xiaofeng Yang, Jian Su, and Chew Lim Tan. 2006. Kernel-based pronoun resolution with structured syntactic knowledge. In Proc. of Coling-ACL, pages 41–48.
Dell Zhang and Wee Sun Lee. 2003. Question classification using support vector machines. In Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’03, pages 26–32, New York, NY, USA. ACM.