
Exploiting Discourse Analysis for Article-Wide Temporal Classification (EMNLP 2013, paper 76)



Authors: Jun-Ping Ng; Min-Yen Kan; Ziheng Lin; Wei Feng; Bin Chen; Jian Su; Chew Lim Tan

Abstract: We classify the temporal relations between pairs of events on an article-wide basis. This contrasts with much of the existing literature, which focuses only on event pairs found within the same or adjacent sentences. To achieve this, we leverage discourse analysis, which we believe provides more useful semantic information than typical lexico-syntactic features. We propose the use of several discourse analysis frameworks: 1) Rhetorical Structure Theory (RST), 2) PDTB-styled discourse relations, and 3) topical text segmentation. We explain how features derived from these frameworks can be used effectively with support vector machines (SVMs) paired with convolution kernels. Experiments show that our proposal improves significantly on the state of the art, by as much as 16% in terms of F1, even when we adopt only less-than-perfect automatic discourse analyzers and parsers. Making use of more accurate discourse analysis can further boost gains to 35%.
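
As an illustration of what a convolution kernel over tree-structured features computes, the following is a minimal sketch of the subset-tree kernel in the style of Collins and Duffy (2001). It is not the authors' implementation: the abstract does not specify the kernel variant or feature encoding, so the tuple-based tree representation, the function names, the `decay` parameter, and the toy discourse-like labels are all assumptions made for this example. The kernel counts the tree fragments two trees share; an SVM would then use such values as similarities between event-pair instances.

```python
from functools import lru_cache

# Hypothetical tree encoding: a node is a tuple (label, child, child, ...);
# a leaf is a plain string.

def production(node):
    """Label of a node plus the (label, is_leaf) signature of each child."""
    children = tuple(
        (c, True) if isinstance(c, str) else (c[0], False) for c in node[1:]
    )
    return (node[0], children)

def internal_nodes(tree):
    """All non-leaf nodes of the tree, in pre-order."""
    if isinstance(tree, str):
        return []
    nodes = [tree]
    for child in tree[1:]:
        nodes.extend(internal_nodes(child))
    return nodes

def subset_tree_kernel(t1, t2, decay=1.0):
    """Sum, over all node pairs, of the (decayed) count of shared fragments."""

    @lru_cache(maxsize=None)
    def common(n1, n2):
        # Nodes with different productions share no fragment rooted here.
        if production(n1) != production(n2):
            return 0.0
        score = decay
        # Equal productions guarantee that children align position by position.
        for c1, c2 in zip(n1[1:], n2[1:]):
            if not isinstance(c1, str):
                score *= 1.0 + common(c1, c2)
        return score

    return sum(
        common(a, b) for a in internal_nodes(t1) for b in internal_nodes(t2)
    )

# Toy trees with made-up discourse-like labels (illustrative only).
tree_a = ("Contrast", ("Nucleus", "said"), ("Satellite", "denied"))
tree_b = ("Contrast", ("Nucleus", "said"), ("Satellite", "agreed"))

self_similarity = subset_tree_kernel(tree_a, tree_a)
cross_similarity = subset_tree_kernel(tree_a, tree_b)
```

With `decay=1.0` the value is a raw count of shared fragments; a decay below 1 down-weights larger fragments, which is a common way to keep the kernel from being dominated by tree size.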


reference text

ACE. 2005. The ACE 2005 (ACE05) Evaluation Plan. October.
Regina Barzilay, Noemie Elhadad, and Kathleen McKeown. 2002. Inferring Strategies for Sentence Ordering in Multidocument News Summarization. Journal of Artificial Intelligence Research (JAIR), 17:35–55.
Steven Bethard and James H. Martin. 2007. CU-TMP: Temporal Relation Classification Using Syntactic and Semantic Features. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval), pages 129–132, June.
Lynn Carlson and Daniel Marcu. 2001. Discourse tagging manual. Technical Report ISI-TR-545, Information Sciences Institute, University of Southern California, July.
Bin Chen, Jian Su, Sinno Jialin Pan, and Chew Lim Tan. 2011. A Unified Event Coreference Resolution by Integrating Multiple Resolvers. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP), pages 102–110, November.
Timothy Chklovski and Patrick Pantel. 2004. VerbOcean: Mining the Web for Fine-Grained Semantic Verb Relations. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 33–40, July.
Michael Collins and Nigel Duffy. 2001. Convolution Kernels for Natural Language. In Proceedings of NIPS.
Quang Xuan Do, Wei Lu, and Dan Roth. 2012. Joint Inference for Event Timeline Construction. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP), pages 677–689, July.
Jennifer D’Souza and Vincent Ng. 2013. Classifying Temporal Relations with Rich Linguistic Knowledge. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pages 918–927, June.
Vanessa Wei Feng and Graeme Hirst. 2012. Text-level Discourse Parsing with Rich Linguistic Features. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL), pages 60–68, July.
Eun Young Ha, Alok Baikadi, Carlyle Licata, and James C. Lester. 2010. NCSU: Modeling Temporal Relations with Markov Logic and Lexical Ontology. In Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval), pages 341–344, July.
Marti A. Hearst. 1994. Multi-Paragraph Segmentation of Expository Text. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics (ACL), pages 9–16, June.
Anna Kazantseva and Stan Szpakowicz. 2011. Linear Text Segmentation Using Affinity Propagation. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 284–293, July.
Anup Kumar Kolya, Asif Ekbal, and Sivaji Bandyopadhyay. 2010. JU CSE TEMP: A First Step Towards Evaluating Events, Time Expressions and Temporal Relations. In Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval), pages 345–350, July.
Alex Lascarides and Nicholas Asher. 1993. Temporal Interpretation, Discourse Relations and Commonsense Entailment. Linguistics and Philosophy, 16(5):437–493.
Ziheng Lin, Hwee Tou Ng, and Min-Yen Kan. 2013. A PDTB-styled End-to-End Discourse Parser. Natural Language Engineering, FirstView:1–34, February.
Inderjeet Mani, Marc Verhagen, Ben Wellner, Chong Min Lee, and James Pustejovsky. 2006. Machine Learning of Temporal Relations. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (ACL), pages 753–760, July.
William C. Mann and Sandra A. Thompson. 1988. Rhetorical Structure Theory: Toward a Functional Theory of Text Organization. Text, 8(3):243–281.
Daniel Marcu. 1997. From Discourse Structures to Text Summaries. In Proceedings of the ACL Workshop on Intelligent Scalable Text Summarization, volume 97, pages 82–88, July.
Alessandro Moschitti. 2006. Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees. In Proceedings of the 17th European Conference on Machine Learning (ECML), September.
Jun-Ping Ng and Min-Yen Kan. 2012. Improved Temporal Relation Classification using Dependency Parses and Selective Crowdsourced Annotations. In Proceedings of the International Conference on Computational Linguistics (COLING), pages 2109–2124, December.
Daniele Pighin and Alessandro Moschitti. 2010. On Reverse Feature Engineering of Syntactic Tree Kernels. In Proceedings of the 14th Conference on Natural Language Learning (CoNLL), August.
Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind Joshi, and Bonnie Webber. 2008. The Penn Discourse TreeBank 2.0. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC), May.
Andrea Setzer, Robert Gaizauskas, and Mark Hepple. 2003. Using Semantic Inferences for Temporal Annotation Comparison. In Proceedings of the 4th International Workshop on Inference in Computational Semantics (ICoS), September.
Eduard F. Skorochod’Ko. 1972. Adaptive Method of Automatic Abstracting and Indexing. In Proceedings of the IFIP Congress, pages 1179–1182.
Carlota S. Smith. 2010. Temporal Structures in Discourse. Text, Time, and Context, 87:285–302.
Simone Teufel and Min-Yen Kan. 2011. Robust Argumentative Zoning for Sensemaking in Scholarly Documents. In Advanced Language Technologies for Digital Libraries, pages 154–170. Springer.
Naushad Uzzaman and James F. Allen. 2010. TRIPS and TRIOS System for TempEval-2: Extracting Temporal Information. In Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval), pages 276–283, July.
Naushad Uzzaman, Hector Llorens, James F. Allen, Leon Derczynski, Marc Verhagen, and James Pustejovsky. 2012. TempEval-3: Evaluating Events, Time Expressions, and Temporal Relations. Computing Research Repository (CoRR), abs/1206.5333.
Vladimir N. Vapnik. 1999. The Nature of Statistical Learning Theory, chapter 5. Springer.
Marc Verhagen, Robert Gaizauskas, Frank Schilder, Mark Hepple, Jessica Moszkowicz, and James Pustejovsky. 2009. The TempEval Challenge: Identifying Temporal Relations in Text. Language Resources and Evaluation, 43(2):161–179.
Marc Verhagen, Roser Sauri, Tommaso Caselli, and James Pustejovsky. 2010. Semeval-2010 task 13: Tempeval-2. In Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval), pages 57–62, July.
Marc Verhagen. 2005. Temporal Closure in an Annotation Environment. Language Resources and Evaluation, 39(2-3):211–241.
Bonnie Webber. 2004. D-LTAG: Extending Lexicalized TAG to Discourse. Cognitive Science, 28(5):751–779.
Katsumasa Yoshikawa, Sebastian Riedel, Masayuki Asahara, and Yuji Matsumoto. 2009. Jointly Identifying Temporal Relations with Markov Logic. In Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics (ACL) and the 4th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (AFNLP), pages 405–413, August.