
99 acl-2012-Finding Salient Dates for Building Thematic Timelines


Source: pdf

Author: Remy Kessler ; Xavier Tannier ; Caroline Hagege ; Veronique Moriceau ; Andre Bittar

Abstract: We present an approach for detecting salient (important) dates in texts in order to automatically build event timelines from a search query (e.g. the name of an event or a person). This work was carried out on a corpus of English newswire texts provided by Agence France-Presse (AFP). To extract the salient dates that warrant inclusion in an event timeline, we first recognize and normalize temporal expressions in the texts, and then use a machine-learning approach to select the dates that are salient for a particular topic. We focused only on extracting the dates, not the events to which they relate.
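
The two stages named in the abstract (normalize temporal expressions, then pick out salient dates) can be illustrated with a minimal sketch. This is not the authors' system: the abstract states that a machine-learning approach is used for the second stage, whereas the sketch below substitutes a plain document-frequency ranking as a stand-in. The regular expression, the function names `normalize_dates` and `rank_dates`, and the sample sentences are all hypothetical, and only dates written as "Month D, YYYY" are handled.

```python
import re
from collections import Counter
from datetime import date

# Hypothetical lookup table: English month name -> month number.
MONTHS = {
    name: number
    for number, name in enumerate(
        ["january", "february", "march", "april", "may", "june", "july",
         "august", "september", "october", "november", "december"],
        start=1,
    )
}

# Assumption: only absolute dates of the form "January 12, 2010" are covered.
DATE_RE = re.compile(
    r"\b(" + "|".join(MONTHS) + r")\s+(\d{1,2}),\s+(\d{4})\b",
    re.IGNORECASE,
)


def normalize_dates(text):
    """Return an ISO 8601 string for each 'Month D, YYYY' expression in text."""
    normalized = []
    for month, day, year in DATE_RE.findall(text):
        try:
            value = date(int(year), MONTHS[month.lower()], int(day))
        except ValueError:
            continue  # skip impossible dates such as "February 30, 2010"
        normalized.append(value.isoformat())
    return normalized


def rank_dates(documents, top_k=3):
    """Rank dates by the number of documents mentioning them.

    A frequency baseline standing in for the learned salience model;
    ties are broken by date so the output is deterministic.
    """
    counts = Counter()
    for doc in documents:
        counts.update(set(normalize_dates(doc)))  # count each date once per document
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [iso_date for iso_date, _ in ranked[:top_k]]


if __name__ == "__main__":
    docs = [
        "The quake struck on January 12, 2010.",
        "On January 12, 2010, aid arrived; talks followed on March 3, 2010.",
        "Donors met on March 3, 2010 after the January 12, 2010 disaster.",
    ]
    print(rank_dates(docs, top_k=2))
```

A real temporal tagger would also have to resolve relative expressions ("last Friday") against the publication date of each article, which this sketch does not attempt.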


reference text

Salah Aït-Mokhtar, Jean-Pierre Chanod, and Claude Roux. 2002. Robustness beyond Shallowness: Incremental Deep Parsing. Natural Language Engineering, 8:121–144.
James Allan, Rahul Gupta, and Vikas Khandelwal. 2001. Temporal summaries of new topics. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’01, pages 10–18.
James Allan, editor. 2002. Topic Detection and Tracking. Springer.
Omar Alonso, Ricardo Baeza-Yates, and Michael Gertz. 2007. Exploratory Search Using Timelines. In SIGCHI 2007 Workshop on Exploratory Search and HCI Workshop.
Omar Rogelio Alonso. 2008. Temporal information retrieval. Ph.D. thesis, University of California at Davis, Davis, CA, USA. Adviser: Michael Gertz.
Regina Barzilay and Noemie Elhadad. 2002. Inferring Strategies for Sentence Ordering in Multidocument News Summarization. Journal of Artificial Intelligence Research, 17:35–55.
Thorsten Brants, Francine Chen, and Ayman Farahat. 2003. A system for new event detection. In Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’03, pages 330–337, New York, NY, USA. ACM.
Hai Leong Chieu and Yoong Keok Lee. 2004. Query based event extraction along a timeline. In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’04, pages 425–432.
Yoav Freund and Robert E. Schapire. 1997. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. Journal of Computer and System Sciences, 55(1):119–139.
Gabriel Pui Cheong Fung, Jeffrey Xu Yu, Philip S. Yu, and Hongjun Lu. 2005. Parameter free bursty events detection in text streams. In VLDB ’05: Proceedings of the 31st international conference on Very large data bases, pages 181–192.
Caroline Hagège and Xavier Tannier. 2008. XTM: A Robust Temporal Text Processor. In Computational Linguistics and Intelligent Text Processing, proceedings of 9th International Conference CICLing 2008, pages 231–240, Haifa, Israel, February. Springer Berlin / Heidelberg.
Sanda Harabagiu and Cosmin Adrian Bejan. 2005. Question Answering Based on Temporal Inference. In Proceedings of the Workshop on Inference for Textual Question Answering, Pittsburgh, Pennsylvania, USA, July.
Hyuckchul Jung, James Allen, Nate Blaylock, Will de Beaumont, Lucian Galescu, and Mary Swift. 2011. Building timelines from narrative clinical records: initial results based-on deep natural language understanding. In Proceedings of BioNLP 2011 Workshop, BioNLP ’11, pages 146–154, Stroudsburg, PA, USA. Association for Computational Linguistics.
Nattiya Kanhabua. 2009. Exploiting temporal information in retrieval of archived documents. In Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2009, Boston, MA, USA, July 19-23, 2009, page 848.
Youngho Kim and Jinwook Choi. 2011. Recognizing temporal information in Korean clinical narratives through text normalization. Healthc Inform Res, 17(3):150–5.
Giridhar Kumaran and James Allen. 2004. Text classification and named entities for new event detection. In SIGIR ’04: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, pages 297–304. ACM.
Wei Li, Wenjie Li, Qin Lu, and Kam-Fai Wong. 2005a. A Preliminary Work on Classifying Time Granularities of Temporal Questions. In Proceedings of Second international joint conference in NLP (IJCNLP 2005), Jeju Island, Korea, October.
Zhiwei Li, Bin Wang, Mingjing Li, and Wei-Ying Ma. 2005b. A Probabilistic Model for Retrospective News Event Detection. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil. ACM Press, New York City, NY, USA.
Thomas Mestl, Olga Cerrato, Jon Ølnes, Per Myrseth, and Inger-Mette Gustavsen. 2009. Time Challenges Challenging Times for Future Information Search. D-Lib Magazine, 15(5/6).
James Pustejovsky, Kiyong Lee, Harry Bunt, and Laurent Romary. 2010. ISO-TimeML: An international standard for semantic annotation. In Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, and Daniel Tapias, editors, Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10), Valletta, Malta, May. European Language Resources Association (ELRA).
Claude Roux. 2004. Annoter les documents XML avec un outil d’analyse syntaxique. In 11ème Conférence annuelle de Traitement Automatique des Langues Naturelles, Fès, Morocco, April. ATALA.
Estela Saquete, Jose L. Vicedo, Patricio Martínez-Barco, Rafael Muñoz, and Hector Llorens. 2009. Enhancing QA Systems with Complex Temporal Question Processing Capabilities. Journal of Artificial Intelligence Research, 35:775–811.
David A. Smith. 2002. Detecting events with date and place information in unstructured text. In JCDL ’02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, pages 191–196, New York, NY, USA. ACM.
Russell Swan and James Allen. 2000. Automatic generation of overview timelines. In Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’00, pages 49–56, New York, NY, USA. ACM.
Marc Verhagen, Robert Gaizauskas, Franck Schilder, Mark Hepple, Graham Katz, and James Pustejovsky. 2007. SemEval-2007 Task 15: TempEval Temporal Relation Identification. In Proceedings of SemEval workshop at ACL 2007, Prague, Czech Republic, June. Association for Computational Linguistics, Morristown, NJ, USA.
Rui Yan, Liang Kong, Congrui Huang, Xiaojun Wan, Xiaoming Li, and Yan Zhang. 2011a. Timeline generation through evolutionary trans-temporal summarization. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, 27-31 July 2011, Edinburgh, UK, pages 433–443.
Rui Yan, Xiaojun Wan, Jahna Otterbacher, Liang Kong, Xiaoming Li, and Yan Zhang. 2011b. Evolutionary timeline summarization: a balanced optimization framework via iterative substitution. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2011, Beijing, China, July 25-29, 2011, pages 745–754.
Y. Yang, T. Pierce, and J. G. Carbonell. 1998. A study on retrospective and on-line event detection. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, August. ACM Press, New York City, NY, USA.