acl acl2012 acl2012-126 acl2012-126-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Nathanael Chambers
Abstract: Temporal reasoners for document understanding typically assume that a document’s creation date is known. Algorithms to ground relative time expressions and order events often rely on this timestamp to assist the learner. Unfortunately, the timestamp is not always known, particularly on the Web. This paper addresses the task of automatic document timestamping, presenting two new models that incorporate rich linguistic features about time. The first is a discriminative classifier with new features extracted from the text’s time expressions (e.g., ‘since 1999’). This model alone improves on previous generative models by 77%. The second model learns probabilistic constraints between time expressions and the unknown document time. Imposing these learned constraints on the discriminative model further improves its accuracy. Finally, we present a new experiment design that facil- itates easier comparison by future work.
Nathanael Chambers and Dan Jurafsky. 2008. Jointly combining implicit constraints improves temporal ordering. In Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP), Hawaii, USA. W. Dakka, L. Gravano, and P. G. Ipeirotis. 2008. Answering general time sensitive queries. In Proceedings of the 1 International ACM Conference on Informa7th tion and Knowledge Management, pages 1437–1438. Angelo Dalli and Yorick Wilks. 2006. Automatic dating of documents and temporal text classification. In Proceedings of the Workshop on Annotating and Reasoning about Time and Events, pages 17–22. Franciska de Jong, Henning Rode, and Djoerd Hiemstra. 2005. Temporal language models for the disclosure of historical text. In Humanities, computers and cultural heritage: Proceedings of the XVIth International Conference of the Association for History and Computing (AHC 2005). Fernando Diaz and Rosie Jones. 2004. Using temporal profiles of queries for precision prediction. In Proceedings of the 27th Annual International ACM Special Interest Group on Information Retrieval Conference. David Graff. 2002. English Gigaword. Linguistic Data Consortium. Nattiya Kanhabua and Kjetil Norvag. 2008. Improving temporal language models for determining time of non-timestamped documents. In Proceedings of the 12th European conference on Research and Advanced Technology for Digital Libraries. Nattiya Kanhabua and Kjetil Norvag. 2009. Using temporal language models for document dating. Lecture Notes in Computer Science: machine learning and knowledge discovery in databases, 5782. W. Kraaij. 2004. Variations on language modeling for information retrieval. Ph.D. thesis, University of Twente. Abhimanu Kumar, Matthew Lease, and Jason Baldridge. 2011. Supervised language modeling for temporal resolution of texts. In Proceedings of CIKM. Xiaoyan Li and W. Bruce Croft. 2003. Time-based language models. In Proceedings of the twelfth international conference on Information and knowledge management. Dolores M. Llid o´, Rafael Llavori, and Mari a´ J. Aramburu. 2001. Extracting temporal references to assign document event-time periods. In Proceedings of the 12th International Conference on Database and Expert Systems Applications. 106 Inderjeet Mani and George Wilson. 2000. Robust temporal processing of news. In Proceedings of the 38th Annual Meeting on Association for Computational Linguistics. Mitchell P. Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. 1993. Building a large annotated corpus of english: The penn treebank. Computational Linguistics, 19. James Pustejovsky, Patrick Hanks, Roser Sauri, Andrew See, David Day, Lisa Ferro, Robert Gaizauskas, Marcia Lazo, Andrea Setzer, and Beth Sundheim. 2003. The timebank corpus. Corpus Linguistics, pages 647– 656. Marc Verhagen, Robert Gaizauskas, Frank Schilder, Mark Hepple, Graham Katz, and James Pustejovsky. 2007. Semeval-2007 task 15: Tempeval temporal relation identification. In Workshop on Semantic Evaluations. Marc Verhagen, Robert Gaizauskas, Frank Schilder, Mark Hepple, Jessica Moszkowicz, and James Pustejovsky. 2009. The tempeval challenge: identifying temporal relations in text. Special Issue: Computational Semantic Analysis of Language: SemEval-2007 and Beyond, 43(2): 161–179. Katsumasa Yoshikawa, Sebastian Riedel, Masayuki Asahara, and Yuji Matsumoto. 2009. Jointly identifying temporal relations with markov logic. In Proceedings of the Association for Computational Linguistics (ACL). Ruiqiang Zhang, Yi Chang, Zhaohui Zheng, Donald Metzler, and Jian yun Nie. 2009. Search result re-ranking by feedback control adjustment for timesensitive query. In Proceedings of the 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics.