acl acl2013 acl2013-142 acl2013-142-reference knowledge-graph by maker-knowledge-mining

142 acl-2013-Evolutionary Hierarchical Dirichlet Process for Timeline Summarization


Source: pdf

Author: Jiwei Li ; Sujian Li

Abstract: Timeline summarization aims at generating concise summaries and giving readers a faster and better access to understand the evolution of news. It is a new challenge which combines salience ranking problem with novelty detection. Previous researches in this field seldom explore the evolutionary pattern of topics such as birth, splitting, merging, developing and death. In this paper, we develop a novel model called Evolutionary Hierarchical Dirichlet Process(EHDP) to capture the topic evolution pattern in time- line summarization. In EHDP, time varying information is formulated as a series of HDPs by considering time-dependent information. Experiments on 6 different datasets which contain 3 156 documents demonstrates the good performance of our system with regard to ROUGE scores.


reference text

Amr Ahmed and Eric Xing. Dynamic non-parametric mixture models and the recurrent chinese restaurant process. 2008. In SDM. 559 James Allan, Rahul Gupta and Vikas Khandelwal. Temporal summaries of new topics. 2001 . In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval David Blei, Andrew Ng and Micheal Jordan. 2003. Latent dirichlet allocation. In Journal of Machine Learning Research. David Blei and John Lafferty. Dynamic topic models. 2006. In Proceedings of the 23rd international conference on Machine learning. Francois Carol, Manuel Davy and Arnaud Doucet. Generalized poly urn for time-varying dirichlet process mixtures. 2007. In Proceedings of the International Conference on Uncertainty in Artificial Intelligence. Deepayan Chakrabarti, Ravi Kumar and Andrew Tomkins. Evolutionary Clustering. InProceedings of the 12th ACM SIGKDD international conference Knowledge discoveryand data mining. Hai-Leong Chieu and Yoong-Keok Lee. Query based event extraction along a timeline. In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval Giridhar Kumaran and James Allan. 2004. Text classification and named entities for new event detection. In Proceedings of the 27th annual international ACM SIGIR04. Liangda Li, Ke Zhou, Gui-Rong Xue, Hongyuan Zha and Yong Yu. Enhancing diversity, coverage and balance for summarization through structure learning. In Proceedings of the 18th international conference on World wide web. Chin-Yew Lin and Eduard Hovy. Automatic evaluation of summaries using n-gram co-occurrence statistics. In Proceedings of the Human Language Technology Conference of the NAACL. 2003. Dragomar Radev, Hongyan. Jing, and Malgorzata Stys. 2004. Centroid-based summarization of multiple documents. In Information Processing and Management. Lu Ren, David Dunson and Lawrence Carin. The dynamic hierarchical Dirichlet process. 2008. In Proceedings of the 25th international conference on Machine Learning. Xiaojun Wan, Jianwu Yang and Jianguo Xiao. 2007. Manifold-ranking based topic-focused multidocument summarization. In Proceedings of International Joint Conference on Artificial Intelligence. Xuerui Wang and Andrew MaCallum. Topics over time: a non-Markov continuous-time model of topical trends. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. Yee Whye Teh, Michael Jordan, Matthew Beal and David Blei. Hierarchical Dirichlet Processes. In American Statistical Association. Rui Yan, Liang Kong, Congrui Huang, Xiaojun Wan, Xiaoming Li and Yan Zhang. 2011a. Evolutionary Timeline Summarization: a Balanced Optimization Framework via Iterative Substitution. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval. Rui Yan, Liang Kong, Congrui Huang, Xiaojun Wan, Jahna Otterbacher, Xiaoming Li and Yan Zhang. Timeline Generation Evolutionary Trans-Temporal Summarization. 2011b. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Jianwen Zhang, Yangqiu Song, Changshui Zhang and Shixia Liu. 2010. Evolutionary Hierarchical Dirichlet Processes for Multiple Correlated Time-varying Corpora. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. 560