acl acl2012 acl2012-205 acl2012-205-reference knowledge-graph by maker-knowledge-mining

205 acl-2012-Tweet Recommendation with Graph Co-Ranking

Source: pdf

Author: Rui Yan ; Mirella Lapata ; Xiaoming Li

Abstract: Mirella Lapata‡ Xiaoming Li†, \ ‡Institute for Language, \State Key Laboratory of Software Cognition and Computation, Development Environment, University of Edinburgh, Beihang University, Edinburgh EH8 9AB, UK Beijing 100083, China mlap@ inf .ed .ac .uk lxm@pku .edu .cn 2012.1 Twitter enables users to send and read textbased posts ofup to 140 characters, known as tweets. As one of the most popular micro-blogging services, Twitter attracts millions of users, producing millions of tweets daily. Shared information through this service spreads faster than would have been possible with traditional sources, however the proliferation of user-generation content poses challenges to browsing and finding valuable information. In this paper we propose a graph-theoretic model for tweet recommendation that presents users with items they may have an interest in. Our model ranks tweets and their authors simultaneously using several networks: the social network connecting the users, the network connecting the tweets, and a third network that ties the two together. Tweet and author entities are ranked following a co-ranking algorithm based on the intuition that that there is a mutually reinforcing relationship between tweets and their authors that could be reflected in the rankings. We show that this framework can be parametrized to take into account user preferences, the popularity of tweets and their authors, and diversity. Experimental evaluation on a large dataset shows that our model out- performs competitive approaches by a large margin.

reference text

Fabian Abel, Qi Gao, Geert-Jan Houben, and Ke Tao. 2011a. Analyzing temporal dynamics in Twitter profiles for personalized recommendations in the social web. In Proceedings of the ACM Web Science Conference 2011, pages 1–8, Koblenz, Germany. Fabian Abel, Qi Gao, Geert-Jan Houben, and Ke Tao. 2011b. Analyzing user modeling on Twitter for personalized news recommendations. User Modeling, Adaptation and Personalization, pages 1–12. Fabian Abel, Qi Gao, Geert-Jan Houben, and Ke Tao. 2011c. Semantic enrichment of twitter posts for user profile construction on the social web. The Semanic Web: Research and Applications, pages 375–389. David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet aladdress. Journal of Machine Learning Research, 3:993–1022. Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual web search engine. Proceedings of the 7th International Conference on World Wide Web, 30(1-7): 107–1 17. Jaime Carbonell and Jade Goldstein. 1998. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 335–336, Melbourne, Australia. Jilin Chen, Rowan Nairn, Les Nelson, Michael Bernstein, and Ed Chi. 2010. Short and tweet: experiments on recommending content from information streams. In Proceedings of the 28th International Conference on Human Factors in Computing Systems, pages 1185– 1194, Atlanta, Georgia. Junghoo Cho and Uri Schonfeld. 2007. Rankmass crawler: a crawler with high personalized pagerank coverage guarantee. In Proceedings of the 33rd International Conference on Very Large Data Bases, pages 375–386, Vienna, Austria. Anlei Dong, Ruiqiang Zhang, Pranam Kolari, Jing Bai, Fernando Diaz, Yi Chang, Zhaohui Zheng, and Hongyuan Zha. 2010. Time is of the essence: improving recency ranking using Twitter data. In Proceedings of the 19th International Conference on World Wide Web, pages 33 1–340, Raleigh, North Carolina. Yajuan Duan, Long Jiang, Tao Qin, Ming Zhou, and Heung-Yeung Shum. 2010. An empirical study on learning to rank of tweets. In Proceedings of the 23rd International Conference on Computational Linguistics, pages 295–303, Beijing, China. John Hannon, Mike Bennett, and Barry Smyth. 2010. Recommending twitter users to follow using content and collaborative filtering approaches. In Proceedings 524 of the 4th ACM Conference on Recommender Systems, pages 199–206, Barcelona, Spain. Liangjie Hong, Ovidiu Dan, and Brian D. Davison. 2011. Predicting popular messages in Twitter. In Proceedings of the 20th International Conference Companion on World Wide Web, pages 57–58, Hyderabad, India. Minlie Huang, Yi Yang, and Xiaoyan Zhu. 2011. Quality-biased ranking of short texts in microblogging services. In Proceedings of the 5th International Joint Conference on Natural Language Processing, pages 373–382, Chiang Mai, Thailand. Kalervo J¨ arvelin and Jaana Kek a¨l a¨inen. 2002. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems, 20:422–446. Thorsten Joachims. 1999. Making large-scale svm learning practical. InAdvances in Kernel Methods: Support Vector Learning, pages 169–184. MIT press. Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schutze. 2008. Introduction to Information Retrieval, volume 1. Cambridge University Press. Qiaozhu Mei, Jian Guo, and Dragomir Radev. 2010. Divrank: the interplay of prestige and diversity in information networks. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1009–1018, Washington, DC. Emily Pitler, Annie Louis, and Ani Nenkova. 2010. Automatic evaluation of linguistic quality in multidocument summarization. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 544–554, Uppsala, Sweden. Feng Qiu and Junghoo Cho. 2006. Automatic identification of user interest for personalized search. In Proceedings of the 15th International Conference on World Wide Web, pages 727–736, Edinburgh, Scotland. Sun Aaron R., Cheng Jiesi, Zeng, and Daniel Dajun. 2009. A novel recommendation framework for microblogging based on information diffusion. In Proceedings of the 19th Annual Workshop on Information Technologies and Systems, pages 199–216, Phoenix, Arizona. Daniel Ramage, Susan Dumais, and Dan Liebling. 2010. Characterizing microblogs with topic models. In International AAAI Conference on Weblogs and Social Media, pages 130–137. The AAAI Press. Gerard Salton and Christopher Buckley. 1988. Termweighting approaches in automatic text retrieval. Information Processing and Management, 24(5):5 13– 523. Jaime Teevan, Daniel Ramage, and Meredith Ringel Morris. 2011. #Twittersearch: a comparison of microblog search and web search. In Proceedings of the 4th ACM International Conference Mining, pages 35–44, on Web Search and Data Hong Kong, China. Ibrahim Uysal and W. Bruce Croft. 2011. User oriented tweet ranking: a filtering approach to microblogs. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pages 2261–2264, Glasgow, Scotland. Xiaojun Wan, Jianwu Yang, and Jianguo Xiao. 2007. Single document summarization with document expansion. In Proceedings of the 22nd Conference on Artificial Intelligence, pages 931–936, Vancouver, British Columbia. Xiaojun Wan, Huiying Li, and Jianguo Xiao. 2010. Cross-language document summarization based on machine translation quality prediction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 917–926, Uppsala, Sweden. Rui Yan, Jian-Yun Nie, and Xiaoming Li. 2011. Summarize what you are interested in: An optimization framework for interactive personalized summarization. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 1342–1351. Association for Computational Linguistics. Ding Zhou, Sergey A. Orshanskiy, Hongyuan Zha, and C. Lee Giles. 2007. Co-ranking authors and documents in a heterogeneous network. In Proceedings of the 7th IEEE International Conference on Data Mining, pages 739–744. IEEE. 525