jmlr jmlr2005 jmlr2005-12 jmlr2005-12-reference knowledge-graph by maker-knowledge-mining

12 jmlr-2005-An MDP-Based Recommender System

Source: pdf

Author: Guy Shani, David Heckerman, Ronen I. Brafman

Abstract: Typical recommender systems adopt a static view of the recommendation process and treat it as a prediction problem. We argue that it is more appropriate to view the problem of generating recommendations as a sequential optimization problem and, consequently, that Markov decision processes (MDPs) provide a more appropriate model for recommender systems. MDPs introduce two beneﬁts: they take into account the long-term effects of each recommendation and the expected value of each recommendation. To succeed in practice, an MDP-based recommender system must employ a strong initial model, must be solvable quickly, and should not consume too much memory. In this paper, we describe our particular MDP model, its initialization using a predictive model, the solution and update algorithm, and its actual performance on a commercial site. We also describe the particular predictive model we used which outperforms previous models. Our system is one of a small number of commercially deployed recommender systems. As far as we know, it is the ﬁrst to report experimental analysis conducted on a real commercial site. These results validate the commercial value of recommender systems, and in particular, of our MDP-based approach. Keywords: recommender systems, Markov decision processes, learning, commercial applications

reference text

F. Bacchus, C. Boutilier, and A. J. Grove. Rewarding behaviors. In Proceedings of the Thirteenth National Conference on Artiﬁcial Intelligence, Vol. 2, pages 1160–1167, Portalnad, OR, 1996. M. Balabanovic and Y. Shoham. Combining content-based and collaborative recommendation. Communications of the ACM, 40(3):62—72, March 1997. R. E. Bellman. Dynamic Programming. Princeton University Press, 1962. T. Bohnenberger and A. Jameson. When policies are better than plans: decision-theoretic planning of recommendation sequences. In IUI ’01: Proceedings of the 6th international conference on Intelligent user interfaces, pages 21–24, New York, NY, USA, 2001. ACM Press. ISBN 1-58113325-1. 1292 A N MDP-BASED R ECOMMENDER S YSTEM B. Bonet and H. Geffner. Faster heuristic search algorithms for planning with uncertainty and full feedback. In G. Gottlob, editor, 18th International Joint Conf. on Artiﬁcial Intelligence, pages 1233–1238, Acapulco, Mexico, 2003. Morgan Kaufmann Publishers Inc. C. Boutilier. A POMDP formulation of preference elicitation problems. In Eighteenth national conference on Artiﬁcial intelligence, pages 239–246, Edmonton, Alberta, Canada, 2002. American Association for Artiﬁcial Intelligence. C. Boutilier, R. Dearden, and M. Goldszmidt. Stochastic dynamic programming with factored representations. Artiﬁcial Intelligence, 121(1-2):49–107, 2000. J. S. Breese, D. Heckerman, and C. Kadie. Empirical analysis of predictive algorithms for collaborative ﬁltering. In Proceedings of the 14th conference on Uncertainity in Artiﬁcial Inteligence, pages 43–52, San Francisco, California, 1998. Morgan Kaufmann Publishers Inc. R. Burke. Knowledge-Based Recommender Systems, volume 69 of Encyclopedia of Library and Information Systems, supplement 32. A. Kent, New York, 2000. R. Burke. Hybrid recommender systems: Survey and experiments. User Modeling and UserAdapted Interaction, 12(4):331–370, 2002. S. F. Chen and J. Goodman. An empirical study of smoothing techniques for language modeling. In Arivind Joshi and Martha Palmer, editors, Proceedings of the Thirty-Fourth Annual Meeting of the Association for Computational Linguistics, pages 310–318, San Francisco, 1996. Morgan Kaufmann Publishers Inc. M. Claypool, P. Le, M. Wased, and D. Brown. Implicit interest indicators. In IUI ’01: Proceedings of the 6th international conference on Intelligent user interfaces, pages 33–40, Santa Fe, New Mexico, United States, 2001. ACM Press. N. Good, J. Ben Schafer, J. A. Konstan, A. Borchers, B. M. Sarwar, J. L. Herlocker, and J. Riedl. Combining collaborative ﬁltering with personal agents for better recommendations. In AAAI ’99/IAAI ’99: Proceedings of the sixteenth national conference on Artiﬁcial intelligence and the eleventh Innovative applications of artiﬁcial intelligence conference innovative applications of artiﬁcial intelligence, pages 439–446, Orlando, Florida, United States, 1999. American Association for Artiﬁcial Intelligence. D. Heckerman, D. M. Chickering, C. Meek, R. Rounthwaite, and C. M. Kadie. Dependency networks for inference, collaborative ﬁltering, and data visualization. Journal of Machine Learning Research, 1:49–75, 2000. R. A. Howard. Dynamic Programming and Markov Processes. MIT Press, 1960. C. M. Kadie, C. Meek, and D. Heckerman. CFW: A collaborative ﬁltering system using posterios over weights of evidence. In 18th Conference on Uncertainty in AI (UAI’02), pages 242–250, 2002. L. P. Kaelbling, M. L. Littman, and A. P. Moore. Reinforcement learning: A survey. Journal of Artiﬁcial Intelligence Research, 4:237–285, 1996. 1293 S HANI , B RAFMAN AND H ECKERMAN B. Kitts, D. Freed, and M. Vrieze. Cross-sell: a fast promotion-tunable customer-item recommendation method based on conditionally independent probabilities. In KDD ’00: Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 437–446, Boston, Massachusetts, United States, 2000. ACM Press. D. Koller and R. Parr. Policy iteration for factored mdps. In UAI ’00: Proceedings of the 16th Conference on Uncertainty in Artiﬁcial Intelligence, pages 326–334, San Francisco, CA, USA, 2000. Morgan Kaufmann Publishers Inc. Microsoft. Recommendation score. Microsoft Commerce Server 2002 Documentation, 2002. R. J. Mooney and L. Roy. Content-based book recommending using learning for text categorization. In DL ’00: Proceedings of the ﬁfth ACM conference on Digital libraries, pages 195–204, San Antonio, Texas, United States, 2000. ACM Press. M. Puterman. Markov Decision Processes. Wiley, New York, 1994. P. Resnick, N. Iacovou, M. Suchak, P. Bergstorm, and J. Riedl. GroupLens: An Open Architecture for Collaborative Filtering of Netnews. In Proceedings of ACM 1994 Conference on Computer Supported Cooperative Work, pages 175–186, Chapel Hill, North Carolina, 1994. ACM. P. Resnick and H. R. Varian. Recommender systems. Special issue of Communications of the ACM, pages 56–58, March 1997. B. Sarwar, G. Karypis, J. Konstan, and J. Riedl. Application of dimensionality reduction in recommender systems: a case study. In ACM WebKDD 2000 Web Mining for E-Commerce Workshop, August 2000a. B. Sarwar, G. Karypis, J. A. Konstan, and J. Riedl. Analysis of recommendation algorithms for e-commerce. In EC ’00: Proceedings of the 2nd ACM conference on Electronic commerce, pages 158–167, New York, NY, USA, 2000b. ACM Press. ISBN 1-58113-272-7. J. B. Schafer, J. A. Konstan, and J. Riedle. E-commerce recommendation applications. Data Mining Knowledge Discovery, 5(1-2):115–153, 2001. Z. Su, Q. Yang, and H. J. Zhang. A prediction system for multimedia pre-fetching in internet. In MULTIMEDIA ’00: Proceedings of the eighth ACM international conference on Multimedia, pages 3–11, Marina del Rey, California, United States, 2000. ACM Press. R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998. S. Thi´ baux, F. Kabanza, and J. Slaney. Anytime state-based solution methods for decision proe cesses with non-Markovian rewards. In 18th Conference on Uncertainty in AI (UAI’02), pages 501–510, Edmonton, Canada, July 2002. Morgan Kaufmann. A. Yeh. More accurate tests for the statistical signiﬁcance of result differences. In Proceedings of the 17th conference on Computational linguistics, pages 947–953, Saarbrcken, Germany, 2000. Association for Computational Linguistics. 1294 A N MDP-BASED R ECOMMENDER S YSTEM A. Zimdars, D. M. Chickering, and C. Meek. Using temporal data for making recommendations. In UAI ’01: Proceedings of the 17th Conference in Uncertainty in Artiﬁcial Intelligence, pages 580–588, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc. 1295