nips nips2001 nips2001-41 nips2001-41-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Igor V. Cadez, Padhraic Smyth
Abstract: Massive transaction data sets are recorded in a routine manner in telecommunications, retail commerce, and Web site management. In this paper we address the problem of inferring predictive individual profiles from such historical transaction data. We describe a generative mixture model for count data and use an an approximate Bayesian estimation framework that effectively combines an individual’s specific history with more general population patterns. We use a large real-world retail transaction data set to illustrate how these profiles consistently outperform non-mixture and non-Bayesian techniques in predicting customer behavior in out-of-sample data. 1
Agrawal, R., Imielenski, T., and Swami, A. (1993) Mining association rules between sets of items in large databases, Proceedings of the ACM SIGMOD Conference on Management of Data (SIGMOD’98), New York: ACM Press, pp. 207–216. Cadez, I. V., Smyth, P., Ip, E., Mannila, H. (2001) Predictive profiles for transaction data using finite mixture models, Technical Report UCI-ICS-01-67, Information and Computer Science, University of California, Irvine (available online at www.datalab.uci.edu. Heckerman, D., Chickering, D. M., Meek, C., Rounthwaite, R., and Kadie, C. (2000) Dependency networks for inference, collaborative filtering, and data visualization. Journal of Machine Learning Research, 1, pp. 49–75. Hoffmann, T. (1999) Probabilistic latent sematic indexing, Proceedings of the ACM SIGIR Conference 1999, New York: ACM Press, 50–57. Lawrence, R.D., Almasi, G.S., Kotlyar, V., Viveros, M.S., Duri, S.S. (2001) Personalization of supermarket product recommendations, Data Mining and Knowledge Discovery, 5 (1/2). Lazarsfeld, P. F. and Henry, N. W. (1968) Latent Structure Analysis, New York: Houghton Mifflin. McCallum, A. (1999) Multi-label text classification with a mixture model trained by EM, in AAAI’99 Workshop on Text Learning. Strehl, A. and J. Ghosh (2000) Value-based customer grouping from large retail datasets, Proc. SPIE Conf. on Data Mining and Knowledge Discovery, SPIE Proc. Vol. 4057, Orlando, pp 33–42.