nips nips2004 nips2004-90 nips2004-90-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Scott J. Gaffney, Padhraic Smyth
Abstract: Clustering and prediction of sets of curves is an important problem in many areas of science and engineering. It is often the case that curves tend to be misaligned from each other in a continuous manner, either in space (across the measurements) or in time. We develop a probabilistic framework that allows for joint clustering and continuous alignment of sets of curves in curve space (as opposed to a fixed-dimensional featurevector space). The proposed methodology integrates new probabilistic alignment models with model-based curve clustering algorithms. The probabilistic approach allows for the derivation of consistent EM learning algorithms for the joint clustering-alignment problem. Experimental results are shown for alignment of human growth data, and joint clustering and alignment of gene expression time-course data.
[1] J.O. Ramsay and B. W. Silverman. Functional Data Analysis. Springer-Verlag, New York, NY, 1997.
[2] Scott J. Gaffney. Probabilistic Curve-Aligned Clustering and Prediction with Regression Mixture Models. Ph.D. Dissertation, University of California, Irvine, 2004.
[3] Z. Bar-Joseph et al. A new approach to analyzing gene expression time series data. Journal of Computational Biology, 10(3):341–356, 2003.
[4] B. J. Frey and N. Jojic. Transformation-invariant clustering using the EM algorithm. IEEE Trans. PAMI, 25(1):1–17, January 2003.
[5] H. Chui, J. Zhang, and A. Rangarajan. Unsupervised learning of an atlas from unlabeled pointsets. IEEE Trans. PAMI, 26(2):160–172, February 2004.
[6] A. D. J. Cross and E. R. Hancock. Graph matching with a dual-step EM algorithm. IEEE Trans. PAMI, 20(11):1236–1253, November 1998.
[7] S. J. Gaffney and P. Smyth. Curve clustering with random effects regression mixtures. In C. M. Bishop and B. J. Frey, editors, Proc. Ninth Inter. Workshop on Artificial Intelligence and Stats, Key West, FL, January 3–6 2003.
[8] D. Chudova, S. J. Gaffney, and P. J. Smyth. Probabilistic models for joint clustering and timewarping of multi-dimensional curves. In Proc. of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI-2003), Acapulco, Mexico, August 7–10, 2003.
[9] D. Chudova, S. J. Gaffney, E. Mjolsness, and P. J. Smyth. Translation-invariant mixture models for curve clustering. In Proc. Ninth ACM SIGKDD Inter. Conf. on Knowledge Discovery and Data Mining, Washington D.C., August 24–27, New York, 2003. ACM Press.
[10] S. Gaffney and P. Smyth. Trajectory clustering with mixtures of regression models. In Surajit Chaudhuri and David Madigan, editors, Proc. Fifth ACM SIGKDD Inter. Conf. on Knowledge Discovery and Data Mining, August 15–18, pages 63–72, N.Y., 1999. ACM Press.
[11] P. H. C. Eilers. and B. D. Marx. Flexible smoothing with B-splines and penalties. Statistical Science, 11(2):89–121, 1996.
[12] A. Gelman, J. B. Carlin, H. S. Stern, and D. B. Rubin. Bayesian Data Analysis. Chapman & Hall, New York, NY, 1995.
[13] P. T. Spellman et al. Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Molec. Bio. Cell, 9(12):3273–3297, December 1998.
[14] J. Aach and G. M. Church. Aligning gene expression time series with time warping algorithms. Bioinformatics, 17(6):495–508, 2001.