nips nips2012 nips2012-266 nips2012-266-reference knowledge-graph by maker-knowledge-mining

266 nips-2012-Patient Risk Stratification for Hospital-Associated C. diff as a Time-Series Classification Task

Source: pdf

Author: Jenna Wiens, Eric Horvitz, John V. Guttag

Abstract: A patient’s risk for adverse events is affected by temporal processes including the nature and timing of diagnostic and therapeutic activities, and the overall evolution of the patient’s pathophysiology over time. Yet many investigators ignore this temporal aspect when modeling patient outcomes, considering only the patient’s current or aggregate state. In this paper, we represent patient risk as a time series. In doing so, patient risk stratiﬁcation becomes a time-series classiﬁcation task. The task differs from most applications of time-series analysis, like speech processing, since the time series itself must ﬁrst be extracted. Thus, we begin by deﬁning and extracting approximate risk processes, the evolving approximate daily risk of a patient. Once obtained, we use these signals to explore different approaches to time-series classiﬁcation with the goal of identifying high-risk patterns. We apply the classiﬁcation to the speciﬁc task of identifying patients at risk of testing positive for hospital acquired Clostridium difﬁcile. We achieve an area under the receiver operating characteristic curve of 0.79 on a held-out set of several hundred patients. Our two-stage approach to risk stratiﬁcation outperforms classiﬁers that consider only a patient’s current state (p<0.05). 1

reference text

[1] M. M. Gaber, A. Zaslavsky, and S. Krishnaswamy. Mining data streams: A review. SIGMOD, 34(2), June 2005.

[2] Z. Xing, J. Pei, and E. Keogh. A brief survey on sequence classiﬁcation. ACM SIGKDD Explorations, 12(1):40–48, June 2010.

[3] E. R. Dubberke, K. A. Reske, Y. Yan, M. A. Olsen, L. C. McDonald, and V. J. Fraser. Clostridium difﬁcile - associated disease in a setting of endemicity: Identiﬁcation of novel risk factors. Clinical Infectious Diseases, 45:1543–9, December 2007.

[4] CDC. Rates for clostridium difﬁcile infection among hospitalized patients. Centers for Disease Control and Prevention Morbidity and Mortality Weekly Report, 60(34):1171, 2011.

[5] D. A. Katz, M.E. Lynch, and B. Littenber. Clinical prediction rules to optimize cytotoxin testing for clostridium difﬁcile in hospitalized patients with diarrhea. American Journal of Medicine, 100(5):487–95, 1996.

[6] J. Tanner, D. Khan, D. Anthony, and J. Paton. Waterlow score to predict patietns at risk of developing clostridium difﬁcile-associated disease. Journal of Hospital Infection, 71(3):239– 244, 2009.

[7] E. R. Dubberke, Y. Yan, K. A. Reske, A.M. Butler, J. Doherty, V. Pham, and V.J. Fraser. Development and validation of a clostridium difﬁcile infection risk prediction model. Infect Control Hosp Epidemiol, 32(4):360–366, 2011.

[8] K. W. Garey, T. K. Dao-Tran, Z. D. Jiang, M. P. Price, L. O. Gentry, and DuPont H. L. A clinical risk index for clostridium difﬁcile infection in hospitalized patients receiving broadspectrum antibiotics. Journal of Hospital Infections, 70(2):142–147, 2008.

[9] G. Krapohl. Preventing health care-associated infection: Development of a clinical prediction rule for clostridium difﬁcile infection. PhD Thesis, 2011.

[10] N. Peled, S. Pitlik, Z. Samra, A. Kazakov, Y. Bloch, and J. Bishara. Predicting clostridium difﬁcile toxin in hospitalized patients with antibiotic-associated diarrhea. Infect Control Hosp Epidemiol, 28(4):377–81, 2007.

[11] J. Wiens, E. Horvitz, and J. Guttag. Learning evolving patient risk processes for c. diff colonization. In ICML Workshop on Machine Learning from Clinical Data, 2012.

[12] T. W. Liao. Clustering of time series data - a survey. The Journal of the Pattern Recognition Society, January 2005.

[13] P. Bennett, S. Dumais, and E. Horvitz. The combination of test classiﬁers using reliability indicators. Information Retrieval, 8(1):67–100, 2005.

[14] H. Sakoe and S. Chiba. Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 26(1):43–49, 1978.

[15] C. Ratanamahatana and E. Keogh. Three myths about dynamic time warping data mining. In Proceedings of the Fifth SIAM International Conference on Data Mining, 2005.

[16] C. Bahlmann, B. Haasdonk, and Burkhardt H. On-line handwriting recognition with support vector machines - a kernel approach. Proceedings of the 8th International Workshop on Frontiers in Handwriting Recognition, 2002.

[17] L.R. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), February 1989.

[18] K. Morik, P. Brockhausen, and T. Joachims. Combining statistical learning with a knowledgebased approach - a case study in intensive care monitoring. Proc. 16th International Conference on Machine Learning, 1999.

[19] T. Joachims. Making large-scale svm learning practical. advances in kernel methods - support vector learning, 1999.

[20] K. Murphy. Hidden Markov Model (HMM) Toolbox for Matlab. www.cs.ubc.ca/˜murphyk/Software/HMM/hmm.html. 9