nips nips2012 nips2012-266 nips2012-266-reference knowledge-graph by maker-knowledge-mining

266 nips-2012-Patient Risk Stratification for Hospital-Associated C. diff as a Time-Series Classification Task


Source: pdf

Author: Jenna Wiens, Eric Horvitz, John V. Guttag

Abstract: A patient’s risk for adverse events is affected by temporal processes including the nature and timing of diagnostic and therapeutic activities, and the overall evolution of the patient’s pathophysiology over time. Yet many investigators ignore this temporal aspect when modeling patient outcomes, considering only the patient’s current or aggregate state. In this paper, we represent patient risk as a time series. In doing so, patient risk stratification becomes a time-series classification task. The task differs from most applications of time-series analysis, like speech processing, since the time series itself must first be extracted. Thus, we begin by defining and extracting approximate risk processes, the evolving approximate daily risk of a patient. Once obtained, we use these signals to explore different approaches to time-series classification with the goal of identifying high-risk patterns. We apply the classification to the specific task of identifying patients at risk of testing positive for hospital acquired Clostridium difficile. We achieve an area under the receiver operating characteristic curve of 0.79 on a held-out set of several hundred patients. Our two-stage approach to risk stratification outperforms classifiers that consider only a patient’s current state (p<0.05). 1


reference text

[1] M. M. Gaber, A. Zaslavsky, and S. Krishnaswamy. Mining data streams: A review. SIGMOD, 34(2), June 2005.

[2] Z. Xing, J. Pei, and E. Keogh. A brief survey on sequence classification. ACM SIGKDD Explorations, 12(1):40–48, June 2010.

[3] E. R. Dubberke, K. A. Reske, Y. Yan, M. A. Olsen, L. C. McDonald, and V. J. Fraser. Clostridium difficile - associated disease in a setting of endemicity: Identification of novel risk factors. Clinical Infectious Diseases, 45:1543–9, December 2007.

[4] CDC. Rates for clostridium difficile infection among hospitalized patients. Centers for Disease Control and Prevention Morbidity and Mortality Weekly Report, 60(34):1171, 2011.

[5] D. A. Katz, M.E. Lynch, and B. Littenber. Clinical prediction rules to optimize cytotoxin testing for clostridium difficile in hospitalized patients with diarrhea. American Journal of Medicine, 100(5):487–95, 1996.

[6] J. Tanner, D. Khan, D. Anthony, and J. Paton. Waterlow score to predict patietns at risk of developing clostridium difficile-associated disease. Journal of Hospital Infection, 71(3):239– 244, 2009.

[7] E. R. Dubberke, Y. Yan, K. A. Reske, A.M. Butler, J. Doherty, V. Pham, and V.J. Fraser. Development and validation of a clostridium difficile infection risk prediction model. Infect Control Hosp Epidemiol, 32(4):360–366, 2011.

[8] K. W. Garey, T. K. Dao-Tran, Z. D. Jiang, M. P. Price, L. O. Gentry, and DuPont H. L. A clinical risk index for clostridium difficile infection in hospitalized patients receiving broadspectrum antibiotics. Journal of Hospital Infections, 70(2):142–147, 2008.

[9] G. Krapohl. Preventing health care-associated infection: Development of a clinical prediction rule for clostridium difficile infection. PhD Thesis, 2011.

[10] N. Peled, S. Pitlik, Z. Samra, A. Kazakov, Y. Bloch, and J. Bishara. Predicting clostridium difficile toxin in hospitalized patients with antibiotic-associated diarrhea. Infect Control Hosp Epidemiol, 28(4):377–81, 2007.

[11] J. Wiens, E. Horvitz, and J. Guttag. Learning evolving patient risk processes for c. diff colonization. In ICML Workshop on Machine Learning from Clinical Data, 2012.

[12] T. W. Liao. Clustering of time series data - a survey. The Journal of the Pattern Recognition Society, January 2005.

[13] P. Bennett, S. Dumais, and E. Horvitz. The combination of test classifiers using reliability indicators. Information Retrieval, 8(1):67–100, 2005.

[14] H. Sakoe and S. Chiba. Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 26(1):43–49, 1978.

[15] C. Ratanamahatana and E. Keogh. Three myths about dynamic time warping data mining. In Proceedings of the Fifth SIAM International Conference on Data Mining, 2005.

[16] C. Bahlmann, B. Haasdonk, and Burkhardt H. On-line handwriting recognition with support vector machines - a kernel approach. Proceedings of the 8th International Workshop on Frontiers in Handwriting Recognition, 2002.

[17] L.R. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), February 1989.

[18] K. Morik, P. Brockhausen, and T. Joachims. Combining statistical learning with a knowledgebased approach - a case study in intensive care monitoring. Proc. 16th International Conference on Machine Learning, 1999.

[19] T. Joachims. Making large-scale svm learning practical. advances in kernel methods - support vector learning, 1999.

[20] K. Murphy. Hidden Markov Model (HMM) Toolbox for Matlab. www.cs.ubc.ca/˜murphyk/Software/HMM/hmm.html. 9