nips nips2013 nips2013-134 nips2013-134-reference knowledge-graph by maker-knowledge-mining

134 nips-2013-Graphical Models for Inference with Missing Data

Source: pdf

Author: Karthika Mohan, Judea Pearl, Jin Tian

Abstract: We address the problem of recoverability i.e. deciding whether there exists a consistent estimator of a given relation Q, when data are missing not at random. We employ a formal representation called ‘Missingness Graphs’ to explicitly portray the causal mechanisms responsible for missingness and to encode dependencies between these mechanisms and the variables being measured. Using this representation, we derive conditions that the graph should satisfy to ensure recoverability and devise algorithms to detect the presence of these conditions in the graph. 1

reference text

[1] P.D. Allison. Missing data series: Quantitative applications in the social sciences, 2002. 8

[2] T. Bu, N. Dufﬁeld, F.L. Presti, and D. Towsley. Network tomography on general topologies. In ACM SIGMETRICS Performance Evaluation Review, volume 30, pages 21–30. ACM, 2002.

[3] E.R. Buhi, P. Goodson, and T.B. Neilands. Out of sight, not out of mind: strategies for handling missing data. American journal of health behavior, 32:83–92, 2008.

[4] R.M. Daniel, M.G. Kenward, S.N. Cousens, and B.L. De Stavola. Using causal diagrams to guide analysis in missing data problems. Statistical Methods in Medical Research, 21(3):243–256, 2012.

[5] R. Dechter, I. Meiri, and J. Pearl. Temporal constraint networks. Artiﬁcial intelligence, 1991.

[6] C.K. Enders. Applied Missing Data Analysis. Guilford Press, 2010.

[7] U.M. Fayyad. Data mining and knowledge discovery: Making sense out of data. IEEE expert, 11(5):20– 25, 1996.

[8] F. M. Garcia. Deﬁnition and diagnosis of problematic attrition in randomized controlled experiments. Working paper, April 2013. Available at SSRN: http://ssrn.com/abstract=2267120.

[9] R.D. Gill and J.M. Robins. Sequential models for coarsening and missingness. In Proceedings of the First Seattle Symposium in Biostatistics, pages 295–305. Springer, 1997.

[10] R.D. Gill, M.J. Van Der Laan, and J.M. Robins. Coarsening at random: Characterizations, conjectures, counter-examples. In Proceedings of the First Seattle Symposium in Biostatistics, pages 255–294. Springer, 1997.

[11] J.W Graham. Missing Data: Analysis and Design (Statistics for Social and Behavioral Sciences). Springer, 2012.

[12] D.F. Heitjan and D.B. Rubin. Ignorability and coarse data. The Annals of Statistics, pages 2244–2253, 1991.

[13] R.J.A. Little and D.B. Rubin. Statistical analysis with missing data. Wiley, 2002.

[14] B.M. Marlin and R.S. Zemel. Collaborative prediction and ranking with non-random missing data. In Proceedings of the third ACM conference on Recommender systems, pages 5–12. ACM, 2009.

[15] B.M. Marlin, R.S. Zemel, S. Roweis, and M. Slaney. Collaborative ﬁltering and the missing at random assumption. In UAI, 2007.

[16] B.M. Marlin, R.S. Zemel, S.T. Roweis, and M. Slaney. Recommender systems: missing data and statistical model estimation. In IJCAI, 2011.

[17] P.E. McKnight, K.M. McKnight, S. Sidani, and A.J. Figueredo. Missing data: A gentle introduction. Guilford Press, 2007.

[18] Harvey J Miller and Jiawei Han. Geographic data mining and knowledge discovery. CRC, 2009.

[19] K. Mohan and J. Pearl. On the testability of models with missing data. To appear in the Proceedings of AISTAT-2014; Available at http://ftp.cs.ucla.edu/pub/stat ser/r415.pdf.

[20] J. Pearl. Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmann, 1988.

[21] J. Pearl. Causality: models, reasoning and inference. Cambridge Univ Press, New York, 2009.

[22] J. Pearl and K. Mohan. Recoverability and testability of missing data: Introduction and summary of results. Technical Report R-417, UCLA, 2013. Available at http://ftp.cs.ucla.edu/pub/stat ser/r417.pdf.

[23] C.Y.J. Peng, M. Harwell, S.M. Liou, and L.H. Ehman. Advances in missing data methods and implications for educational research. Real data analysis, pages 31–78, 2006.

[24] J.L. Peugh and C.K. Enders. Missing data in educational research: A review of reporting practices and suggestions for improvement. Review of educational research, 74(4):525–556, 2004.

[25] D.B. Rubin. Inference and missing data. Biometrika, 63:581–592, 1976.

[26] D.B. Rubin. Multiple Imputation for Nonresponse in Surveys. Wiley Online Library, New York, NY, 1987.

[27] D.B. Rubin. Multiple imputation after 18+ years. Journal of the American Statistical Association, 91(434):473–489, 1996.

[28] J.L. Schafer and J.W. Graham. Missing data: our view of the state of the art. Psychological Methods, 7(2):147–177, 2002.

[29] F. Thoemmes and N. Rose. Selection of auxiliary variables in missing data problems: Not all auxiliary variables are created equal. Technical Report Technical Report R-002, Cornell University, 2013.

[30] M.J. Van der Laan and J.M. Robins. Uniﬁed methods for censored longitudinal data and causality. Springer Verlag, 2003.

[31] W. Wothke. Longitudinal and multigroup modeling with missing data. Lawrence Erlbaum Associates Publishers, 2000. 9