
20 jmlr-2008-Causal Reasoning with Ancestral Graphs (Special Topic on Causality)



Author: Jiji Zhang

Abstract: Causal reasoning is primarily concerned with what would happen to a system under external interventions. In particular, we are often interested in predicting the probability distribution of some random variables that would result if some other variables were forced to take certain values. One prominent approach to tackling this problem is based on causal Bayesian networks, using directed acyclic graphs as causal diagrams to relate post-intervention probabilities to pre-intervention probabilities that are estimable from observational data. However, such causal diagrams are seldom fully testable given observational data. In consequence, many causal discovery algorithms based on data-mining can only output an equivalence class of causal diagrams (rather than a single one). This paper is concerned with causal reasoning given an equivalence class of causal diagrams, represented by a (partial) ancestral graph. We present two main results. The first result extends Pearl’s (1995) celebrated do-calculus to the context of ancestral graphs. In the second result, we focus on a key component of Pearl’s calculus, the property of invariance under interventions, and give stronger graphical conditions for this property than those implied by the first result. The second result also improves the earlier, similar results due to Spirtes et al. (1993).

Keywords: ancestral graphs, causal Bayesian network, do-calculus, intervention
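As a rough illustration of what "relating post-intervention probabilities to pre-intervention probabilities" means in the simplest setting, the sketch below uses a single, fully known directed acyclic graph rather than the ancestral-graph setting the paper addresses. The graph (Z -> X, Z -> Y, X -> Y), the probability tables, and the function names are all hypothetical and chosen only for this example. It contrasts the ordinary conditional P(Y=1 | X=x) with the post-intervention quantity P(Y=1 | do(X=x)), which is obtained by dropping the factor P(x | z) from the joint factorization.

```python
# Hypothetical causal Bayesian network: Z -> X, Z -> Y, X -> Y (all binary).
# Every number below is made up for illustration; none comes from the paper.
p_z = {0: 0.5, 1: 0.5}                                      # P(Z=z)
p_x_given_z = {0: {0: 0.8, 1: 0.2}, 1: {0: 0.2, 1: 0.8}}    # p_x_given_z[z][x] = P(X=x | Z=z)
p_y1_given_xz = {(0, 0): 0.1, (0, 1): 0.5,
                 (1, 0): 0.3, (1, 1): 0.7}                  # P(Y=1 | X=x, Z=z), keyed by (x, z)


def p_y1_do_x(x):
    """P(Y=1 | do(X=x)): the factor P(x | z) is removed, so Z keeps its marginal."""
    return sum(p_z[z] * p_y1_given_xz[(x, z)] for z in p_z)


def p_y1_given_x(x):
    """Ordinary conditional P(Y=1 | X=x) computed from the pre-intervention joint."""
    joint = sum(p_z[z] * p_x_given_z[z][x] * p_y1_given_xz[(x, z)] for z in p_z)
    marginal = sum(p_z[z] * p_x_given_z[z][x] for z in p_z)
    return joint / marginal


if __name__ == "__main__":
    for x in (0, 1):
        print(x, p_y1_given_x(x), p_y1_do_x(x))
```

Because Z influences both X and Y in this toy graph, the two quantities differ. Both are computed from pre-intervention tables only, which is the kind of identification the do-calculus is meant to license.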


reference text

R.A. Ali, T. Richardson, and P. Spirtes. Markov equivalence for ancestral graphs. Technical Report 466, Department of Statistics, University of Washington, 2004.
S. Andersson, D. Madigan, and M. Pearlman. A characterization of Markov equivalence classes of acyclic digraphs. The Annals of Statistics 25(2):505-541, 1997.
D.M. Chickering. A transformational characterization of equivalent Bayesian network structures. In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pages 87-98, Morgan Kaufmann, 1995.
D.M. Chickering. Optimal structure identification with greedy search. Journal of Machine Learning Research 3:507-554, 2002.
D. Geiger, T. Verma, and J. Pearl. Identifying independence in Bayesian networks. Networks 20:507-534, 1990.
Y. Huang and M. Valtorta. Pearl’s calculus of intervention is complete. In Proceedings of the 22nd Conference on Uncertainty in Artificial Intelligence, pages 217-224, AUAI Press, 2006.
C. Meek. Causal inference and causal explanation with background knowledge. In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pages 403-411, Morgan Kaufmann, 1995a.
C. Meek. Strong completeness and faithfulness in Bayesian networks. In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pages 411-418, Morgan Kaufmann, 1995b.
J. Pearl. Causal diagrams for empirical research. Biometrika 82:669-710, 1995.
J. Pearl. Graphs, causality and structural equation models. Sociological Methods and Research 27:226-284, 1998.
J. Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, Cambridge, UK, 2000.
J.W. Pratt and R. Schlaifer. On the interpretation and observation of laws. Journal of Econometrics 39:23-52, 1988.
T. Richardson and P. Spirtes. Ancestral graph Markov models. The Annals of Statistics 30(4):962-1030, 2002.
T. Richardson and P. Spirtes. Causal inference via ancestral graph models. In P. Green, N. Hjort, and S. Richardson, editors, Highly Structured Stochastic Systems. Oxford University Press, USA, 2003.
J. Robins. A new approach to causal inference in mortality studies with sustained exposure periods - applications to control of the healthy worker survivor effect. Mathematical Modeling 7:1393-1512, 1986.
S. Shimizu, P.O. Hoyer, A. Hyvarinen, and A. Kerminen. A linear non-Gaussian acyclic model for causal discovery. Journal of Machine Learning Research 7:2003-2030, 2006.
I. Shpitser and J. Pearl. Identification of conditional interventional distributions. In Proceedings of the 22nd Conference on Uncertainty in Artificial Intelligence, pages 437-444, AUAI Press, 2006.
P. Spirtes, C. Glymour, and R. Scheines. Causation, Prediction and Search. Springer-Verlag, New York, 1993. (2nd ed., MIT Press, Cambridge, MA, 2000.)
P. Spirtes, C. Meek, and T. Richardson. An algorithm for causal inference in the presence of latent variables and selection bias. In C. Glymour and G.F. Cooper, editors, Computation, Causation, and Discovery. MIT Press, Cambridge, MA, 1999.
P. Spirtes and T. Richardson. A polynomial time algorithm for determining DAG equivalence in the presence of latent variables and selection bias. In Proceedings of the 6th International Workshop on Artificial Intelligence and Statistics, 1996. URL http://citeseer.ist.psu.edu/spirtes97polynomial.html.
P. Spirtes and T. Verma. Equivalence of causal models with latent variables. Technical Report Phil-36, Department of Philosophy, Carnegie Mellon University, 1992.
J. Tian and J. Pearl. On the identification of causal effects. Technical Report, Department of Computer Science, Iowa State University, 2004.
J. Tian. Generating Markov equivalent maximal ancestral graphs by single edge replacement. In Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence, pages 591-598, AUAI Press, 2005.
C. Winship and L.S. Morgan. The estimation of causal effects from observational data. Annual Review of Sociology 25:659-706, 1999.
J. Zhang and P. Spirtes. A transformational characterization of Markov equivalence for directed acyclic graphs with latent variables. In Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence, pages 667-674, AUAI Press, 2005.
J. Zhang. Causal Inference and Reasoning in Causally Insufficient Systems. PhD dissertation, Department of Philosophy, Carnegie Mellon University, 2006. URL www.hss.caltech.edu/~jiji/dissertation.pdf.
J. Zhang. On the completeness of orientation rules for causal discovery in the presence of latent confounders and selection bias. Artificial Intelligence, forthcoming.
H. Zhao, Z. Zheng, and B. Liu. On the Markov equivalence of maximal ancestral graphs. Science in China (Mathematics) 48(4):548-562, 2005.