nips nips2011 nips2011-194 nips2011-194-reference knowledge-graph by maker-knowledge-mining

194 nips-2011-On Causal Discovery with Cyclic Additive Noise Models


Source: pdf

Author: Joris M. Mooij, Dominik Janzing, Tom Heskes, Bernhard Schölkopf

Abstract: We study a particular class of cyclic causal models, where each variable is a (possibly nonlinear) function of its parents and additive noise. We prove that the causal graph of such models is generically identifiable in the bivariate, Gaussian-noise case. We also propose a method to learn such models from observational data. In the acyclic case, the method reduces to ordinary regression, but in the more challenging cyclic case, an additional term arises in the loss function, which makes it a special case of nonlinear independent component analysis. We illustrate the proposed method on synthetic data. 1


reference text

[1] J. Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, 2000.

[2] K. A. Bollen. Structural Equations with Latent Variables. John Wiley & Sons, 1989.

[3] P. Spirtes, C. Glymour, and R. Scheines. Causation, Prediction, and Search. Springer-Verlag, 1993. (2nd ed. MIT Press 2000).

[4] C.W.J. Granger. Investigating causal relations by econometric models and cross-spectral methods. Econometrica, 37:424438, 1969.

[5] N. Friedman, K. Murphy, and S. Russell. Learning the structure of dynamic probabilistic networks. In Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98), pages 139–147, 1998.

[6] P. Spirtes. Directed cyclic graphical representations of feedback models. In Proceedings of the 11th Conference on Uncertainty in Artificial Intelligence (UAI-95), page 491499, 1995.

[7] T. Richardson. A discovery algorithm for directed cyclic graphs. In Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI-1996), 1996.

[8] G. Lacerda, P. Spirtes, J. Ramsey, and P. O. Hoyer. Discovering cyclic causal models by independent components analysis. In Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence (UAI-2008), 2008.

[9] M. Schmidt and K. Murphy. Modeling discrete interventional data using directed cyclic graphical models. In Proceedings of the 25th Annual Conference on Uncertainty in Artificial Intelligence (UAI-09), 2009.

[10] S. Itani, M. Ohannessian, K. Sachs, G. P. Nolan, and M. A. Dahleh. Structure learning in causal cyclic networks. In JMLR Workshop and Conference Proceedings, volume 6, page 165176, 2010.

[11] P.O. Hoyer, D.Janzing, J.M.Mooij, J.Peters, and B.Sch¨ lkopf. Nonlinear causal discovery o with additive noise models. In D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, editors, Advances in Neural Information Processing Systems 21 (NIPS*2008), pages 689–696, 2009.

[12] Jonas Peters, Joris M. Mooij, Dominik Janzing, and Bernhard Sch¨ lkopf. Identifiability of o causal graphs using functional models. In Proceedings of the 27th Annual Conference on Uncertainty in Artificial Intelligence (UAI-11), 2011.

[13] K. Zhang and A. Hyv¨ rinen. On the identifiability of the post-nonlinear causal model. In Proa ceedings of the 25th Conference on Uncertainty in Artificial Intelligence (UAI-09), Montreal, Canada, 2009.

[14] A.D. Polyanin and V.F. Zaitsev. Handbook of Nonlinear Partial Differential Equations. Chapman & Hall / CRC, 2004.

[15] A. Gretton, R. Herbrich, A. Smola, O. Bousquet, and B. Sch¨ lkopf. Kernel methods for o measuring independence. Journal of Machine Learning Research, 6:2075–2129, 2005. 9