nips nips2013 nips2013-318 nips2013-318-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Justin Domke
Abstract: A successful approach to structured learning is to write the learning objective as a joint function of linear parameters and inference messages, and iterate between updates to each. This paper observes that if the inference problem is “smoothed” through the addition of entropy terms, for fixed messages, the learning objective reduces to a traditional (non-structured) logistic regression problem with respect to parameters. In these logistic regression problems, each training example has a bias term determined by the current set of messages. Based on this insight, the structured energy function can be extended from linear factors to any function class where an “oracle” exists to minimize a logistic loss.
[1] Stephen Boyd and Lieven Vandenberghe. Convex Optimization. Cambridge University Press, 2004.
[2] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, 2005.
[3] Chaitanya Desai, Deva Ramanan, and Charless C. Fowlkes. Discriminative models for multi-class object layout. International Journal of Computer Vision, 95(1):1–12, 2011.
[4] Thomas G. Dietterich, Adam Ashenfelter, and Yaroslav Bulatov. Training conditional random fields via gradient tree boosting. In ICML, 2004.
[5] Justin Domke. Learning graphical model parameters with approximate marginal inference. PAMI, 35(10):2454–2467, 2013.
[6] Thomas Finley and Thorsten Joachims. Training structural svms when exact inference is intractable. In ICML, 2008.
[7] Jerome H. Friedman. Stochastic gradient boosting. Computational Statistics and Data Analysis, 38:367– 378, 1999.
[8] Stephen Gould, Jim Rodgers, David Cohen, Gal Elidan, and Daphne Koller. Multi-class segmentation with relative location prior. IJCV, 80(3):300–316, 2008.
[9] Tamir Hazan and Raquel Urtasun. Efficient learning of structured predictors in general graphical models. CoRR, abs/1210.2346, 2012.
[10] Xuming He, Richard S. Zemel, and Miguel Á. Carreira-Perpiñán. Multiscale conditional random fields for image labeling. In CVPR, 2004.
[11] Tom Heskes. Convexity arguments for efficient minimization of the bethe and kikuchi free energies. J. Artif. Intell. Res. (JAIR), 26:153–190, 2006.
[12] Sanjiv Kumar and Martial Hebert. Discriminative fields for modeling spatial dependencies in natural images. In NIPS, 2003.
[13] Lubor Ladicky, Christopher Russell, Pushmeet Kohli, and Philip H. S. Torr. Associative hierarchical CRFs for object class image segmentation. In ICCV, 2009.
[14] André F. T. Martins, Noah A. Smith, and Eric P. Xing. Polyhedral outer approximations with application to natural language parsing. In ICML, 2009.
[15] Ofer Meshi, Tommi Jaakkola, and Amir Globerson. Convergence rate analysis of MAP coordinate minimization algorithms. In NIPS. 2012.
[16] Ofer Meshi, David Sontag, Tommi Jaakkola, and Amir Globerson. Learning efficiently with approximate inference via dual losses. In ICML, 2010.
[17] Sebastian Nowozin, Peter V. Gehler, and Christoph H. Lampert. On parameter learning in CRF-based approaches to object class image segmentation. In ECCV, 2010.
[18] Sebastian Nowozin, Carsten Rother, Shai Bagon, Toby Sharp, Bangpeng Yao, and Pushmeet Kohli. Decision tree fields. In ICCV, 2011.
[19] Florian Schroff, Antonio Criminisi, and Andrew Zisserman. Object class segmentation using random forests. In BMVC, 2008.
[20] Jamie Shotton, John M. Winn, Carsten Rother, and Antonio Criminisi. Textonboost for image understanding: Multi-class object recognition and segmentation by jointly modeling texture, layout, and context. IJCV, 81(1):2–23, 2009.
[21] Nathan Silberman and Rob Fergus. Indoor scene segmentation using a structured light sensor. In ICCV Workshops, 2011.
[22] Benjamin Taskar, Carlos Guestrin, and Daphne Koller. Max-margin markov networks. In NIPS, 2003.
[23] Jakob J. Verbeek and Bill Triggs. Scene segmentation with crfs learned from partially labeled images. In NIPS, 2007.
[24] John M. Winn and Jamie Shotton. The layout consistent random field for recognizing and segmenting partially occluded objects. In CVPR, 2006.
[25] Jianxiong Xiao and Long Quan. Multiple view semantic segmentation for street view images. In ICCV, 2009. 9