cvpr cvpr2013 cvpr2013-406 cvpr2013-406-reference knowledge-graph by maker-knowledge-mining

406 cvpr-2013-Spatial Inference Machines


Source: pdf

Author: Roman Shapovalov, Dmitry Vetrov, Pushmeet Kohli

Abstract: This paper addresses the problem of semantic segmentation of 3D point clouds. We extend the inference machines framework of Ross et al. by adding spatial factors that model mid-range and long-range dependencies inherent in the data. The new model is able to account for semantic spatial context. During training, our method automatically isolates and retains factors modelling spatial dependencies between variables that are relevant for achieving higher prediction accuracy. We evaluate the proposed method by using it to predict 1 7-category semantic segmentations on sets of stitched Kinect scans. Experimental results show that the spatial dependencies learned by our method significantly improve the accuracy of segmentation. They also show that our method outperforms the existing segmentation technique of Koppula et al.


reference text

[1] L. Breiman. Random forests. Machine Learning, 45(1):5–32, 2001. 3, 6

[2] E. Brill. A simple rule-based part of speech tagger. In ACL, pages 112–1 16, Trento, IT, 1992. 1

[3] C. Desai, D. Ramanan, and C. Fowlkes. Discriminative models for multi-class object layout. In ICCV, pages 229–236, 2009. 2, 5

[4] J. Domke. Parameter Learning with Truncated Message-Passing. In CVPR, number x, pages 2937–2944, Colorado Springs, CO, 2011. 2

[5] B. Fulkerson, A. Vedaldi, and S. Soatto. Class segmentation and object localization with superpixel neighborhoods. In ICCV, pages 670–677, 2009. 1

[6] G. Heitz and D. Koller. Learning spatial context: Using stuff to find things. In ECCV, pages 30–43, Marseille, France, 2008. 2, 5

[7] P. Kohli, L. Ladick´ y, and P. H. S. Torr. Robust Higher Order Potentials for Enforcing Label Consistency. IJCV, 82(3):302–324, Jan. 2009. 1

[8] H. S. Koppula, A. Anand, T. Joachims, and A. Saxena. Semantic Labeling of 3D Point Clouds for Indoor Scenes. In NIPS, Granada, ES, 2011. 1, 2, 5, 6, 7, 8

[9] P. Kr ¨ahenb u¨hl and V. Koltun. Efficient inference in fully connected crfs with gaussian edge potentials. In NIPS, 2011. 1

[10] A. Montillo, J. Shotton, J. M. Winn, J. E. Iglesias, D. N. Metaxas, and A. Criminisi. Entangled decision forests and their application for semantic segmentation of ct images. In IPMI, 2011. 1

[11] D. Munoz, J. A. Bagnell, and M. Hebert. Stacked hierarchical labeling. In ECCV, Heraklion, Grece, 2010. 1, 4

[12] D. Munoz, J. A. Bagnell, N. Vandapel, and M. Hebert. Contextual classification with functional Max-Margin Markov Networks. In CVPR, pages 975–982, Miami, FL, June 2009. 1, 2

[13] S. Nowozin and C. H. Lampert. Global connectivity potentials for random field models. In CVPR, pages 818–825, June 2009. 1

[14] A. Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora, and S. Belongie. Objects in Context. In ICCV, 2007. 2

[15] S. Ross, D. Munoz, M. Hebert, and J. A. Bagnell. Learning MessagePassing Inference Machines for Structured Prediction. In CVPR, pages 2737–2744, Colorado Springs, CO, 2011. 1, 2, 3, 4, 8

[16] R. Shapovalov and A. Velizhev. Cutting-Plane Training of Nonassociative Markov Network for 3D Point Cloud Segmentation. In 3DIMPVT, pages 1–8, Hangzhou, China, 2011. 2

[17] J. Shotton, M. Johnson, and R. Cipolla. Semantic texton forests for image categorization and segmentation. In CVPR, June 2008. 1, 2

[18] R. Szeliski, R. Zabih, D. Scharstein, O. Veksler, V. Kolmogorov, A. Agarwala, M. Tappen, and C. Rother. A comparative study of energy minimization methods for Markov random fields. LNCS, 3952(6):16–29, 2006. 1

[19] Z. Tu. Auto-context and its application to high-level vision tasks. In CVPR, Anchorage, AL, June 2008. 1, 2

[20] O. J. Woodford, C. Rother, and V. Kolmogorov. A global perspective on MAP inference for low-level vision. In ICCV, number Iccv, pages 2319–2326. ICCV, 2009. 1

[21] X. Xiong, D. Munoz, J. A. Bagnell, and M. Hebert. 3-D Scene Analysis via Sequenced Predictions over Points and Regions. In ICRA, Shanghai, China, 2011. 1 222999999200