iccv iccv2013 iccv2013-33 iccv2013-33-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Fabio Galasso, Naveen Shankar Nagaraja, Tatiana Jiménez Cárdenas, Thomas Brox, Bernt Schiele
Abstract: Video segmentation research is currently limited by the lack of a benchmark dataset that covers the large variety of subproblems appearing in video segmentation and that is large enough to avoid overfitting. Consequently, there is little analysis of video segmentation which generalizes across subtasks, and it is not yet clear which and how video segmentation should leverage the information from the still-frames, as previously studied in image segmentation, alongside video specific information, such as temporal volume, motion and occlusion. In this work we provide such an analysis based on annotations of a large video dataset, where each video is manually segmented by multiple persons. Moreover, we introduce a new volume-based metric that includes the important aspect of temporal consistency, that can deal with segmentation hierarchies, and that reflects the tradeoff between over-segmentation and segmentation accuracy.
[1] P. Arbeláez, M. Maire, C. Fowlkes, and J. Malik. Contour detection and hierarchical image segmentation. PAMI, 33(5):898–916, 2011.
[2] W. Brendel and S. Todorovic. Video object segmentation by tracking regions. In ICCV, 2009.
[3] G. J. Brostow, J. Fauqueur, and R. Cipolla. Semantic object classes in video: A high-definition ground truth database. PRL, 30:88–97, 2009.
[4] T. Brox and J. Malik. Object segmentation by long term analysis of point trajectories. In ECCV, 2010.
[5] A. Y. C. Chen and J. J. Corso. Propagating multi-class pixel labels throughout video frames. In Western NY Image Workshop, 2010.
[6] H.-T. Cheng and N. Ahuja. Exploiting nonlocal spatiotemporal structure for video segmentation. In CVPR, 2012.
[7] J. Corso, E. Sharon, S. Dube, S. El-Saden, U. Sinha, and A. Yuille. Efficient multilevel brain tumor segmentation with integrated Bayesian model classification. TMI, 27(5):629–640, 2008.
[8] D. DeMenthon. Spatio-temporal segmentation of video by hierarchical mean shift analysis. SMVP, 2002.
[9] K. Fragkiadaki and J. Shi. Video segmentation by tracing discontinuities in a trajectory embedding. In CVPR, 2012.
[10] F. Galasso, R. Cipolla, and B. Schiele. Video segmentation with superpixels. In ACCV, 2012.
[11] F. Galasso, M. Iwasaki, K. Nobori, and R. Cipolla. Spatiotemporal clustering of probabilistic region trajectories. In ICCV, 2011.
[12] H. Greenspan, J. Goldberger, and A. Mayer. A probabilistic framework for spatio-temporal video representation. In ECCV, 2002.
[13] M. Grundmann, V. Kwatra, M. Han, and I. Essa. Efficient hierarchical graph-based video segmentation. In CVPR, 2010.
[14] A. Kannan, N. Jojic, and B. J. Frey. Generative model for layers of appearance and deformation. In AISTATS, 2005.
[15] M. P. Kumar, P. Torr, and A. Zisserman. Learning layered motion segmentations of video. (76):301–319, 2008.
[16] A. Levinshtein, C. Sminchisescu, and S. Dickinson. Spatiotemporal closure. In ACCV, 2010.
[17] J. Lezama, K. Alahari, J. Sivic, and I. Laptev. Track to the future: Spatio-temporal video segmentation with long-range motion cues. In CVPR, 2011.
[18] D. Martin, C. Fowlkes, D. Tal, and J. Malik. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In ICCV, 2001.
[19] P. Ochs and T. Brox. Object segmentation in video: a hierarchical variational approach for turning point trajectories into dense regions. In ICCV, 2011.
[20] S. Paris. Edge-preserving smoothing and mean-shift segmentation of video streams. In ECCV, 2008.
[21] J. Shi and J. Malik. Normalized cuts and image segmentation. PAMI, 2000.
[22] A. Stein, D. Hoiem, and M. Hebert. Learning to find object boundaries using motion cues. In ICCV, 2007.
[23] P. Sundberg, T. Brox, M. Maire, P. Arbeláez, and J. Malik. Occlusion boundary detection and figure/ground assignment from optical flow. In CVPR, 2011.
[24] R. Tron and R. Vidal. A benchmark for the comparison of 3-D motion segmentation algorithms. In CVPR, 2007.
[25] D. Tsai, M. Flagg, and J. M. Rehg. Motion coherent tracking with multi-label MRF optimization. In BMVC, 2010.
[26] R. Unnikrishnan, C. Pantofaru, and M. Hebert. Toward objective evaluation of image segmentation algorithms. PAMI, 2007.
[27] A. Vazquez-Reina, S. Avidan, H. Pfister, and E. Miller. Multiple hypothesis video segmentation from superpixel flows. In ECCV, 2010.
[28] C. Xu and J. J. Corso. Evaluation of super-voxel methods for early video processing. In CVPR, 2012.
[29] C. Xu, C. Xiong, and J. J. Corso. Streaming hierarchical video segmentation. In ECCV, 2012.
[30] C. Zach, T. Pock, and H. Bischof. A duality based approach for realtime TV-L1 optical flow. In DAGM, 2007.