iccv iccv2013 iccv2013-286 iccv2013-286-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Kevin Matzen, Noah Snavely
Abstract: Geometry and geography can play an important role in recognition tasks in computer vision. To aid in studying connections between geometry and recognition, we introduce NYC3DCars, a rich dataset for vehicle detection in urban scenes built from Internet photos drawn from the wild, focused on densely trafficked areas of New York City. Our dataset is augmented with detailed geometric and geographic information, including full camera poses derived from structure from motion, 3D vehicle annotations, and geographic information from open resources, including road segmentations and directions of travel. NYC3DCars can be used to study new questions about using geometric information in detection tasks, and to explore applications of Internet photos in understanding cities. To demonstrate the utility of our data, we evaluate the use of the geographic information in our dataset to enhance a parts-based detection method, and suggest other avenues for future exploration.
[1] S. Agarwal, N. Snavely, I. Simon, S. M. Seitz, and R. Szeliski. Building Rome in a day. In ICCV, 2009.
[2] S. Bao, M. Bagra, Y.-W. Chao, and S. Savarese. Semantic structure from motion with points, regions, and objects. In CVPR, 2012.
[3] N. Cornelis, B. Leibe, K. Cornelis, and L. J. V. Gool. 3D urban scene modeling integrating recognition and reconstruction. IJCV, 78(2-3):121–141, 2008.
[4] M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL visual object classes (VOC) challenge. IJCV, 88(2):303–338, June 2010.
[5] P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained part-based models. PAMI, 32, 2010.
[6] D. F. Fouhey, V. Delaitre, A. Gupta, A. A. Efros, I. Laptev, and J. Sivic. People watching: Human actions as a cue for single-view geometry. In ECCV, 2012.
[7] A. Geiger, P. Lenz, and R. Urtasun. Are we ready for autonomous driving? The KITTI vision benchmark suite. In CVPR, 2012.
[8] R. B. Girshick, P. F. Felzenszwalb, and D. McAllester. Discriminatively trained deformable part models, release 5. http://people.cs.uchicago.edu/~rbg/latent-release5/.
[9] R. B. Girshick, P. F. Felzenszwalb, and D. A. McAllester. Object detection with grammar models. In NIPS, 2011.
[10] D. Glasner, M. Galun, S. Alpert, R. Basri, and G. Shakhnarovich. Viewpoint-aware object detection and pose estimation. In ICCV, 2011.
[11] J. Hays and A. Efros. IM2GPS: Estimating geographic information from a single image. In CVPR, 2008.
[12] V. Hedau, D. Hoiem, and D. A. Forsyth. Thinking inside the box: Using appearance models and context based on room geometry. In CVPR, 2010.
[13] M. Hejrati and D. Ramanan. Analyzing 3D objects in cluttered images. In NIPS, 2012.
[14] D. Hoiem, A. Efros, and M. Hebert. Geometric context from a single image. In ICCV, 2005.
[15] D. Hoiem, A. Efros, and M. Hebert. Putting objects in perspective. In CVPR, 2006.
[16] Y. Li, N. Snavely, D. Huttenlocher, and P. Fua. Worldwide pose estimation using 3D point clouds. In ECCV, 2012.
[17] J. Little, A. Abrams, and R. Pless. Tools for richer crowd source image annotations. In WACV, 2012.
[18] T. Malisiewicz, A. Gupta, and A. Efros. Ensemble of exemplar-SVMs for object detection and beyond. In ICCV, 2011.
[19] M. Ozuysal, V. Lepetit, and P. Fua. Pose estimation for category specific multiview object localization. In CVPR, 2009.
[20] B. Pepik, M. Stark, P. Gehler, and B. Schiele. Teaching 3D geometry to deformable part models. In CVPR, 2012.
[21] J. C. Platt. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In Advances in Large Margin Classifiers. MIT Press, 1999.
[22] B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman. LabelMe: A database and web-based tool for image annotation. IJCV, 77(1-3), 2008.
[23] S. Savarese and L. Fei-Fei. 3D generic object categorization, localization and pose estimation. In ICCV, 2007.
[24] M. Sun, S. Y.-Z. Bao, and S. Savarese. Object detection using geometrical context feedback. IJCV, 100(2), 2012.