nips nips2005 nips2005-131 nips2005-131-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Cha Zhang, John C. Platt, Paul A. Viola
Abstract: A good image object detection algorithm is accurate, fast, and does not require exact locations of objects in a training set. We can create such an object detector by taking the architecture of the Viola-Jones detector cascade and training it with a new variant of boosting that we call MILBoost. MILBoost uses cost functions from the Multiple Instance Learning literature combined with the AnyBoost framework. We adapt the feature selection criterion of MILBoost to optimize the performance of the Viola-Jones cascade. Experiments show that the detection rate is up to 1.6 times better using MILBoost. This increased detection rate shows the advantage of simultaneously learning the locations and scales of the objects in the training set along with the parameters of the classifier. 1
[1] S. Andrews and T. Hofmann. Multiple-instance learning via disjunctive programming boosting. In S. Thrun, L. K. Saul, and B. Sch¨ lkopf, editors, Proc. NIPS, volume 16. MIT Press, 2004. o
[2] P. Auer and R. Ortner. A boosting approach to multiple instance learning. In Lecture Notes in Computer Science, volume 3201, pages 63–74, October 2004.
[3] M. C. Burl, T. K. Leung, and P. Perona. Face localization via shape statistics. In Proc. Int’l Workshop on Automatic Face and Gesture Recognition, pages 154–159, 1995.
[4] T. G. Dietterich, R. H. Lathrop, and T. Lozano-Pérez. Solving the multiple instance problem with axis-parallel rectangles. Artif. Intell., 89(1-2):31–71, 1997.
[5] R. Fergus, P. Perona, and A. Zisserman. Object class recognition by unsupervised scaleinvariant learning. In Proc. CVPR, volume 2, pages 264–271, 2003.
[6] D. Heckerman. A tractable inference algorithm for diagnosing multiple diseases. In Proc. UAI, pages 163–171, 1989.
[7] J. D. Keeler, D. E. Rumelhart, and W.-K. Leow. Integrated segmentation and recognition of hand-printed numerals. In NIPS-3: Proceedings of the 1990 conference on Advances in neural information processing systems 3, pages 557–563, San Francisco, CA, USA, 1990. Morgan Kaufmann Publishers Inc.
[8] O. Maron and T. Lozano-Perez. A framework for multiple-instance learning. In Proc. NIPS, volume 10, pages 570–576, 1998.
[9] L. Mason, J. Baxter, P. Bartlett, and M. Frean. Boosting algorithms as gradient descent in function space, 1999.
[10] S. J. Nowlan and J. C. Platt. A convolutional neural network hand tracker. In G. Tesauro, D. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems, volume 7, pages 901–908. The MIT Press, 1995.
[11] R. E. Schapire and Y. Singer. Improved boosting algorithms using confidence-rated predictions. In Proc. COLT, volume 11, pages 80–91, 1998.
[12] C. Schmid and R. Mohr. Local grayvalue invariants for image retrieval. IEEE Trans. PAMI, 19(5):530–535, 1997.
[13] P. Viola and M. Jones. Robust real-time object detection. Int’l. J. Computer Vision, 57(2):137– 154, 2002.
[14] X. Xu and E. Frank. Logistic regression and boosting for labeled bags of instances. In Lecture Notes in Computer Science, volume 3056, pages 272–281, April 2004.