Abstract
Image categorization is a challenging problem when a label is provided for the entire training image only instead of the object region. To eliminate labeling ambiguity, image categorization and object localization should be performed simultaneously. Discriminative Multiple Instance Learning (MIL) can be used for this task by regarding each image as a bag and sub-windows in the image as instances. Learning a discriminative MI classifier requires an iterative solution. In each round, positive sub-windows for the next round should be selected. With standard approaches, selecting only one positive sub-window per positive bag may limit the search space for global optimum; meanwhile, selecting all temporal positive sub-windows may add noise into learning. We select a subset of sub-windows per positive bag to avoid those limitations. Spatial relations between sub-windows are used as clues for selection. Experimental results demonstrate that our approach outperforms previous discriminative MIL approaches and standard categorization approaches.
Chapter PDF
Similar content being viewed by others
References
Dietterich, T., Lathrop, R., Lozano-Perez, T.: Solving the Multiple-Instance Problem with Axis-Parallel Rectangles. Artificial Intelligence, 31–71 (1997)
Maronand, O., Lozano-Perez, T.: A Framework for Multiple Instance Learning. In: Advances in Neural Information Processing Systems, pp. 570–576 (1998)
Zhang, Q., Goldman, S.: EM-DD: An Improved Multiple Instance Learning Technique. In: Advances in Neural Information Processing Systems, pp. 1073–1080 (2002)
Chen, Y., Wang, J.Z.: Image Categorization by Learning and Reasoning with Regions. Journal of Machine Learning Research, 913–939 (2004)
Andrews, S., Tsochantaridi, I., Hofmann, T.: Support Vector Machines for Multiple-Instance Learning. In: Advances in Neural Information Processing Systems, pp. 561–568 (2003)
Chen, Y., Bi, J., Wang, J.: MILES: Multiple-Instance Learning via Embedded Instance Selection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1931–1947 (2006)
Fu, Z., Robles-Kelly, A.: An Instance Selection Approach to Multiple Instance Learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 911–918 (2009)
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In: Workshop on Generative-Model Based Vision, IEEE Conference on Computer Vision and Pattern Recognition (2004)
Nguyen, M.H., Torresani, L., Torre, F., Rother, C.: Weakly Supervised Discriminative Localization and Classification: A Joint Learning Process. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)
Galleguillos, C., Belongie, S.: Context Based Object Categorization: A Critical Survey. In: Computer Vision and Image Understanding (2010)
Marques, O., Barenholtz, E., Charvillat, V.: Context Modeling in Computer Vision: Techniques, Implications, and Applications. Journal of Multimedia Tools and Applications (2010)
Zha, Z.J., Hua, X.S., Mei, T., Wang, J., Qi, G.J., Wang, Z.: Joint Multi-Label Multi-Instance Learning for Image Classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Divvala, S.K., Hoiem, D., Hays, J.H., Efros, A., Hebert, M.: An Empirical Study of Context in Object Detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1271–1278 (2009)
Wolf, L., Bileschi, S.: A Critical View of Context. International Journal of Computer Vision (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ngo, T.D., Le, DD., Satoh, S. (2011). Improving Image Categorization by Using Multiple Instance Learning with Spatial Relation. In: Maino, G., Foresti, G.L. (eds) Image Analysis and Processing – ICIAP 2011. ICIAP 2011. Lecture Notes in Computer Science, vol 6978. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24085-0_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-24085-0_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24084-3
Online ISBN: 978-3-642-24085-0
eBook Packages: Computer ScienceComputer Science (R0)