TY - JOUR
T1 - Weakly supervised fine-grained categorization with part-based image representation
AU - Zhang, Yu
AU - Wei, Xiu Shen
AU - Wu, Jianxin
AU - Cai, Jianfei
AU - Lu, Jiangbo
AU - Nguyen, Viet Anh
AU - Do, Minh N.
N1 - Funding Information:
Y. Zhang, J. Lu, V.-A. Nguyen, and M. N. Do are supported by the research grant for the Human-Centered Cyber- physical Systems Programme at the Advanced Digital Sciences Center from Singapores Agency for Science, Technology and Research (A∗STAR). J. Wu is supported in part by the National Natural Science Foundation of China under Grant No. 61422203. J. Cai is supported in part by Singapore MoE AcRF Tier-1 Grant RG138/14. M. N. Do is supported in part by the US National Science Foundation (NSF) grants CCF-1218682 and IIS 11-16012.
Publisher Copyright:
© 1992-2012 IEEE.
PY - 2016/4
Y1 - 2016/4
N2 - In this paper, we propose a fine-grained image categorization system with easy deployment. We do not use any object/part annotation (weakly supervised) in the training or in the testing stage, but only class labels for training images. Fine-grained image categorization aims to classify objects with only subtle distinctions (e.g., two breeds of dogs that look alike). Most existing works heavily rely on object/part detectors to build the correspondence between object parts, which require accurate object or object part annotations at least for training images. The need for expensive object annotations prevents the wide usage of these methods. Instead, we propose to generate multi-scale part proposals from object proposals, select useful part proposals, and use them to compute a global image representation for categorization. This is specially designed for the weakly supervised fine-grained categorization task, because useful parts have been shown to play a critical role in existing annotation-dependent works, but accurate part detectors are hard to acquire. With the proposed image representation, we can further detect and visualize the key (most discriminative) parts in objects of different classes. In the experiments, the proposed weakly supervised method achieves comparable or better accuracy than the state-of-the-art weakly supervised methods and most existing annotation-dependent methods on three challenging datasets. Its success suggests that it is not always necessary to learn expensive object/part detectors in fine-grained image categorization.
AB - In this paper, we propose a fine-grained image categorization system with easy deployment. We do not use any object/part annotation (weakly supervised) in the training or in the testing stage, but only class labels for training images. Fine-grained image categorization aims to classify objects with only subtle distinctions (e.g., two breeds of dogs that look alike). Most existing works heavily rely on object/part detectors to build the correspondence between object parts, which require accurate object or object part annotations at least for training images. The need for expensive object annotations prevents the wide usage of these methods. Instead, we propose to generate multi-scale part proposals from object proposals, select useful part proposals, and use them to compute a global image representation for categorization. This is specially designed for the weakly supervised fine-grained categorization task, because useful parts have been shown to play a critical role in existing annotation-dependent works, but accurate part detectors are hard to acquire. With the proposed image representation, we can further detect and visualize the key (most discriminative) parts in objects of different classes. In the experiments, the proposed weakly supervised method achieves comparable or better accuracy than the state-of-the-art weakly supervised methods and most existing annotation-dependent methods on three challenging datasets. Its success suggests that it is not always necessary to learn expensive object/part detectors in fine-grained image categorization.
KW - Fine grained categorization
KW - part selection
KW - weakly supervised
UR - http://www.scopus.com/inward/record.url?scp=84964246694&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84964246694&partnerID=8YFLogxK
U2 - 10.1109/TIP.2016.2531289
DO - 10.1109/TIP.2016.2531289
M3 - Article
AN - SCOPUS:84964246694
SN - 1057-7149
VL - 25
SP - 1713
EP - 1725
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
IS - 4
M1 - 7410088
ER -