HUSCAP logo Hokkaido Univ. logo

Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >

Bregman pooling : feature-space local pooling for image classification

Files in This Item:
BP.pdf345.82 kBPDFView/Open
Please use this identifier to cite or link to this item:http://hdl.handle.net/2115/62753

Title: Bregman pooling : feature-space local pooling for image classification
Authors: Najjar, Alameen Browse this author
Ogawa, Takahiro Browse this author →KAKEN DB
Haseyama, Miki Browse this author →KAKEN DB
Keywords: Image classification
Image representation
Feature pooling
Co-clustering
Bregman divergence
Issue Date: 4-Sep-2015
Publisher: Springer
Journal Title: International Journal of Multimedia Information Retrieval
Volume: 2015
Issue: 4
Start Page: 247
End Page: 259
Publisher DOI: 10.1007/s13735-015-0086-z
Abstract: In this paper, we propose a novel feature-space local pooling method for the commonly adopted architecture of image classification. While existing methods partition the feature space based on visual appearance to obtain pooling bins, learning more accurate space partitioning that takes semantics into account boosts performance even for a smaller number of bins. To this end, we propose partitioning the feature space over clusters of visual prototypes common to semantically similar images (i.e., images belonging to the same category). The clusters are obtained by Bregman co-clustering applied offline on a subset of training data. Therefore, being aware of the semantic context of the input image, our features have higher discriminative power than do those pooled from appearance-based partitioning. Testing on four datasets (Caltech-101, Caltech-256, 15 Scenes, and 17 Flowers) belonging to three different classification tasks showed that the proposed method outperforms methods in previous works on local pooling in the feature space for less feature dimensionality. Moreover, when implemented within a spatial pyramid, our method achieves comparable results on three of the datasets used.
Rights: The final publication is available at link.springer.com
Type: article (author version)
URI: http://hdl.handle.net/2115/62753
Appears in Collections:情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)

Submitter: Alameen Najjar

Export metadata:

OAI-PMH ( junii2 , jpcoar_1.0 )

MathJax is now OFF:


 

 - Hokkaido University