Hybrid pooling fusion in the BoW pipeline

Marc Law; Nicolas Thome; Matthieu Cord

Conference ProceedingsOPEN ACCESS

Hybrid pooling fusion in the BoW pipeline

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7585 LNCS(PART 3) 355-364

DOI: 10.1007/978-3-642-33885-4_36

4Citations

16Readers

Abstract

In the context of object and scene recognition, state-of-the-art performances are obtained with Bag of Words (BoW) models of mid-level representations computed from dense sampled local descriptors (e.g. SIFT). Several methods to combine low-level features and to set mid-level parameters have been evaluated recently for image classification. In this paper, we further investigate the impact of the main parameters in the BoW pipeline. We show that an adequate combination of several low (sampling rate, multiscale) and mid level (codebook size, normalization) parameters is decisive to reach good performances. Based on this analysis, we propose a merging scheme exploiting the specificities of edge-based descriptors. Low and high-contrast regions are pooled separately and combined to provide a powerful representation of images. Sucessful experiments are provided on the Caltech-101 and Scene-15 datasets. © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Law, M., Thome, N., & Cord, M. (2012). Hybrid pooling fusion in the BoW pipeline. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7585 LNCS, pp. 355–364). Springer Verlag. https://doi.org/10.1007/978-3-642-33885-4_36

Hybrid pooling fusion in the BoW pipeline

Abstract

Cite

Register to see more suggestions