Regularized Max Pooling for Image Categorization

Minh Hoai

In Proceedings British Machine Vision Conference 2014


We propose Regularized Max Pooling (RMP) for image classification. RMP classifies an image (or an image region) by extracting feature vectors at multiple subwindows at multiple locations and scales. Unlike Spatial Pyramid Matching where the subwindows are defined purely based on geometric correspondence, RMP accounts for the deformation of discriminative parts. The amount of deformation and the discriminative ability for multiple parts are jointly learned during training. RMP outperforms the state-of-the-art performance by a wide margin on the challenging PASCAL VOC2012 dataset for human action recognition on still images.


Image Classification


Extended Abstract (PDF, 1 page, 132K)
Paper (PDF, 12 pages, 931K)
Bibtex File



Minh Hoai. Regularized Max Pooling for Image Categorization. Proceedings of the British Machine Vision Conference. BMVA Press, September 2014.


	title = {Regularized Max Pooling for Image Categorization},
	author = {Hoai, Minh},
	year = {2014},
	booktitle = {Proceedings of the British Machine Vision Conference},
	publisher = {BMVA Press},
	editors = {Valstar, Michel and French, Andrew and Pridmore, Tony}
	doi = { }