The Statistical Inefficiency of Sparse Coding for Images (or, One Gabor to Rule them All)

Bergstra, James; Courville, Aaron; Bengio, Yoshua

Computer Science > Computer Vision and Pattern Recognition

arXiv:1109.6638 (cs)

[Submitted on 29 Sep 2011 (v1), last revised 30 Sep 2011 (this version, v2)]

Abstract: Sparse coding is a proven principle for learning compact representations of images. However, sparse coding by itself often leads to very redundant dictionaries. With images, this often takes the form of similar edge detectors which are replicated many times at various positions, scales and orientations. An immediate consequence of this observation is that the estimation of the dictionary components is not statistically efficient. We propose a factored model in which factors of variation (e.g. position, scale and orientation) are untangled from the underlying Gabor-like filters. There is so much redundancy in sparse codes for natural images that our model requires only a single dictionary element (a Gabor-like edge detector) to outperform standard sparse coding. Our model scales naturally to arbitrary-sized images while achieving much greater statistical efficiency during learning. We validate this claim with a number of experiments showing, in part, superior compression of out-of-sample data using a sparse coding dictionary learned with only a single image.
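
To make the redundancy argument concrete, the following is a minimal sketch (not the authors' model or code) of how a whole dictionary of edge detectors could be generated from one Gabor-like prototype by varying only position, orientation and scale. The function names, the fixed prototype parameters (`wavelength`, `sigma`) and the chosen grids of transformations are all assumptions made for illustration; the abstract does not specify them.

```python
import math


def mother_gabor(u, v, wavelength=4.0, sigma=2.0):
    """Hypothetical prototype: isotropic Gaussian envelope times a cosine carrier along u."""
    envelope = math.exp(-(u * u + v * v) / (2.0 * sigma * sigma))
    return envelope * math.cos(2.0 * math.pi * u / wavelength)


def transformed_atom(size, theta, scale, shift):
    """Sample the prototype on a size x size grid after shifting, rotating and scaling the coordinates."""
    center = (size - 1) / 2.0
    dx, dy = shift
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    patch = []
    for row in range(size):
        line = []
        for col in range(size):
            # Translate so the filter is centred at (center + dx, center + dy).
            x = col - center - dx
            y = row - center - dy
            # Rotate by theta, then divide by scale to stretch the filter.
            u = (cos_t * x + sin_t * y) / scale
            v = (-sin_t * x + cos_t * y) / scale
            line.append(mother_gabor(u, v))
        patch.append(line)
    return patch


def build_dictionary(size, thetas, scales, shifts):
    """Every atom is the same prototype; only the transformation parameters differ."""
    return [
        transformed_atom(size, theta, scale, shift)
        for theta in thetas
        for scale in scales
        for shift in shifts
    ]


# Illustrative grids of transformations (assumed values).
thetas = [k * math.pi / 4 for k in range(4)]
scales = [1.0, 2.0]
shifts = [(-2, 0), (0, 0), (2, 0)]

dictionary = build_dictionary(9, thetas, scales, shifts)
print(len(dictionary))  # 4 orientations * 2 scales * 3 shifts = 24 atoms
```

In this sketch the 24 atoms share the two prototype parameters, whereas a standard sparse coding dictionary of the same size would estimate 24 * 81 independent pixel weights. That gap is the kind of statistical inefficiency the abstract describes, although the paper's factored model learns the prototype rather than fixing it by hand as done here.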

Comments:	9 pages, 8 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1109.6638 [cs.CV]
	(or arXiv:1109.6638v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1109.6638

Submission history

From: James Bergstra
[v1] Thu, 29 Sep 2011 19:47:00 UTC (113 KB)
[v2] Fri, 30 Sep 2011 15:27:25 UTC (107 KB)
