Online Bayesian dictionary learning for large datasets

Lingbo Li, Jorge Silva, Mingyuan Zhou, Lawrence Carin

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Scopus citations


The problem of learning a data-adaptive dictionary for a very large collection of signals is addressed. This paper proposes a statistical model and associated variational Bayesian (VB) inference for simultaneously learning the dictionary and performing sparse coding of the signals. The model builds upon beta process factor analysis (BPFA), with the number of factors automatically inferred, and posterior distributions are estimated for both the dictionary and the signals. Crucially, an online learning procedure is employed, allowing scalability to very large datasets which would be beyond the capabilities of existing batch methods. State-of-the-art performance is demonstrated by experiments with large natural images containing tens of millions of pixels. © 2012 IEEE.
Original languageEnglish (US)
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Number of pages4
StatePublished - Oct 23 2012
Externally publishedYes


Dive into the research topics of 'Online Bayesian dictionary learning for large datasets'. Together they form a unique fingerprint.

Cite this