A biclustering algorithm for binary matrices based on penalized Bernoulli likelihood

Seokho Lee, Jianhua Z. Huang

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

We propose a new biclustering method for binary data matrices using the maximum penalized Bernoulli likelihood estimation. Our method applies a multi-layer model defined on the logits of the success probabilities, where each layer represents a simple bicluster structure and the combination of multiple layers is able to reveal complicated, multiple biclusters. The method allows for non-pure biclusters, and can simultaneously identify the 1-prevalent blocks and 0-prevalent blocks. A computationally efficient algorithm is developed and guidelines are provided for specifying the tuning parameters, including initial values of model parameters, the number of layers, and the penalty parameters. Missing-data imputation can be handled in the EM framework. The method is tested using synthetic and real datasets and shows good performance. © 2013 Springer Science+Business Media New York.
Original languageEnglish (US)
Pages (from-to)429-441
Number of pages13
JournalStatistics and Computing
Volume24
Issue number3
DOIs
StatePublished - Jan 31 2013
Externally publishedYes

Fingerprint

Dive into the research topics of 'A biclustering algorithm for binary matrices based on penalized Bernoulli likelihood'. Together they form a unique fingerprint.

Cite this