TY - JOUR
T1 - A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data
AU - Teschendorff, Andrew E.
AU - Marabita, Francesco
AU - Lechner, Matthias
AU - Bartlett, Thomas
AU - Tegner, Jesper
AU - Gomez-Cabrero, David
AU - Beck, Stephan
N1 - Generated from Scopus record by KAUST IRTS on 2021-02-16
PY - 2013/1/15
Y1 - 2013/1/15
N2 - Motivation: The Illumina Infinium 450 k DNA Methylation Beadchip is a prime candidate technology for Epigenome-Wide Association Studies (EWAS). However, a difficulty associated with these beadarrays is that probes come in two different designs, characterized by widely different DNA methylation distributions and dynamic range, which may bias downstream analyses. A key statistical issue is therefore how best to adjust for the two different probe designs.Results: Here we propose a novel model-based intra-array normalization strategy for 450 k data, called BMIQ (Beta MIxture Quantile dilation), to adjust the beta-values of type2 design probes into a statistical distribution characteristic of type1 probes. The strategy involves application of a three-state beta-mixture model to assign probes to methylation states, subsequent transformation of probabilities into quantiles and finally a methylation-dependent dilation transformation to preserve the monotonicity and continuity of the data. We validate our method on cell-line data, fresh frozen and paraffin-embedded tumour tissue samples and demonstrate that BMIQ compares favourably with two competing methods. Specifically, we show that BMIQ improves the robustness of the normalization procedure, reduces the technical variation and bias of type2 probe values and successfully eliminates the type1 enrichment bias caused by the lower dynamic range of type2 probes. BMIQ will be useful as a preprocessing step for any study using the Illumina Infinium 450 k platform. © 2012 The Author. Published by Oxford University Press. All rights reserved.
AB - Motivation: The Illumina Infinium 450 k DNA Methylation Beadchip is a prime candidate technology for Epigenome-Wide Association Studies (EWAS). However, a difficulty associated with these beadarrays is that probes come in two different designs, characterized by widely different DNA methylation distributions and dynamic range, which may bias downstream analyses. A key statistical issue is therefore how best to adjust for the two different probe designs.Results: Here we propose a novel model-based intra-array normalization strategy for 450 k data, called BMIQ (Beta MIxture Quantile dilation), to adjust the beta-values of type2 design probes into a statistical distribution characteristic of type1 probes. The strategy involves application of a three-state beta-mixture model to assign probes to methylation states, subsequent transformation of probabilities into quantiles and finally a methylation-dependent dilation transformation to preserve the monotonicity and continuity of the data. We validate our method on cell-line data, fresh frozen and paraffin-embedded tumour tissue samples and demonstrate that BMIQ compares favourably with two competing methods. Specifically, we show that BMIQ improves the robustness of the normalization procedure, reduces the technical variation and bias of type2 probe values and successfully eliminates the type1 enrichment bias caused by the lower dynamic range of type2 probes. BMIQ will be useful as a preprocessing step for any study using the Illumina Infinium 450 k platform. © 2012 The Author. Published by Oxford University Press. All rights reserved.
UR - https://academic.oup.com/bioinformatics/article-lookup/doi/10.1093/bioinformatics/bts680
UR - http://www.scopus.com/inward/record.url?scp=84872564830&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/bts680
DO - 10.1093/bioinformatics/bts680
M3 - Article
C2 - 23175756
SN - 1367-4803
VL - 29
SP - 189
EP - 196
JO - Bioinformatics
JF - Bioinformatics
IS - 2
ER -