TY - JOUR
T1 - Testing for the presence of significant covariates through conditional marginal regression
AU - Tang, Yanlin
AU - Wang, Huixia Judy
AU - Barut, Emre
N1 - KAUST Repository Item: Exported on 2021-04-06
Acknowledgements: The authors thank the reviewers, associate editor and editor for constructive comments and helpful suggestions. The authors also thank Professor Soumendra Lahiri for helpful discussions. This research was supported by the U.S. National Science Foundation, the National Natural Science Foundation of China, and the King Abdullah University of Science & Technology.
This publication acknowledges KAUST support, but has no KAUST affiliated authors.
PY - 2017/12/8
Y1 - 2017/12/8
N2 - Researchers sometimes have a priori information on the relative importance of predictors that can be used to screen out covariates. An important question is whether any of the discarded covariates have predictive power when the most relevant predictors are included in the model. We consider testing whether any discarded covariate is significant conditional on some pre-chosen covariates. We propose a maximum-type test statistic and show that it has a nonstandard asymptotic distribution, giving rise to the conditional adaptive resampling test. To accommodate signals of unknown sparsity, we develop a hybrid test statistic, which is a weighted average of maximum- and sum-type statistics. We prove the consistency of the test procedure under general assumptions and illustrate how it can be used as a stopping rule in forward regression. We show, through simulation, that the proposed method provides adequate control of the familywise error rate with competitive power for both sparse and dense signals, even in high-dimensional cases, and we demonstrate its advantages in cases where the covariates are heavily correlated. We illustrate the application of our method by analysing an expression quantitative trait locus dataset.
AB - Researchers sometimes have a priori information on the relative importance of predictors that can be used to screen out covariates. An important question is whether any of the discarded covariates have predictive power when the most relevant predictors are included in the model. We consider testing whether any discarded covariate is significant conditional on some pre-chosen covariates. We propose a maximum-type test statistic and show that it has a nonstandard asymptotic distribution, giving rise to the conditional adaptive resampling test. To accommodate signals of unknown sparsity, we develop a hybrid test statistic, which is a weighted average of maximum- and sum-type statistics. We prove the consistency of the test procedure under general assumptions and illustrate how it can be used as a stopping rule in forward regression. We show, through simulation, that the proposed method provides adequate control of the familywise error rate with competitive power for both sparse and dense signals, even in high-dimensional cases, and we demonstrate its advantages in cases where the covariates are heavily correlated. We illustrate the application of our method by analysing an expression quantitative trait locus dataset.
UR - http://hdl.handle.net/10754/668557
UR - https://academic.oup.com/biomet/article/105/1/57/4711135
UR - http://www.scopus.com/inward/record.url?scp=85043249579&partnerID=8YFLogxK
U2 - 10.1093/biomet/asx061
DO - 10.1093/biomet/asx061
M3 - Article
SN - 0006-3444
VL - 105
SP - 57
EP - 71
JO - Biometrika
JF - Biometrika
IS - 1
ER -