TY - JOUR
T1 - Feature selection for high-dimensional temporal data
AU - Tsagris, Michail
AU - Lagani, Vincenzo
AU - Tsamardinos, Ioannis
N1 - Generated from Scopus record by KAUST IRTS on 2023-09-23
PY - 2018/1/23
Y1 - 2018/1/23
N2 - Background: Feature selection is commonly employed for identifying collectively-predictive biomarkers and biosignatures; it facilitates the construction of small statistical models that are easier to verify, visualize, and comprehend while providing insight to the human expert. In this work we extend established constrained-based, feature-selection methods to high-dimensional "omics" temporal data, where the number of measurements is orders of magnitude larger than the sample size. The extension required the development of conditional independence tests for temporal and/or static variables conditioned on a set of temporal variables. Results: The algorithm is able to return multiple, equivalent solution subsets of variables, scale to tens of thousands of features, and outperform or be on par with existing methods depending on the analysis task specifics. Conclusions: The use of this algorithm is suggested for variable selection with high-dimensional temporal data.
AB - Background: Feature selection is commonly employed for identifying collectively-predictive biomarkers and biosignatures; it facilitates the construction of small statistical models that are easier to verify, visualize, and comprehend while providing insight to the human expert. In this work we extend established constrained-based, feature-selection methods to high-dimensional "omics" temporal data, where the number of measurements is orders of magnitude larger than the sample size. The extension required the development of conditional independence tests for temporal and/or static variables conditioned on a set of temporal variables. Results: The algorithm is able to return multiple, equivalent solution subsets of variables, scale to tens of thousands of features, and outperform or be on par with existing methods depending on the analysis task specifics. Conclusions: The use of this algorithm is suggested for variable selection with high-dimensional temporal data.
UR - https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-018-2023-7
UR - http://www.scopus.com/inward/record.url?scp=85040866104&partnerID=8YFLogxK
U2 - 10.1186/s12859-018-2023-7
DO - 10.1186/s12859-018-2023-7
M3 - Article
SN - 1471-2105
VL - 19
JO - BMC BIOINFORMATICS
JF - BMC BIOINFORMATICS
IS - 1
ER -