TY - GEN
T1 - Performance-estimation properties of cross-validation-based protocols with simultaneous hyper-parameter optimization
AU - Tsamardinos, Ioannis
AU - Rakhshani, Amin
AU - Lagani, Vincenzo
N1 - Generated from Scopus record by KAUST IRTS on 2023-09-23
PY - 2014/1/1
Y1 - 2014/1/1
N2 - In a typical supervised data analysis task, one needs to perform the following two tasks: (a) select the best combination of learning methods (e.g., for variable selection and classifier) and tune their hyper-parameters (e.g., K in K-NN), also called model selection, and (b) provide an estimate of the performance of the final, reported model. Combining the two tasks is not trivial because when one selects the set of hyper-parameters that seem to provide the best estimated performance, this estimation is optimistic (biased / overfitted) due to performing multiple statistical comparisons. In this paper, we confirm that the simple Cross-Validation with model selection is indeed optimistic (overestimates) in small sample scenarios. In comparison the Nested Cross Validation and the method by Tibshirani and Tibshirani provide conservative estimations, with the later protocol being more computationally efficient. The role of stratification of samples is examined and it is shown that stratification is beneficial. © 2014 Springer International Publishing.
AB - In a typical supervised data analysis task, one needs to perform the following two tasks: (a) select the best combination of learning methods (e.g., for variable selection and classifier) and tune their hyper-parameters (e.g., K in K-NN), also called model selection, and (b) provide an estimate of the performance of the final, reported model. Combining the two tasks is not trivial because when one selects the set of hyper-parameters that seem to provide the best estimated performance, this estimation is optimistic (biased / overfitted) due to performing multiple statistical comparisons. In this paper, we confirm that the simple Cross-Validation with model selection is indeed optimistic (overestimates) in small sample scenarios. In comparison the Nested Cross Validation and the method by Tibshirani and Tibshirani provide conservative estimations, with the later protocol being more computationally efficient. The role of stratification of samples is examined and it is shown that stratification is beneficial. © 2014 Springer International Publishing.
UR - http://link.springer.com/10.1007/978-3-319-07064-3_1
UR - http://www.scopus.com/inward/record.url?scp=84900529400&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-07064-3_1
DO - 10.1007/978-3-319-07064-3_1
M3 - Conference contribution
SN - 9783319070636
SP - 1
EP - 14
BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PB - Springer [email protected]
ER -