Abstract：Objective To introduce the basic principle of 10-fold cross-validation, and use R to show the application of 10-fold cross-validation in the evaluation of the generalization ability of prediction models in an example. Methods The logistic regression model was used with 10-fold cross-validation as the evaluation method to predict the treatment outcome of drug-resistant tuberculosis patients. At the same time, the 5-fold, 10-fold, and leave-one-out cross-validation were used to compare the parameters of evaluation. Results Different divisions of training and test sets have influence on the parameters of evaluation. Compared with other cross-validation methods, the parameters of evaluation from 10-fold cross-validation have a higher stability and efficiency than other numbers of fold. Conclusion In the evaluation of the application effects of machine learning models, the comprehensive performance of 10-fold cross-validation is prominent and the generalization ability of different models can be objectively measured.
梁子超，李智炜，赖铿，林卓琛，李铁钢，张晋昕. 10折交叉验证用于预测模型泛化能力评价及其R软件实现[J]. 中国医院统计, 2020, 27(4): 289-292.
Liang Zichao, Li Zhiwei, Lai Keng, Lin Zhuochen, Li Tiegang, Zhang Jinxin.. Application of 10-fold cross-validation in the evaluation of generalization ability of prediction models and the realization in R. journal1, 2020, 27(4): 289-292.
［1］YADAV S, SHUKLA S. Analysis of kfold crossvalidation over holdout validation on colossal datasets for quality classification［C/OL］//2016 IEEE 6th International Conference on Advanced Computing (IACC). Bhimavaram, India: IEEE, 2016:7883［20200518］.https://ieeexplore.ieee.org/document/7544814.
［2］RODRGUEZ J D, PREZ A, LOZANO J A. Sensitivity analysis of kappafold cross validation in prediction error estimation［J］. IEEE Trans Pattern Anal Mach Intell, 2010, 32(3):569-575.
［4］CAO T A, WANG Q S, LIU D, et al. Resting state EEGbased sudden pain recognition method and experimental study［J］. Biomedical Signal Processing and Control, 2020, 59:101925.
［5］MAO Y, DONG L X, ZHENG Y, et al. Prediction of recurrence in cervical cancer using a nine-lncRNA signature［J］. Front Genet, 2019, 10:284.
［6］WANG Q, WANG X M, CHEN W M, et al. Application of generalized estimation equations to establish prediction equation for tuberculosis drug resistance in Zhejiang Province［J］. Chinese Journal of Epidemiology, 2018, 39(3):368-373.
［8］KANG M W, KIM H K, CHOI Y S, et al. Surgical treatment for multidrugresistant and extensive drugresistant tuberculosis［J］. Ann Thorac Surg, 2010, 89(5):1597-1602.