1 School of Public Health, Chongqing Medical University, Chongqing 400016, China;
2 The State Key Laboratory of Ultrasound in Medicine and Engineering, Chongqing Medical University, Chongqing 400016, China;
3 Department of Health Management Center, Nanchong Central Hospital, North Sichuan Medical University, Nanchong 637001,China;
4 Department of Obstetrics and Gynecology, Nanchong Central Hospital, North Sichuan Medical University, Nanchong 637001, China
Abstract:Objective To systematically analyze the data characteristics of the current health examination data, and to realize the data preprocessing by using Excel and SAS software macro process.Methods Based on the physical examination data from the physical examination data platform of a municipal tertiary hospital from October 2017 to December 2020, the characteristics of the current physical examination data were summarized through data combing, and the corresponding preprocessing rules were formulated. Based on Excel and SAS software, the specific data preprocessing scheme, operation process and macro code were proposed. data characteristics were summarized through data sorting, preprocessing rules were formulated, and specific solutions, operation procedures and macro codes were proposed based on Excel and SAS software.Results The batch column names of physical examination data were converted by Excel and SAS software, making them conform to the variable name naming rules of SAS software. Multiple data sets with different structures were realized without truncation value, which ensured the integrity of the database. By deleting missing variables and observation, combining duplicate variables and identifying duplicate observation
张丽君,黄艳艳,蒲杨,陈柯,徐凡,罗祥力,石丘玲. 基于SAS软件的地市级医院健康体检数据预处理方法探索[J]. 中国医院统计, 2023, 30(1): 64-70.
Zhang Lijun, Huang Yanyan,Pu Yang, Chen Ke,Xu Fan, Luo Xiangli,Shi Qiuling. Research on the preprocessing method of health examination data in prefecture-level hospitals based on SAS software. journal1, 2023, 30(1): 64-70.
[12]MALLEY B, RAMAZZOTTI D, WU J T Y. Data pre-processing[M]//Secondary Analysis of Electronic Health Records. Cham: Springer International Publishing, 2016:115-141. DOI:10.1007/978-3-319-43742-2_12.
[13]MCCARTHY D J, CAMPBELL K R, LUN A T, et al. Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R[J]. Bioinformatics, 2017, 33(8):1179-1186. DOI:10.1093/bioinformatics/btw777.
[15]GOKHALE K M, CHANDAN J S, TOULIS K, et al. Data extraction for epidemiological research (DExtER):A novel tool for automated clinical epidemiology studies[J]. Eur J Epidemiol, 2021, 36(2):165-178. DOI:10.1007/s10654-020-00677-6.
[16]DEMBE A E, PARTRIDGE J S, GEIST L C. Statistical software applications used in health services research: Analysis of published studies in the US[J]. BMC Health Serv Res, 2011, 11:252. DOI:10.1186/1472-6963-11-252.