International Journal of Management, Economics and Social Sciences
Special Issue-International Conference on Medical and Health Informatics (ICMHI 2017)
2017, Vol. 6(S1), pp.293 – 306.
ISSN 2304 – 1366


Data Mining Techniques for Forecasting the Medical Resource Consumption of Patients with Diabetic Nephropathy


Tian-Shyug Lee1
Wensheng Dai2
Bo-Lin Huang1
Chi-Jie Lu3
1Dept. of Business Administration, Fu Jen Catholic University, Taiwan
2International Monetary Institute and Finance, Policy Research Center, Renmin University of China, China
3Dept. of Industrial Engineering and Management, Chien Hsin University of Science and Technology, Taiwan



Diabetes has become an important public health issue in the twenty-first century, and dialysis treatment has become a large burden on the National Health Insurance of Taiwan. Diabetic nephropathy(DN) is the leading factor that determines whether patients with diabetes will require dialysis. Statistical data published by the Ministry of Health and Welfare in 2015 indicated that, second only to cancer, chronic kidney failure is the most prevalent disease treated by primary outpatient clinics. In addition, according to the National Health Insurance Administration Ministry of Health and Welfare, 6% of the national health insurance budget was spent to cover the dialysis treatment of ESRD patients. Therefore, in this study, we proposed and developed a forecasting model for the medical resource consumption of DN patients. We used multiple regression, stepwise regression, multivariate adaptive regression splines (MARS), support vector regression, and twostage model (T-SVR). We used a combination of important variables screened out by stepwise regression and MARS to construct the T-SVR model. We screened out the important factors with a significant impact on medical consumption. We then identified the model with the best forecasting performance out of the five data mining techniques. Our results can aid the managers of medical institutions to properly and effectively allocate medical resources and control medical expenses.

Keywords:  Medical resource consumption, diabetic nephropathy, data mining, multivariate adaptive regression splines, support vector regression