首页> 中文期刊> 《中国邮电高校学报:英文版》 >Comparison of three data mining methods in predicting 5-year survival of colorectal cancer patients

Comparison of three data mining methods in predicting 5-year survival of colorectal cancer patients

         

摘要

The prediction of colorectal cancer(CRC) survivability has always been a challenging research issue. Considering the importance of predicting CRC patients' survival rates, we compared the performance of three data mining methods: decision trees(DTs), artificial neural networks(ANNs) and support vector machines(SVMs), for predicting 5-year survival of CRC patients to assist clinicians in making treatment decisions. The CRC dataset used to build the prediction model comes from the surveillance, epidemiology, and end results(SEER) program. The 5-fold cross-validation and random forest algorithm were respectively utilized for measuring the model predictive accuracy and the importance of features. Experimental results show that the predictive accuracy of ANNs(0.73) and SVMs(0.75) were higher than that of DTs, and they also have the best result in the area under the receiver operating characteristic(ROC) curve(area under curve(AUC)=0.82). This result may indicate high predictive power of ANNs and SVMs for predicting 5-year survival of CRC patients.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号