Emergency Department Readmission Risk Prediction: A Case Study in Chile
Abstract
Short time readmission prediction in Emergency Departments (ED) is a valuable tool to improve both the ED management and the healthcare quality. It helps identifying patients requiring further post-discharge attention as well as reducing healthcare costs. As in many other medical domains, patient readmission data is heavily imbalanced, i.e. the minority class is very infrequent, which is a challenge for the construction of accurate predictors using machine learning tools. We have carried computational experiments on a dataset composed of ED admission records spanning more than 100000 patients in 3 years, with a highly imbalanced distribution. We employed various approaches for dealing with this highly imbalanced dataset in combination with different classification algorithms and compared their predictive power for the estimation of the ED readmission probability within 72 h after discharge. Results show that random undersampling and Bagging (RUSBagging) in combination with Random Forest achieves the best results in terms of Area Under ROC Curve (AUC).
BIB_text
title = {Emergency Department Readmission Risk Prediction: A Case Study in Chile},
pages = {11-20},
volume = {10338},
keywds = {
Readmission risk, Imbalanced data, Classification, Bagging
}
abstract = {
Short time readmission prediction in Emergency Departments (ED) is a valuable tool to improve both the ED management and the healthcare quality. It helps identifying patients requiring further post-discharge attention as well as reducing healthcare costs. As in many other medical domains, patient readmission data is heavily imbalanced, i.e. the minority class is very infrequent, which is a challenge for the construction of accurate predictors using machine learning tools. We have carried computational experiments on a dataset composed of ED admission records spanning more than 100000 patients in 3 years, with a highly imbalanced distribution. We employed various approaches for dealing with this highly imbalanced dataset in combination with different classification algorithms and compared their predictive power for the estimation of the ED readmission probability within 72 h after discharge. Results show that random undersampling and Bagging (RUSBagging) in combination with Random Forest achieves the best results in terms of Area Under ROC Curve (AUC).
}
isbn = {978-3-319-59773-7},
isi = {1},
doi = {10.1007/978-3-319-59773-7_2},
date = {2017-06-19},
year = {2017},
}