Comunicação em evento científico
A Data-driven Approach to Predict Hospital Length of Stay - A Portuguese Case Study
Nuno Caetano (Caetano, Nuno); Raul Laureano (Laureano, Raul M. S.); Paulo Cortez (Cortez, Paulo);
Título Evento
Proceedings of the 16th International Conference on Enterprise Information Systems (ICEIS 2014)
Ano (publicação definitiva)
2014
Língua
Inglês
País
Portugal
Mais Informação
Web of Science®

Esta publicação não está indexada na Web of Science®

Scopus

N.º de citações: 6

(Última verificação: 2022-02-21 22:29)

Ver o registo na Scopus

Google Scholar

Esta publicação não está indexada no Google Scholar

Abstract/Resumo
Data Mining (DM) aims at the extraction of useful knowledge from raw data. In the last decades, hospitals have collected large amounts of data through new methods of electronic data storage, thus increasing the potential value of DM in this domain area, in what is known as medical data mining. This work focuses on the case study of a Portuguese hospital, based on recent and large dataset that was collected from 2000 to 2013. A data-driven predictive model was obtained for the length of stay (LOS), using as inputs indicators commonly available at the hospitalization process. Based on a regression approach, several state-of-the-art DM models were compared. The best result was obtained by a Random Forest (RF), which presents a high quality coefficient of determination value (0.81). Moreover, a sensitivity analysis approach was used to extract human understandable knowledge from the RF model, revealing top three influential input attributes: hospital episode type, the physical service where the patient is hospitalized and the associated medical specialty. Such predictive and explanatory knowledge is valuable for supporting decisions of hospital managers.
Agradecimentos/Acknowledgements
--
Palavras-chave
Medical Data Mining, Length of Stay, CRISP-DM, Random Forest.
  • Ciências Físicas - Ciências Naturais