Publication in conference proceedings
A data-driven approach to predict hospital length of stay: A Portuguese case study
Nuno Caetano (Caetano, N.); Raul M. S. Laureano (Laureano, R.); Paulo Cortez (Cortez, P.)
Proceedings of the 16th International Conference on Enterprise Information Systems (ICEIS 2014)
Year (definitive publication)
2014
Language
English
Country
Portugal
More Information
Web of Science®

This publication is not indexed in Web of Science®

Scopus

This publication is not indexed in Scopus

Google Scholar

Times Cited: 32

(Last checked: 2026-05-02 16:13)

This publication is not indexed in Overton

Abstract
Data Mining (DM) aims to extract useful knowledge from raw data. In recent decades, hospitals have collected large amounts of data through new methods of electronic data storage, increasing the potential value of DM in this domain, in what is known as medical data mining. This work focuses on the case study of a Portuguese hospital, based on a recent and large dataset collected from 2000 to 2013. A data-driven predictive model was obtained for the length of stay (LOS), using as inputs indicators commonly available during the hospitalization process. Based on a regression approach, several state-of-the-art DM models were compared. The best result was obtained by a Random Forest (RF), which achieved a high coefficient of determination (0.81). Moreover, a sensitivity analysis approach was used to extract human-understandable knowledge from the RF model, revealing the top three influential input attributes: the hospital episode type, the physical service where the patient is hospitalized, and the associated medical specialty. Such predictive and explanatory knowledge is valuable for supporting the decisions of hospital managers.
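The two quantities mentioned in the abstract, the coefficient of determination and an input ranking obtained by sensitivity analysis, can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not specify the sensitivity analysis variant, so a simple one-at-a-time scheme is assumed here, and `toy_predict`, the data rows, and all function names are hypothetical stand-ins for a fitted model and real hospital data. The sketch also assumes numeric inputs, whereas attributes such as episode type are categorical.

```python
from statistics import mean


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def sensitivity_ranges(predict, rows, levels=5):
    """One-at-a-time sensitivity analysis (assumed variant).

    Each input is varied over its observed range while the other inputs
    are held at their mean; the spread of the model response is recorded.
    """
    n_features = len(rows[0])
    baseline = [mean(r[j] for r in rows) for j in range(n_features)]
    ranges = []
    for j in range(n_features):
        lo = min(r[j] for r in rows)
        hi = max(r[j] for r in rows)
        responses = []
        for k in range(levels):
            x = list(baseline)
            x[j] = lo + (hi - lo) * k / (levels - 1)
            responses.append(predict(x))
        ranges.append(max(responses) - min(responses))
    return ranges


def relative_importance(ranges):
    """Normalize response ranges so that they sum to 1."""
    total = sum(ranges)
    return [r / total for r in ranges]


def toy_predict(x):
    """Hypothetical stand-in for a fitted regression model."""
    return 3.0 * x[0] + 1.0 * x[1] + 0.0 * x[2]


# Hypothetical data: three numeric inputs and an observed target.
rows = [[0.0, 0.0, 0.0], [1.0, 2.0, 5.0], [2.0, 1.0, 3.0], [4.0, 4.0, 1.0]]
y_true = [0.5, 5.0, 6.5, 16.5]
y_pred = [toy_predict(r) for r in rows]

r2 = r_squared(y_true, y_pred)
importance = relative_importance(sensitivity_ranges(toy_predict, rows))
```

Sorting `importance` in descending order gives an input ranking of the kind reported above; any model exposing a prediction function could be passed in place of `toy_predict`.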
Keywords
Medical data mining, Length of stay, CRISP-DM, Regression, Random forest
Funding Records
Funding Reference
PEst-OE/EEI/UI0319/2014
Funding Entity
Fundação para a Ciência e a Tecnologia