Ciência-IUL
Publicações
Descrição Detalhada da Publicação
Proceedings of the 16th International Conference on Enterprise Information Systems (ICEIS 2014)
Ano (publicação definitiva)
2014
Língua
Inglês
País
Portugal
Mais Informação
Web of Science®
Esta publicação não está indexada na Web of Science®
Scopus
Esta publicação não está indexada na Scopus
Google Scholar
Abstract/Resumo
Data Mining (DM) aims at the extraction of useful knowledge from raw data. In the last decades, hospitals
have collected large amounts of data through new methods of electronic data storage, thus increasing the
potential value of DM in this domain area, in what is known as medical data mining. This work focuses on
the case study of a Portuguese hospital, based on recent and large dataset that was collected from 2000 to 2013. A data-driven predictive model was obtained for the length of stay (LOS), using as inputs indicators
commonly available at the hospitalization process. Based on a regression approach, several state-of-the-art DM models were compared. The best result was obtained by a Random Forest (RF), which presents a high quality coefficient of determination value (0.81). Moreover, a sensitivity analysis approach was used to extract human understandable knowledge from the RF model, revealing top three influential input attributes: hospital episode type, the physical service where the patient is hospitalized and the associated medical specialty. Such predictive and explanatory knowledge is valuable for supporting decisions of hospital managers.
Agradecimentos/Acknowledgements
--
Palavras-chave
Medical data mining,Length of stay,CRISP-DM,Regression,Random forest
Registos de financiamentos
Referência de financiamento | Entidade Financiadora |
---|---|
: PEst- ˆ OE/EEI/UI0319/2014 | Fundação para a Ciência e a Tecnologia |