Review article Q1
Data science, machine learning and big data in digital journalism: A survey of state-of-the-art, challenges and opportunities
Elizabeth Silva Fernandes (Fernandes, E.); Sérgio Moro (Moro, S.); Paulo Cortez (Cortez, P.)
Journal Title
Expert Systems with Applications
Year (definitive publication)
2023
Language
English
Country
United Kingdom
More Information
Web of Science®

Times Cited: 6

(Last checked: 2024-07-02 14:24)

: 0.7
Scopus

Times Cited: 13

(Last checked: 2024-06-28 15:08)

: 1.3
Google Scholar

Times Cited: 19

(Last checked: 2024-06-30 07:37)


Abstract
Digital journalism has undergone dramatic change, and media companies are challenged to use data science algorithms to remain competitive in the Big Data era. While this is a relatively new area of study in the media landscape, the use of machine learning and artificial intelligence has increased substantially over the last few years. In particular, the adoption of data science models for personalization and recommendation has attracted the attention of several media publishers. Following this trend, this paper presents a literature analysis of the role of Data Science (DS) in Digital Journalism (DJ). Specifically, the aim is to present a critical literature review, synthesizing the main application areas of DS in DJ and highlighting research gaps, challenges, and opportunities for future studies. Through a systematic literature review integrating bibliometric search, text mining, and qualitative discussion, the relevant literature was identified and extensively analyzed. The review reveals an increasing use of DS methods in DJ, with almost 47% of the research published in the last three years. A hierarchical clustering highlighted six main research domains: text mining, event extraction, online comment analysis, recommendation systems, automated journalism, and exploratory data analysis along with some machine learning approaches. Future research directions comprise developing models to improve personalization and engagement features, exploring recommendation algorithms, testing new automated journalism solutions, and improving paywall mechanisms.
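As a rough illustration of the kind of hierarchical clustering over text that the abstract mentions, the sketch below groups a few toy documents by word overlap. It is not the authors' pipeline: this record does not specify their features, distance measure, or linkage, so the term-frequency vectors, cosine distance, average linkage, and the names `tf_vector`, `cosine_distance`, `agglomerative_clusters`, and `toy_abstracts` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tf_vector(text):
    """Bag-of-words term-frequency vector for one document (assumed features)."""
    return Counter(text.lower().split())


def cosine_distance(a, b):
    """1 - cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # treat an empty document as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


def agglomerative_clusters(docs, n_clusters):
    """Average-linkage agglomerative clustering; returns lists of doc indices."""
    vectors = [tf_vector(d) for d in docs]
    clusters = [[i] for i in range(len(docs))]  # start with one cluster per doc

    def linkage(c1, c2):
        # Average pairwise distance between the members of two clusters.
        total = sum(cosine_distance(vectors[i], vectors[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    # Repeatedly merge the two closest clusters until the target count is reached.
    while len(clusters) > max(n_clusters, 1):
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                d = linkage(clusters[x], clusters[y])
                if best is None or d < best[0]:
                    best = (d, x, y)
        _, x, y = best
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(c) for c in clusters]


# Hypothetical toy data, not taken from the reviewed literature.
toy_abstracts = [
    "news recommendation system for readers",
    "personalized news recommendation for readers",
    "automated journalism text generation",
    "automated text generation for journalism",
]
result = agglomerative_clusters(toy_abstracts, 2)
print(result)
```

On this toy input the two recommendation texts end up in one group and the two automated-journalism texts in the other. A real study would more likely use a library implementation with weighted features and inspect the dendrogram before choosing the number of clusters.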
Keywords
Data science, Digital journalism, Text mining, Systematic literature review, Media analytics, Machine learning
  • Computer and Information Sciences - Natural Sciences
  • Media and Communications - Social Sciences
Funding Records
Funding Reference    Funding Entity
UIDB/00319/2020      Fundação para a Ciência e a Tecnologia
UIDP/04466/2020      Fundação para a Ciência e a Tecnologia
UIDB/04466/2020      Fundação para a Ciência e a Tecnologia