Capítulo de livro
Looking back to 1850 in 2025: Historascan to digitize historical journals
Bruno Frutuoso Costa (Costa, B. F.); Bruno Contreiras Mateus (Mateus, B. C.); Hugo José Pinto (Pinto, H. ); Mohammad Reza Tabrizi (Tabrizi, M.);
Título Livro
Impact of digitalization on communication dynamics
Ano (publicação definitiva)
2025
Língua
Inglês
País
Estados Unidos da América
Mais Informação
Web of Science®

Esta publicação não está indexada na Web of Science®

Scopus

N.º de citações: 0

(Última verificação: 2026-01-27 14:14)

Ver o registo na Scopus

Google Scholar

N.º de citações: 3

(Última verificação: 2026-02-04 12:43)

Ver o registo no Google Scholar

Esta publicação não está indexada no Overton

Abstract/Resumo
This chapter analyses current technologies and the challenges involved in extracting and classifying articles and news headlines from historical journals, as well as converting images to text format. The work to develop a tool focused on digitising historical journals was carried out by a multidisciplinary team of experts in media studies, artificial intelligence, image processing, and cultural heritage preservation. The data used derives from two historic Portuguese journals, Diário de Notícias and Jornal de Notícias, which were created in the mid-19th century. This project is based on a mixture of heuristics, computer vision, pattern recognition, and other artificial intelligence and machine learning techniques. The main challenges included the variability in the design of historical journals, preserving the quality of images over time, and continuously improving image processing and OCR techniques to adapt to different styles and periods of newspapers.
Agradecimentos/Acknowledgements
This research was supported by the Portuguese Foundation for Science and Technology (FCT – Fundação para a Ciência e a Tecnologia) [2023.04877.BD].
Palavras-chave
Industry 4.0,News media,Historical newspapers,Journals,Digitization,Innovation,Content creation,Artificial intelligence,Machine learning,Optical Character Recognition,Convolutional Neural Networks,Natural language processing,Historascan,Diário de Notícias,Jornal de Notícias
Registos de financiamentos
Referência de financiamento Entidade Financiadora
2023.04877.BD Fundação para a Ciência e a Tecnologia
Projetos Relacionados