Privacy in text documents

Mariana Dias; Joao C Ferreira or Joao Ferreira; Rui Maia; Pedro Santos; Ricardo Ribeiro

Ciência_Iscte Publicações Descrição Detalhada da Publicação

Publicação em atas de evento científico

Privacy in text documents

Mariana Dias (Dias, M.); Joao C Ferreira or Joao Ferreira (Ferreira, J. C.); Rui Maia (Maia, R.); Pedro Santos (Santos, P.); Ricardo Ribeiro (Ribeiro, R.);

Proceedings of the 33rd International Business Information Management Association Conference, IBIMA 2019: Education Excellence and Innovation Management through Vision 2020

Ano (publicação definitiva)

2019

Língua

Inglês

País

Espanha

Mais Informação

Visitar Link

Web of Science®

N.º de citações: 1

(Última verificação: 2026-04-08 17:33)

Ver o registo na Web of Science®

Scopus

N.º de citações: 2

(Última verificação: 2026-04-08 22:14)

Ver o registo na Scopus

Google Scholar

N.º de citações: 2

(Última verificação: 2026-04-06 22:20)

Ver o registo no Google Scholar

Overton

Esta publicação não está indexada no Overton

Abstract/Resumo

The process of sensitive data preservation is a manual and a semi-automatic procedure. Sensitive data preservation suffers various problems, in particular, affect the handling of confidential, sensitive and personal information, such as the identification of sensitive data in documents requiring human intervention that is costly and propense to generate error, and the identification of sensitive data in large-scale documents does not allow an approach that depends on human expertise for their identification and relationship. DataSense will be highly exportable software that will enable organizations to identify and understand the sensitive data in their possession in unstructured textual information (digital documents) in order to comply with legal, compliance and security purposes. The goal is to identify and classify sensitive data (Personal Data) present in large-scale structured and non-structured information in a way that allows entities and/or organizations to understand it without calling into question security or confidentiality issues. The DataSense project will be based on European-Portuguese text documents with different approaches of NLP (Natural Language Processing) technologies and the advances in machine learning, such as Named Entity Recognition, Disambiguation, Co-referencing (ARE) and Automatic Learning and Human Feedback. It will also be characterized by the ability to assist organizations in complying with standards such as the GDPR (General Data Protection Regulation), which regulate data protection in the European Union.

Agradecimentos/Acknowledgements

Palavras-chave

Sensitive data,Natural language processing,Text mining,Named entities recognition

Classificação Fields of Science and Technology

Ciências da Computação e da Informação - Ciências Naturais
Engenharia Eletrotécnica, Eletrónica e Informática - Engenharia e Tecnologia

Registos de financiamentos

Referência de financiamento	Entidade Financiadora
POCI-01-0247-FEDER-038539	Comissão Europeia
UID/Multi/04466/2019	Fundação para a Ciência e a Tecnologia

Identificadores da Publicação

WoS (fonte: Ciência_Iscte)	WOS:000503988804014
Scopus (fonte: autor)	2-s2.0-85074083548
Outro ID (fonte: ORCID)	cv-prod-id-1713853
ISBN (fonte: Externo)	978-099985512-6
Scopus (fonte: Ciência_Iscte)	2-s2.0-85074083548
Handle (fonte: Ciência-IUL)	http://hdl.handle.net/10071/22603
WoS (fonte: autor)	000503988804014
Outro ID (fonte: Externo)	cv-prod-id-954996
ID Ciência_Iscte	ci-pub-63744

Outros Detalhes da Publicação

Ano Publicação Online	2019
Editora	International Business Information Management Association, IBIMA
Indexação	Web of Science©; Scopus;
ISSN	--
ISBN	978-099985512-6 (print) 978-099985512-6 (online)
Volume
Número Artigo
Páginas	2551 - 2560	Total Páginas	10
Avaliado Cientificamente	Sim
Editores	Soliman, K. S.
Título do Evento	33rd International Business Information Management Association Conference: Education Excellence and Innovation Management through Vision 2020, IBIMA 2019
Organizador do Evento	IBIMA
Cidade	Granada
Tipo de Evento	Conferência
Classificação do Evento	Internacional
Ano do Evento	2019
Tipo de Publicação no Evento	Artigo Completo
Repositório ISCTE-IUL	Link para o repositório
Data Publicação (online)	2019-01-01
Data Publicação (print)	2019-01-01

Altmetric

PlumX Metrics