Data sense platform

Mariana Dias; Joao C Ferreira or Joao Ferreira; Rui Maia; Pedro Santos; Ricardo Ribeiro; Ana Martins

Ciência_Iscte Publications Publication Detailed Description

Publication in conference proceedings

Data sense platform

Mariana Dias (Dias, M.); Joao C Ferreira or Joao Ferreira (Ferreira, J.); Rui Maia (Maia, R.); Pedro Santos (Santos, P.); Ricardo Ribeiro (Ribeiro, R.); Ana Martins (Martins, A.);

Proceedings of the IASTEM—586th International Conference on Science Technology and Management (ICSTM)

Year (definitive publication)

2019

Language

English

Country

Brunei Darussalam

More Information

Web of Science®

This publication is not indexed in Web of Science®

Scopus

This publication is not indexed in Scopus

Google Scholar

Times Cited: 2

(Last checked: 2026-03-26 09:32)

View record in Google Scholar

Overton

This publication is not indexed in Overton

Abstract

The current manual or semi-automatic document preservation process suffers from various problems that particularly affect the handling of confidential or sensitive information, such as the identification of sensitive data in documents requiring human intervention that is costly and propense to generate error, and the identification of sensitive data in large-scale documents does not allow an approach that depends on human expertise for their identification and relationship. DataSense will be highly exportable software that will enable organizations to identify and understand the sensitive data in their possession in unstructured textual information (digital documents) in order to comply with legal, compliance and security purposes, identify and classify and relate sensitive data (Personal Data) present in large-scale nonstructured information in a way that allows entities and/or organizations to understand it without calling into question security or confidentiality issues, and allowing companies that focus on their clients to better understand their profile from information collected from sensitive data or consent data or algorithms. The Data Sense project will be based on 3 key layers using the current potential of NLP technologies and the advances in machine learning (NER), Disambiguation and Coreferencing (ARE) and Automatic Learning and Human Feedback. It will also be characterized by the ability to learn from human feedback automatically, correcting and iteratively improving the AI model that supports it.

Acknowledgements

Keywords

Sensitive data,Natural language processing,Text mining,Named entities recognition,Co-reference resolution

Fields of Science and Technology Classification

Computer and Information Sciences - Natural Sciences

Funding Records

Funding Reference	Funding Entity
UID/MULTI/0446/2013	Fundação para a Ciência e a Tecnologia
UID/GES/00315/2013	Fundação para a Ciência e a Tecnologia

Publication Identifiers

Other ID (source: External)	cv-prod-id-869021
ISSN (source: External)	2250-1584
Ciência_Iscte ID	ci-pub-76403

Other Publication Details

Online Publication Year	2019
Publisher	--
Indexes	--
ISSN	2394-2835 (online)
ISBN	--
Volume
Article Number
Pages	--	Total Pages	6
Peer Reviewed	Yes
Dissemination Mean	Both (printed and digital)
Editors
Event Title	IASTEM International Conference
Event Organizer
City	Brunei Darussalam
Event Type	Conference
Event Classification	International
Event Year	2019
Event Publication Type	Full Paper
Publication Date (online)	2019-01-01
Publication Date (print)	2019-01-01