Gutenbrain: An architecture for equipment technical attributes extraction from Piping & Instrumentation Diagrams

Marco Vicente; João Guarda; Fernando Batista

Publication in conference proceedings

Marco Vicente (Vicente, M.); João Guarda (Guarda, J.); Fernando Batista (Batista, F.);

Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR

Year (final publication)

2022

Language

English

Country

Portugal

Web of Science®

This publication is not indexed in Web of Science®

Scopus

This publication is not indexed in Scopus

Google Scholar

Number of citations: 2

(Last checked: 2026-07-15 20:26)

Overton

This publication is not indexed in Overton

Abstract

Piping and Instrumentation Diagrams (P&IDs) are detailed representations of engineering schematics with piping, instrumentation and other related equipment and their physical process flow. They are critical in engineering projects to convey the physical sequence of systems, allowing engineers to understand the process flow, safety and regulatory requirements, and operational details. P&IDs may be provided in several formats, including scanned paper, CAD files, PDF, and images, but these documents are frequently searched manually to identify all the equipment and their inter-connectivity. Furthermore, engineers must search the related technical specifications in separate technical documents, as P&IDs usually do not include technical specifications. This paper presents Gutenbrain, an architecture to extract equipment technical attributes from piping & instrumentation diagrams and technical documentation, which relies on textual information only. It first extracts equipment from P&IDs, using metadata to understand the equipment type, and text coordinates to detect the equipment even when it is represented in multiple lines of text. After detecting the equipment and storing it in a database, it allows retrieving and inferring technical attributes from the related technical documentation using two question answering models based on BERT-like contextual embeddings, depending on the equipment type metadata. One question answering model works with free questions of continuous text, while the other uses tabular data. This ensemble approach allows us to extract technical attributes from documents where information is unstructured and scattered. The equipment extraction stage achieves about 97.2% precision and 71.2% recall. The stored information can later be accessed using Elasticsearch, allowing engineers to save thousands of hours in maintenance engineering tasks.
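
The abstract mentions using text coordinates to detect equipment that is written across several lines of text. The sketch below illustrates one possible way to do that kind of grouping; it is not the authors' implementation. The `TextSpan` type, the `group_multiline_labels` function, and the two thresholds are all hypothetical names and values chosen for illustration, and the input is assumed to be text fragments with page coordinates where y grows downward.

```python
from dataclasses import dataclass


@dataclass
class TextSpan:
    """A text fragment with its position on the page (hypothetical structure)."""
    text: str
    x: float       # left edge of the fragment
    y: float       # top edge of the fragment (y grows downward)
    height: float  # line height of the fragment


def group_multiline_labels(spans, max_x_offset=5.0, max_line_gap=1.5):
    """Merge vertically stacked, left-aligned spans into single labels.

    A span joins an existing group when its left edge is within
    max_x_offset of the group's last span and it sits at most
    max_line_gap line heights below it. Both thresholds are assumed
    values and would need tuning on real drawings.
    """
    groups = []
    # Process spans top to bottom so each group grows downward.
    for span in sorted(spans, key=lambda s: (s.y, s.x)):
        for group in groups:
            last = group[-1]
            aligned = abs(span.x - last.x) <= max_x_offset
            gap = span.y - last.y
            close = 0 < gap <= max_line_gap * last.height
            if aligned and close:
                group.append(span)
                break
        else:
            # No suitable group found: this span starts a new label.
            groups.append([span])
    return [" ".join(s.text for s in group) for group in groups]


spans = [
    TextSpan("P-101", x=100.0, y=50.0, height=10.0),
    TextSpan("CENTRIFUGAL PUMP", x=101.0, y=62.0, height=10.0),
    TextSpan("V-200", x=300.0, y=50.0, height=10.0),
]
labels = group_multiline_labels(spans)
print(labels)
```

In this toy input the tag and its description are stacked at nearly the same left edge, so they are merged into one label, while the distant tag stays separate. A greedy pass like this is easy to reason about but can mis-merge dense drawings, which is one plausible reason an extraction stage might trade recall for precision.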

Keywords

Information retrieval, Question-answering, Piping & Instrumentation Diagrams

Fields of Science and Technology Classification

Civil Engineering - Engineering and Technology
Electrical Engineering, Electronic Engineering, Information Engineering - Engineering and Technology
Mechanical Engineering - Engineering and Technology
Environmental Engineering - Engineering and Technology
Languages and Literature - Humanities

Contributions to the United Nations Sustainable Development Goals

With the aim of increasing research directed towards achieving the United Nations Sustainable Development Goals for 2030, Ciência_Iscte offers the possibility of associating scientific articles with the Sustainable Development Goals, where applicable. These are the Sustainable Development Goals identified by the author(s) for this publication.

Publication Identifiers

DOI (source: author)	10.5220/0011528500003335
Ciência_Iscte ID	ci-pub-91284

Other Publication Details

Year Published Online	2022
Publisher	SCITEPRESS – Science and Technology Publications, Lda
Indexing	--
ISSN	2184-3228 (online)
ISBN	978-989-758-614-9 (online)
Volume	1
Article Number
Pages	204-211
Total Pages	8
Peer Reviewed	Yes
Editors	Coenen, F., Fred, A., and Filipe, J.
Event Title	14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR
Event Organizer
City	Valletta, Malta
Event Type	Conference
Event Classification	International
Event Year	2022
Publication Type at Event	Full Paper
Publication Date (online)
Publication Date (print)
