Gutenbrain: An architecture for equipment technical attributes extraction from Piping & Instrumentation Diagrams

Marco Vicente; João Guarda; Fernando Batista

Publication in conference proceedings

Marco Vicente (Vicente, M.); João Guarda (Guarda, J.); Fernando Batista (Batista, F.);

Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR

Year (definitive publication)

2022

Language

English

Country

Portugal

Web of Science®

This publication is not indexed in Web of Science®

Scopus

This publication is not indexed in Scopus

Google Scholar

Times Cited: 1

(Last checked: 2026-01-29 21:29)

Overton

This publication is not indexed in Overton

Abstract

Piping and Instrumentation Diagrams (P&IDs) are detailed engineering schematics that show piping, instrumentation, and other related equipment together with their physical process flow. They are critical in engineering projects to convey the physical sequence of systems, allowing engineers to understand the process flow, safety and regulatory requirements, and operational details. P&IDs may be provided in several formats, including scanned paper, CAD files, PDF, and images, but these documents are frequently searched manually to identify all the equipment and its interconnectivity. Furthermore, engineers must look up the related technical specifications in separate technical documents, as P&IDs usually do not include them. This paper presents Gutenbrain, an architecture to extract equipment technical attributes from piping & instrumentation diagrams and technical documentation, which relies on textual information only. It first extracts equipment from P&IDs, using meta-data to understand the equipment type, and text coordinates to detect the equipment even when it is represented in multiple lines of text. After detecting the equipment and storing it in a database, it allows retrieving and inferring technical attributes from the related technical documentation using two question answering models based on BERT-like contextual embeddings, depending on the equipment type meta-data. One question answering model works with free questions over continuous text, while the other uses tabular data. This ensemble approach allows us to extract technical attributes from documents where information is unstructured and scattered. The equipment extraction stage achieves about 97.2% precision and 71.2% recall. The stored information can later be accessed using Elasticsearch, allowing engineers to save thousands of hours in maintenance engineering tasks.
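
The abstract mentions using text coordinates to detect equipment whose label spans several lines of text. The record does not describe how this is done, so the following is only a minimal sketch of one plausible approach, not the authors' method. It assumes each text fragment comes with a bounding box, stacks fragments that are centre-aligned and vertically adjacent, and checks the joined string against a tag pattern. The names `TextBox`, `TAG_PATTERN`, `merge_stacked_boxes`, `extract_equipment`, the tolerances, and the tag format are all hypothetical.

```python
import re
from dataclasses import dataclass


@dataclass
class TextBox:
    """A text fragment with its bounding box (y grows downward). Hypothetical type."""
    text: str
    x: float       # left edge
    y: float       # top edge
    width: float
    height: float


# Assumed tag format: 1-4 letters, a hyphen, 2-5 digits, optional suffix letter.
TAG_PATTERN = re.compile(r"^[A-Z]{1,4}-\d{2,5}[A-Z]?$")


def merge_stacked_boxes(boxes, x_tol=5.0, gap_factor=0.6):
    """Group fragments that sit directly below one another with aligned centres."""
    groups = []
    for box in sorted(boxes, key=lambda b: (b.y, b.x)):
        for group in groups:
            last = group[-1]
            last_centre = last.x + last.width / 2
            box_centre = box.x + box.width / 2
            gap = box.y - (last.y + last.height)
            # Same column and close enough vertically: treat as the next line.
            if abs(last_centre - box_centre) <= x_tol and 0 <= gap <= gap_factor * last.height:
                group.append(box)
                break
        else:
            groups.append([box])
    return groups


def extract_equipment(boxes):
    """Return joined labels that look like equipment tags under TAG_PATTERN."""
    tags = []
    for group in merge_stacked_boxes(boxes):
        candidate = "-".join(b.text.strip() for b in group)
        if TAG_PATTERN.match(candidate):
            tags.append(candidate)
    return tags


# Made-up sample data: a two-line instrument label, a one-line tag, and a note.
sample = [
    TextBox("PT", 100, 50, 20, 10),
    TextBox("1001", 95, 62, 30, 10),
    TextBox("P-101A", 300, 200, 50, 10),
    TextBox("NOTE 3", 10, 10, 40, 10),
]
print(extract_equipment(sample))
```

In this sketch the fragments "PT" and "1001" are merged because their centres coincide and the vertical gap is small, while "NOTE 3" is rejected by the pattern. A real pipeline would need tolerances and patterns tuned to the drawing standard in use.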

Keywords

Information retrieval, Question-answering, Piping & Instrumentation Diagrams

Fields of Science and Technology Classification

Civil Engineering - Engineering and Technology
Electrical Engineering, Electronic Engineering, Information Engineering - Engineering and Technology
Mechanical Engineering - Engineering and Technology
Environmental Engineering - Engineering and Technology
Languages and Literature - Humanities

Publication Identifiers

DOI (source: author)	10.5220/0011528500003335
Ciência_Iscte ID	ci-pub-91284

Other Publication Details

Online Publication Year	2022
Publisher	SCITEPRESS – Science and Technology Publications, Lda
Indexes	--
ISSN	2184-3228 (online)
ISBN	978-989-758-614-9 (online)
Volume	1
Pages	204 - 211	Total Pages	8
Peer Reviewed	Yes
Editors	Coenen, F., Fred, A., and Filipe, J.
Event Title	14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR
City	Valletta, Malta
Event Type	Conference
Event Classification	International
Event Year	2022
Event Publication Type	Full Paper