Export Publication

The publication can be exported in the following formats: APA (American Psychological Association) reference format, IEEE (Institute of Electrical and Electronics Engineers) reference format, BibTeX and RIS.

Export Reference (APA)
Vicente, M., Guarda, J. & Batista, F. (2022). Gutenbrain: An architecture for equipment technical attributes extraction from Piping & Instrumentation Diagrams. In Coenen, F., Fred, A., and Filipe, J. (Ed.), Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR. (pp. 204-211). Valletta, Malta: SCITEPRESS – Science and Technology Publications, Lda.
Export Reference (IEEE)
M. Vicente et al.,  "Gutenbrain: An architecture for equipment technical attributes extraction from Piping & Instrumentation Diagrams", in Proc. of the 14th Int. Joint Conf. on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR, Coenen, F., Fred, A., and Filipe, J., Ed., Valletta, Malta, SCITEPRESS – Science and Technology Publications, Lda, 2022, vol. 1, pp. 204-211
Export BibTeX
@inproceedings{vicente2022_1715918312194,
	author = "Vicente, M. and Guarda, J. and Batista, F.",
	title = "Gutenbrain: An architecture for equipment technical attributes extraction from Piping & Instrumentation Diagrams",
	booktitle = "Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR",
	year = "2022",
	editor = "Coenen, F., Fred, A., and Filipe, J.",
	volume = "1",
	number = "",
	series = "",
	doi = "10.5220/0011528500003335",
	pages = "204-211",
	publisher = "SCITEPRESS – Science and Technology Publications, Lda",
	address = "Valletta, Malta",
	organization = "",
	url = "https://www.scitepress.org/ProceedingsDetails.aspx?ID=lLKzJgn1Xno=&t=1"
}
Export RIS
TY  - CPAPER
TI  - Gutenbrain: An architecture for equipment technical attributes extraction from Piping & Instrumentation Diagrams
T2  - Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR
VL  - 1
AU  - Vicente, M.
AU  - Guarda, J.
AU  - Batista, F.
PY  - 2022
SP  - 204-211
DO  - 10.5220/0011528500003335
CY  - Valletta, Malta
UR  - https://www.scitepress.org/ProceedingsDetails.aspx?ID=lLKzJgn1Xno=&t=1
AB  - Piping and Instrumentation Diagrams (P&ID) are detailed representations of engineering schematics with piping, instrumentation and other related equipment and their physical process flow. They are critical in engineering projects to convey the physical sequence of systems, allowing engineers to understand the process flow, safety and regulatory requirements, and operational details. P&IDs may be provided in several formats, including scanned paper, CAD files, PDF, images, but these documents are frequently searched manually to identify all the equipment and their inter-connectivity. Furthermore, engineers must search the related technical specifications in separate technical documents, as P&ID usually don’t include technical specifications. This paper presents Gutenbrain, an architecture to extract equipment technical attributes from piping & instrumentation diagrams and technical documentation, which relies in textual information only. It first extracts equipment from P&IDs, using m eta-data to understand the equipment type, and text coordinates to detect the equipment even when it is represented in multiple lines of text. After detecting the equipment and storing it in a database, it allows retrieving and inferring technical attributes from the related technical documentation using two question answering models based on BERT-like contextual embeddings, depending on the equipment type meta-data. One question answering model works with free questions of continuous text, while the other uses tabular data. This ensemble approach allows us to extract technical attributes from documents where information is unstructured and scattered. The performance results for the equipment extraction stage achieve about 97,2% precision and 71,2% recall. The stored information can be later accessed using Elasticsearch, allowing engineers to save thousands of hours in maintenance engineering tasks.
ER  -