Publication in conference proceedings
Segmentation Model for Judgments of the Portuguese Supreme Court of Justice
Martim Zanatti (Martim Zanatti); Ricardo Ribeiro (Ribeiro, R.); Helena Sofia Pinto (Pinto, H. Sofia);
Progress in Artificial Intelligence. EPIA 2024. Lecture Notes in Computer Science, vol 14969
Year (definitive publication)
2025
Language
English
Country
--
More Information
Web of Science®

Times Cited: 0

(Last checked: 2026-05-02 13:44)

View record in Web of Science®

Scopus

Times Cited: 1

(Last checked: 2026-04-26 19:07)

View record in Scopus

Google Scholar

This publication is not indexed in Google Scholar

This publication is not indexed in Overton

Abstract
Legal document segmentation is a critical task in the field of natural language processing (NLP), enabling efficient analysis, retrieval, and understanding of legal content. Despite its importance, research in this area for European Portuguese has been limited. To address this gap, we present a novel approach to automate the segmentation of legal judgments from the Portuguese Supreme Court of Justice into distinct sections. Leveraging a Bi-LSTM-CRF model, we developed a dataset and achieved significant results, including an accuracy of 0.9997, precision of 0.9986, recall of 0.996, and F1-Score of 0.9973. Our methodology and experimental results demonstrate the effectiveness and potential applications of our approach for the European Portuguese language.
Acknowledgements
--
Keywords
Sequence Labeling,Legal Documents Segmentation,Portuguese Legal Documents,Bi-LSTM-CRF,Dataset creation
Funding Records
Funding Reference Funding Entity
10.54499/UIDB/50021/2020 FCT
C645008882-00000055 PRR and NextGenerationEU