Ciência_Iscte
Publications
Publication Detailed Description
Segmentation Model for Judgments of the Portuguese Supreme Court of Justice
Progress in Artificial Intelligence. EPIA 2024. Lecture Notes in Computer Science, vol 14969
Year (definitive publication)
2025
Language
English
Country
--
More Information
Web of Science®
Scopus
Google Scholar
This publication is not indexed in Google Scholar
This publication is not indexed in Overton
Abstract
Legal document segmentation is a critical task in the field of natural language processing (NLP), enabling efficient analysis, retrieval, and understanding of legal content. Despite its importance, research in this area for European Portuguese has been limited. To address this gap, we present a novel approach to automate the segmentation of legal judgments from the Portuguese Supreme Court of Justice into distinct sections. Leveraging a Bi-LSTM-CRF model, we developed a dataset and achieved significant results, including an accuracy of 0.9997, precision of 0.9986, recall of 0.996, and F1-Score of 0.9973. Our methodology and experimental results demonstrate the effectiveness and potential applications of our approach for the European Portuguese language.
Acknowledgements
--
Keywords
Sequence Labeling,Legal Documents Segmentation,Portuguese Legal Documents,Bi-LSTM-CRF,Dataset creation
Funding Records
| Funding Reference | Funding Entity |
|---|---|
| 10.54499/UIDB/50021/2020 | FCT |
| C645008882-00000055 | PRR and NextGenerationEU |
Português