Less is more in incident categorization

Sara Silva; Ricardo Ribeiro; Rúben Pereira

Ciência_Iscte Publications Publication Detailed Description

Publication in conference proceedings Q4

Less is more in incident categorization

Sara Silva (Silva, S.); Ricardo Ribeiro (Ribeiro, R.); Rúben Pereira (Pereira, R.);

7th Symposium on Languages, Applications and Technologies, SLATE

Year (definitive publication)

2018

Language

English

Country

Portugal

More Information

Visit Link

Web of Science®

This publication is not indexed in Web of Science®

Scopus

Times Cited: 2

(Last checked: 2026-04-04 19:23)

View record in Scopus

Article Impact Index: 0.5

Google Scholar

Times Cited: 8

(Last checked: 2026-04-13 02:16)

View record in Google Scholar

Overton

This publication is not indexed in Overton

Abstract

The IT incident management process requires a correct categorization to attribute incident tickets to the right resolution group and obtain as quickly as possible an operational system, impacting the minimum as possible the business and costumers. In this work, we introduce automatic text classification, demonstrating the application of several natural language processing techniques and analyzing the impact of each one on a real incident tickets dataset. The techniques that we explore in the pre-processing of the text that describes an incident are the following: tokenization, stemming, eliminating stop-words, named-entity recognition, and TFxIDF-based document representation. Finally, to build the model and observe the results after applying the previous techniques, we use two machine learning algorithms: Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). Two important findings result from this study: a shorter description of an incident is better than a full description of an incident; and, pre-processing has little impact on incident categorization, mainly due the specific vocabulary used in this type of text.

Acknowledgements

Keywords

Machine learning,Automated incident categorization,SVM,Incident management,Natural language

Fields of Science and Technology Classification

Computer and Information Sciences - Natural Sciences
Electrical Engineering, Electronic Engineering, Information Engineering - Engineering and Technology

Publication Identifiers

Scopus (source: Ciência_Iscte)	2-s2.0-85052026088
Other ID (source: ORCID)	cv-prod-id-1422003
DOI (source: author)	10.4230/OASIcs.SLATE.2018.17
Scopus (source: author)	2-s2.0-85052026088
Ciência_Iscte ID	ci-pub-50350
Handle (source: Ciência-IUL)	http://hdl.handle.net/10071/16690

Other Publication Details

Online Publication Year	2018
Publisher	Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik
Indexes	Scopus;
ISSN	2190-6807 (print) 2190-6807 (online)
ISBN	978-3-95977-072-9 (print) 978-3-95977-072-9 (online)
Volume	62
Article Number	17
Pages	--	Total Pages	7
Peer Reviewed	Yes
Dissemination Mean	Digital
Editors	Pedro Rangel Henriques; José Paulo Leal; António Menezes Leitão; Xavier Gómez Guinovart
Event Title	--
Event Organizer	Universidade do Minho
City	Guimarães
Event Type	Conference
Event Classification	International
Event Year	2018
Event Publication Type	Full Paper
ISCTE-IUL Repository	Link to the repository
Publication Date (online)
Publication Date (print)

Altmetric

Dimensions

PlumX Metrics