Export Publication

The publication can be exported in the following formats: APA (American Psychological Association) reference format, IEEE (Institute of Electrical and Electronics Engineers) reference format, BibTeX and RIS.

Export Reference (APA)
Silva, S., Ribeiro, R. & Pereira, R. (2018). Less is more in incident categorization. In Pedro Rangel Henriques; José Paulo Leal; António Menezes Leitão; Xavier Gómez Guinovart (Ed.), 7th Symposium on Languages, Applications and Technologies, SLATE. Guimarães: Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik.
Export Reference (IEEE)
S. Silva et al.,  "Less is more in incident categorization", in 7th Symp. on Languages, Applications and Technologies, SLATE, Pedro Rangel Henriques; José Paulo Leal; António Menezes Leitão; Xavier Gómez Guinovart, Ed., Guimarães, Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik, 2018, vol. 62
Export BibTeX
@inproceedings{silva2018_1765835917716,
	author = "Silva, S. and Ribeiro, R. and Pereira, R.",
	title = "Less is more in incident categorization",
	booktitle = "7th Symposium on Languages, Applications and Technologies, SLATE",
	year = "2018",
	editor = "Pedro Rangel Henriques; José Paulo Leal; António Menezes Leitão; Xavier Gómez Guinovart",
	volume = "62",
	number = "",
	series = "",
	doi = "10.4230/OASIcs.SLATE.2018.17",
	publisher = "Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik",
	address = "Guimarães",
	organization = "Universidade do Minho",
	url = "http://slate-conf.org/2018/home"
}
Export RIS
TY  - CPAPER
TI  - Less is more in incident categorization
T2  - 7th Symposium on Languages, Applications and Technologies, SLATE
VL  - 62
AU  - Silva, S.
AU  - Ribeiro, R.
AU  - Pereira, R.
PY  - 2018
SN  - 2190-6807
DO  - 10.4230/OASIcs.SLATE.2018.17
CY  - Guimarães
UR  - http://slate-conf.org/2018/home
AB  - The IT incident management process requires a correct categorization to attribute incident tickets to the right resolution group and obtain as quickly as possible an operational system, impacting the minimum as possible the business and costumers. In this work, we introduce automatic text classification, demonstrating the application of several natural language processing techniques and analyzing the impact of each one on a real incident tickets dataset. The techniques that we explore in the pre-processing of the text that describes an incident are the following: tokenization, stemming, eliminating stop-words, named-entity recognition, and TFxIDF-based document representation. Finally, to build the model and observe the results after applying the previous techniques, we use two machine learning algorithms: Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). Two important findings result from this study: a shorter description of an incident is better than a full description of an incident; and, pre-processing has little impact on incident categorization, mainly due the specific vocabulary used in this type of text.
ER  -