Crowdsmelling: A preliminary study on using collective knowledge in code smells detection

José Pereira dos Reis; Fernando Brito e Abreu; Glauco Carneiro

Ciência_Iscte Publicações Descrição Detalhada da Publicação

Artigo em revista científica Q1

Crowdsmelling: A preliminary study on using collective knowledge in code smells detection

José Pereira dos Reis (Reis, J.); Fernando Brito e Abreu (Brito e Abreu, F.); Glauco Carneiro (Figueiredo Carneiro, G.);

Título Revista

Empirical Software Engineering

Ano (publicação definitiva)

2022

Língua

Inglês

País

Países Baixos (Holanda)

Mais Informação

Visitar Link

Web of Science®

N.º de citações: 8

(Última verificação: 2025-04-29 09:51)

Ver o registo na Web of Science®

Índice de Impacto do Artigo: 0.7

Ver Mais

Scopus

N.º de citações: 11

(Última verificação: 2025-04-21 20:13)

Ver o registo na Scopus

Índice de Impacto do Artigo: 0.8

Ver Mais

Google Scholar

N.º de citações: 11

(Última verificação: 2025-04-26 13:25)

Ver o registo no Google Scholar

Overton

Esta publicação não está indexada no Overton

Abstract/Resumo

Code smells are seen as a major source of technical debt and, as such, should be detected and removed. However, researchers argue that the subjectiveness of the code smells detection process is a major hindrance to mitigating the problem of smells-infected code. This paper presents the results of a validation experiment for the Crowdsmelling approach proposed earlier. The latter is based on supervised machine learning techniques, where the wisdom of the crowd (of software developers) is used to collectively calibrate code smells detection algorithms, thereby lessening the subjectivity issue. In the context of three consecutive years of a Software Engineering course, a total ``crowd'' of around a hundred teams, with an average of three members each, classified the presence of 3 code smells (Long Method, God Class, and Feature Envy) in Java source code. These classifications were the basis of the oracles used for training six machine learning algorithms. Over one hundred models were generated and evaluated to determine which machine learning algorithms had the best performance in detecting each of the aforementioned code smells. Good performances were obtained for God Class detection (ROC=0.896 for Naive Bayes) and Long Method detection (ROC=0.870 for AdaBoostM1), but much lower for Feature Envy (ROC=0.570 for Random Forrest). The results suggest that Crowdsmelling is a feasible approach for the detection of code smells. Further validation experiments based on dynamic learning are required to comprehensive coverage of code smells to increase external validity.

Agradecimentos/Acknowledgements

This work was partially funded by the Portuguese Foundation for Science and Technology, under ISTAR's projects UIDB/04466/2020 and UIDP/04466/2020, and by Anima Institute (Edital Nº 43/2021).

Palavras-chave

Crowdsmelling,Code smells,Code smells detection,Software quality,Software maintenance,Collective knowledge,Machine learning algorithms

Classificação Fields of Science and Technology

Ciências da Computação e da Informação - Ciências Naturais

Registos de financiamentos

Referência de financiamento	Entidade Financiadora
UIDP/04466/2020	Fundação para a Ciência e a Tecnologia
UIDB/04466/2020	Fundação para a Ciência e a Tecnologia

Projetos Relacionados

Esta publicação é um output do(s) seguinte(s) projeto(s):

Contribuições para os Objetivos do Desenvolvimento Sustentável das Nações Unidas

Com o objetivo de aumentar a investigação direcionada para o cumprimento dos Objetivos do Desenvolvimento Sustentável para 2030 das Nações Unidas, é disponibilizada no Ciência_Iscte a possibilidade de associação, quando aplicável, dos artigos científicos aos Objetivos do Desenvolvimento Sustentável. Estes são os Objetivos do Desenvolvimento Sustentável identificados pelo(s) autor(es) para esta publicação. Para uma informação detalhada dos Objetivos do Desenvolvimento Sustentável, clique aqui.

Identificadores da Publicação

WoS (fonte: Ciência_Iscte)	WOS:000770339500010
Scopus (fonte: autor)	2-s2.0-85126535966
DOI (fonte: autor)	10.1007/s10664-021-10110-5
Scopus (fonte: Ciência_Iscte)	2-s2.0-85126535966
Handle (fonte: Ciência-IUL)	http://hdl.handle.net/10071/25596
WoS (fonte: autor)	000770339500010
ID Ciência_Iscte	ci-pub-84764

Outros Detalhes da Publicação

Ano Publicação Online	2022
Editora	Springer
Indexação	Web of Science©; Scopus;
ISSN	1382-3256 (print) 1573-7616 (online)
ISBN	--
Factor de Impacto	7.300
Volume	27	Número	3
Série
Número Artigo	69
Páginas	--
Avaliado Cientificamente	Sim
Meio de Divulgação	Ambos (impresso e digital)
Repositório ISCTE-IUL	Link para o repositório
Data Publicação (online)	2022-03-17
Data Publicação (print)

Altmetric

Dimensions

PlumX Metrics

Citations

Citation Indexes: 11

Captures

Readers: 39

see details