Evaluating a clustering solution: an application in the tourism market

Margarida G. M. S. Cardoso; Isabel H. Themido; Fernando Moura Pires

Ciência_Iscte Publicações Descrição Detalhada da Publicação

Artigo em revista científica Q2

Evaluating a clustering solution: an application in the tourism market

Margarida G. M. S. Cardoso (Cardoso, M. G. M. S.); Isabel H. Themido (Themido, I. H.); Fernando Moura Pires (Moura Pires, F.);

Título Revista

Intelligent Data Analysis

Ano (publicação definitiva)

1999

Língua

Inglês

País

Estados Unidos da América

Mais Informação

Visitar Link

Web of Science®

Esta publicação não está indexada na Web of Science®

Scopus

N.º de citações: 10

(Última verificação: 2026-07-11 07:10)

Ver o registo na Scopus

Google Scholar

Esta publicação não está indexada no Google Scholar

Overton

Esta publicação não está indexada no Overton

Abstract/Resumo

This paper discusses the evaluation of a clustering solution. Criteria based on the number of clusters and discrimination and classification processes are used to evaluate a clustering solution. The proposed approach is based on two paradigms: Statistics and Machine Learning. A multimethodological approach is advocated in the construction of models associating between properties and clusters, to provide a wider and richer set of analysis perspectives and a better knowledge discovery. Specifically, the construction of classification and discrimination logical models as a complement of quantitative statistical models is particularly useful when most of the available information is of a qualitative nature (nominal or ordinal variables). Both, the classification's global precision and the comprehension added by the discriminant model to the association between variables and clusters, are essential to evaluate a clustering solution. Depending on the dimension of the sample, descriptive analysis performed can be validated through the partition in two of the total sample-(one sub-sample for model build-up and another (holdout) for validation)-or by other procedures of cross-validation. The proposed evaluation approach is applied to a Marketing Tourism case study. The clustering solution is built upon a sample of more than 2500 Portuguese clients of Pousadas de Portugal Hotels. The database includes variables related to the evaluation of stay (per client) at the Pousadas and profiles of the surveyed clients on holidays, demographic and psychographic aspects. Measures of association, Chi-square tests, ANOVA, Discriminant Analysis, Logistic Regression, and Rule Induction (based on CN2 and C4.5 algorithms) are applied in evaluating the clustering solution built through a K-Means process.

Agradecimentos/Acknowledgements

Palavras-chave

Clustering,Machine learning,Marketing and tourism,Multivariate statistics

Classificação Fields of Science and Technology

Matemáticas - Ciências Naturais
Ciências da Computação e da Informação - Ciências Naturais

Identificadores da Publicação

Scopus (fonte: autor)	2-s2.0-0011141514
DOI (fonte: autor)	10.3233/IDA-1999-3606
Scopus (fonte: Ciência_Iscte)	2-s2.0-0011141514
ID Ciência_Iscte	ci-pub-52314

Outros Detalhes da Publicação

Ano Publicação Online	1999
Editora	IOS Press
Indexação	Scopus;
ISSN	1088-467X (print) 1088-467X (online)
ISBN	--
Factor de Impacto	--
Volume	3	Número	6
Série
Número Artigo
Páginas	491 - 510
Avaliado Cientificamente	Sim
Data Publicação (online)
Data Publicação (print)

Altmetric

Dimensions

PlumX Metrics