Comunicação em evento científico
The performance of a combined distance between time series
Margarida G. M. S. Cardoso (Cardoso, M. G. M. S.); Ana Alexandra A. F. Martins (Martins, A. A. A. F.);
Título Evento
XXV Congress of the Portuguese Statistical Society
Ano (publicação definitiva)
2021
Língua
Inglês
País
Portugal
Mais Informação
Web of Science®

Esta publicação não está indexada na Web of Science®

Scopus

Esta publicação não está indexada na Scopus

Google Scholar

Esta publicação não está indexada no Google Scholar

Abstract/Resumo
The use of dissimilarity measures between time series is critical in several data analysis tasks which range from simple querying to classification, clustering and anomaly detection. Recently, we proposed a new dissimilarity measure, a convex combination of four (normalized) distance measures which offer complementary perspectives on the differences between two time series: the Euclidean distance which captures differences in scale; a Pearson correlation based measure that takes into account linear increasing and decreasing trends over time; a Periodogram based measure that expresses the dissimilarities between frequencies or cyclical components of the series; and a distance between estimated autocorrelation structures, comparing the series in terms of their dependence on past observations. We conduct an experimental analysis, to evaluate the comparative performance of this combined distance measure, resorting to the UCR Time-Series Archive that includes time series data sets from a wide variety of application domains. We follow a methodology suggested in previous studies [?] that were conducted to compare several dissimilarity measures and their variants: we use one nearest neighbor (1NN) classifier on labelled data to evaluate the efficacy of the distance measures. In fact, since the distance measure used is critical to 1NN accuracy, this indicator directly reflects the effectiveness of the dissimilarity measure used. We conclude that the proposed combined measure is competitive in several settings. Finally, we suggest further research taking into account normalization methods.
Agradecimentos/Acknowledgements
--
Palavras-chave
clustering,distance measures,time series