Scientific journal paper Q1
The performance of distances between time series: An in-depth comparison
Margarida G. M. S. Cardoso (Cardoso, M. G. M. S.); Ana Alexandra A. F. Martins (Martins, A. A.);
Journal Title
Expert Systems
Year (definitive publication)
2025
Language
English
Country
United States of America
More Information
Web of Science®

Times Cited: 0

(Last checked: 2025-12-12 10:45)

View record in Web of Science®

Scopus

Times Cited: 0

(Last checked: 2025-12-01 19:02)

View record in Scopus

Google Scholar

This publication is not indexed in Google Scholar

This publication is not indexed in Overton

Abstract
The performance of distance measures between time series has been discussed in diverse studies. Most identified performance as the accuracy resulting from the use of a specific distance in 1-Nearest Neighbour. Few studies have addressed the related computation time, and no systematic analyses of the associations between the distances' performance (1-NN-based accuracy and computation time) and the time series' characteristics have been presented yet. We propose to fill this research gap by analysing these relationships considering the following features: the training and test sets' dimensions, the time series' length, the number of classes, and the classes' separability as measured by the Average Silhouette index. This last characteristic was not mentioned in previous studies. A methodological approach is devised to compare nine distance measures, including three recently proposed combined distances (COMB and two variants). We resort to a stepwise method for multiple comparisons and deal with the experiment-wise error rate to obtain homogeneous groups of distances with indistinct performances. The CART algorithm is used to explore the relationships between accuracy values corresponding to each distance measure under study (target) and the time series characteristics (predictors). Our analyses are based on datasets from the UCR time series classification archive. We concluded that the combined distance (COMB), dynamic time warping distance (DTW), and complexity invariance distance (CID) are consistently included in the subset of best-performing distances in all experimental scenarios. The latter (CID) has a significantly lower computational cost. We determined that the classes' separability is the time series' attribute most associated with the distances' performance.
Acknowledgements
This research was supported by Fundacao para a Ciencia e a Tecnologia, grant UIDB/00315/2020 (DOI: 10.54499/UIDB/00315/2020). It was also supported by Instituto Politécnico Lisboa (IPL) with reference IPL/IDI&CA2023/ELForcast2_ISEL and FCT Portugal, throu
Keywords
1-nearest neighbour classifier,Classification,Combined distance,Distances between time series,Multiple comparison procedures,Time series,UCR archive
  • Mathematics - Natural Sciences
  • Computer and Information Sciences - Natural Sciences
  • Civil Engineering - Engineering and Technology
Funding Records
Funding Reference Funding Entity
UIDB/00315/2020 Fundação para a Ciência e a Tecnologia
UID/MAT/04674/2013 Fundação para a Ciência e a Tecnologia
IPL/IDI&CA2023/ELForcast2_ISEL Instituto Politécnico Lisboa