Evaluating Performance of Unsupervised Machine Learning Methods for Time Series Clustering
Abstract Number:
3223
Submission Type:
Contributed Abstract
Contributed Abstract Type:
Speed
Participants:
Brittney Marian (1), Yue Zhang (2), Kenan Li (3), Erika Garcia (1), Sandrah Eckel (1)
Institutions:
(1) University of Southern California, N/A, (2) University of Utah, N/A, (3) Saint Louis University, N/A
Co-Author(s):
First Author:
Presenting Author:
Abstract Text:
Unsupervised clustering is widely used to discover patterns in data without pre-defined labels. Clustering methods for time series data have been less studied and still present challenges. In this study, we use simulated data to showcase the performance of clustering algorithms on time series data and provide new insights into methodological choices. We selected a range of clustering algorithms-Hierarchical, k-means, k-medoids, Gaussian mixture, self-organizing maps, and density-based clustering-and distance metrics included Euclidean, correlation-based distances, dynamic time warping (DTW), and variants like weighted DTW. Results were evaluated using the adjusted Rand index and validated with known cluster labels. Preliminary findings in simulated univariate time series data showed that data transformation (i.e., standardization) was the leading determinant of clustering performance. In benchmark multivariate time series data, clustering performance was weaker. Next steps include investigations using simulated multivariate data. Results inform a project to identify distinct diurnal patterns of multiple air pollutants.
Keywords:
time series data|clustering|unsupervised learning| | |
Sponsors:
Section on Statistical Learning and Data Science
Tracks:
Machine Learning
Can this be considered for alternate subtype?
Yes
Are you interested in volunteering to serve as a session chair?
No
I have read and understand that JSM participants must abide by the Participant Guidelines.
Yes
I understand that JSM participants must register and pay the appropriate registration fee by June 1, 2024. The registration fee is non-refundable.
I understand
You have unsaved changes.