Evaluating Performance of Unsupervised Machine Learning Methods for Time Series Clustering

Abstract Number:

3223 

Submission Type:

Contributed Abstract 

Contributed Abstract Type:

Speed 

Participants:

Brittney Marian (1), Yue Zhang (2), Kenan Li (3), Erika Garcia (1), Sandrah Eckel (1)

Institutions:

(1) University of Southern California, N/A, (2) University of Utah, N/A, (3) Saint Louis University, N/A

Co-Author(s):

Yue Zhang  
University of Utah
Kenan Li  
Saint Louis University
Erika Garcia  
University of Southern California
Sandrah Eckel  
University of Southern California

First Author:

Brittney Marian  
University of Southern California

Presenting Author:

Brittney Marian  
N/A

Abstract Text:

Unsupervised clustering is widely used to discover patterns in data without pre-defined labels. Clustering methods for time series data have been less studied and still present challenges. In this study, we use simulated data to showcase the performance of clustering algorithms on time series data and provide new insights into methodological choices. We selected a range of clustering algorithms-Hierarchical, k-means, k-medoids, Gaussian mixture, self-organizing maps, and density-based clustering-and distance metrics included Euclidean, correlation-based distances, dynamic time warping (DTW), and variants like weighted DTW. Results were evaluated using the adjusted Rand index and validated with known cluster labels. Preliminary findings in simulated univariate time series data showed that data transformation (i.e., standardization) was the leading determinant of clustering performance. In benchmark multivariate time series data, clustering performance was weaker. Next steps include investigations using simulated multivariate data. Results inform a project to identify distinct diurnal patterns of multiple air pollutants.

Keywords:

time series data|clustering|unsupervised learning| | |

Sponsors:

Section on Statistical Learning and Data Science

Tracks:

Machine Learning

Can this be considered for alternate subtype?

Yes

Are you interested in volunteering to serve as a session chair?

No

I have read and understand that JSM participants must abide by the Participant Guidelines.

Yes

I understand that JSM participants must register and pay the appropriate registration fee by June 1, 2024. The registration fee is non-refundable.

I understand