Random Forests and Clustering for Identifying Clinical Phenotypes

Abstract Number:

3520 

Submission Type:

Contributed Abstract 

Contributed Abstract Type:

Speed 

Participants:

Barbara Bailey (1)

Institutions:

(1) San Diego State University, N/A

First Author:

Barbara Bailey  
San Diego State University

Presenting Author:

Barbara Bailey  
San Diego State University

Abstract Text:

Random Forests can be used for classification and clustering. In the supervised Random Forest used for classification, each subject will have a known grouping. In the unsupervised Random Forest used for clustering, the proximity matrix needed for clustering can be estimated. Clustering algorithms use data to form groups of similar subjects that share distinct properties. Phenotypes can be identified using a proximity matrix generated by the unsupervised Random Forests and subsequent clustering by the Partitioning around Medoids (PAM) algorithm.
PAM uses the dissimilarity matrix in its class partitioning or clustering algorithm and is more robust to noise and outliers as compared to the more commonly used k-means algorithm.

We present results that identify distinct phenotypes or groups of subjects that are Hispanic/Latino with chronic low back pain. Data consisted of sensor-based measures of posture and movement, pain behavior, and psychological measures. Groupings may provide a basis for a more personalized plan of care, including pain management strategies that encourage movement and rest periods.

Keywords:

random forests|chronic lower back pain| | | |

Sponsors:

WNAR

Tracks:

Miscellaneous

Can this be considered for alternate subtype?

Yes

Are you interested in volunteering to serve as a session chair?

Yes

I have read and understand that JSM participants must abide by the Participant Guidelines.

Yes

I understand that JSM participants must register and pay the appropriate registration fee by June 1, 2024. The registration fee is non-refundable.

I understand