12. Consensus Dimension Reduction via Data Integration

Conference: Women in Statistics and Data Science 2025
11/13/2025: 2:30 PM - 4:00 PM EST
Speed 

Description

A plethora of dimension reduction methods have been developed to visualize high-dimensional data in low dimensions. However, different dimension reduction methods often output different visualizations, and there are many challenges that make it difficult for researchers to determine which visualization is best. We thus propose a novel consensus dimension reduction framework, which summarizes multiple visualizations into a single "consensus" visualization. Here, we leverage ideas from data integration in order to identify the patterns that are most stable or shared across the many different dimension reduction visualizations and subsequently visualize this shared structure in a single low-dimensional plot. We demonstrate that this consensus visualization effectively identifies and preserves the shared low-dimensional data structure through extensive simulations and real-world case studies. We further highlight our method's robustness to the choice of dimension reduction method and/or hyperparameters --- a highly-desirable property when working towards trustworthy and reproducible data science.

Keywords

Dimension Reduction

Data Integration

Data Visualization

High Dimensional Data 

Presenting Author

Bingxue An, University of Notre Dame

First Author

Bingxue An, University of Notre Dame

CoAuthor

Tiffany Tang

Target Audience

Beginner

Tracks

Knowledge
Women in Statistics and Data Science 2025