12. Consensus Dimension Reduction via Data Integration
Conference: Women in Statistics and Data Science 2025
11/13/2025: 2:30 PM - 4:00 PM EST
Speed
A plethora of dimension reduction methods have been developed to visualize high-dimensional data in low dimensions. However, different dimension reduction methods often output different visualizations, and there are many challenges that make it difficult for researchers to determine which visualization is best. We thus propose a novel consensus dimension reduction framework, which summarizes multiple visualizations into a single "consensus" visualization. Here, we leverage ideas from data integration in order to identify the patterns that are most stable or shared across the many different dimension reduction visualizations and subsequently visualize this shared structure in a single low-dimensional plot. We demonstrate that this consensus visualization effectively identifies and preserves the shared low-dimensional data structure through extensive simulations and real-world case studies. We further highlight our method's robustness to the choice of dimension reduction method and/or hyperparameters --- a highly-desirable property when working towards trustworthy and reproducible data science.
Dimension Reduction
Data Integration
Data Visualization
High Dimensional Data
Presenting Author
Bingxue An, University of Notre Dame
First Author
Bingxue An, University of Notre Dame
CoAuthor
Tiffany Tang
Target Audience
Beginner
Tracks
Knowledge
Women in Statistics and Data Science 2025
You have unsaved changes.