Visualize your fitted non-linear dimension reduction model in the high-dimensional data space

Dianne Cook Co-Author
Monash University
 
Paul Harrison Co-Author
Monash University
 
Michael Lydeamore Co-Author
Dr
 
Thiyanga S. Talagala Co-Author
University of Sri Jayewardenepura, Sri Lanka
 
Piyadi Gamage Jayani Lakshika Presenting Author
 
Monday, Aug 4: 12:15 PM - 12:20 PM
1051 
Contributed Speed 
Music City Center 
Non-linear dimension reduction (NLDR) techniques such as tSNE, UMAP provide a low-dimensional representation of high-dimensional data by applying non-linear transformation. The methods and parameter choices can create wildly different representations, so much so that it is difficult to decide which is best, or whether any or all are accurate or misleading. NLDR often exaggerates random patterns, sometimes due to the samples observed, but NLDR views have an important role in data analysis because, if done well, they provide a concise visual (and conceptual) summary of high dimensional distributions. To help evaluate the NLDR we have developed a way to take the fitted model, as represented by the positions of points in 2D, and turn it into a high-dimensional wireframe to overlay on the data, viewing it with a tour. Viewing a model in the data space is an ideal way to examine the fit. It is used here to help with the difficult decision on which 2D layout is the best representation of the high-dimensional distribution, or whether the 2D layout is displaying mostly random structure, and how different methods have same summary or particular quirks. Available in the R package `quollr`.

Keywords

high-dimensional data vizualization

non-linear dimension reduction

tour 

Main Sponsor

Section on Statistical Graphics