Nonlinear Embedding and Integration of Omics Data: A Fast and Tuning-Free Approach

Tianwei Yu Co-Author
 
Shengjie Liu First Author
The Chinese University of Hong Kong, Shenzhen
 
Tianwei Yu Presenting Author
 
Wednesday, Aug 6: 2:35 PM - 2:50 PM
0981 
Contributed Papers 
Music City Center 
The rapid progress of single-cell technology is enabling biologists to unravel the intricacies of cell populations, disease states, and developmental lineages. The high-dimensional, noisy, and sparse nature of single-cell omics data poses significant analytical challenges. Here, we introduce DCOL (Dissimilarity based on Conditional Ordered List) correlation, a functional dependency measure for quantifying nonlinear relationships between variables. Based on this measure, we propose DCOL-PCA and DCOL-CCA, for dimension reduction and integration of single- and multi-omics data. In simulation studies, our methods outperformed eight other DR methods and four joint dimension reduction (jDR) methods, showcasing stable performance across various settings. It proved highly effective in extracting essential factors even in the most challenging scenarios. We also validated these methods on real datasets, with our method demonstrating its ability to detect intricate signals within and between omics data and generate lower-dimensional embeddings that preserve the essential information and latent structures in the data.

Keywords

Nonlinear Dimensionality Reduction

Single-cell Analysis

Multi-Omics Integration 

Main Sponsor

Section on Statistics in Genomics and Genetics