Kernel Density Balancing for Hi-C data

Ning Hao Co-Author
University of Arizona
 
John Park First Author
 
John Park Presenting Author
 
Monday, Aug 4: 10:50 AM - 11:05 AM
1183 
Contributed Papers 
Music City Center 
High-throughput chromatin conformation capture (Hi-C) data provide insights into the 3D structure of chromosomes, with normalization being a crucial pre-processing step. A common technique for normalization is matrix balancing, which rescales rows and columns of a Hi-C matrix to equalize their sums. Despite its popularity and convenience, matrix balancing lacks statistical justification. In this talk, we introduce a statistical model to analyze matrix balancing methods and propose a kernel-based estimator that leverages spatial structure. Under mild assumptions, we demonstrate that the kernel-based method is consistent, converges faster, and is more robust to data sparsity when compared to existing approaches.

Keywords

Density Estimation

Matrix Balancing 

Main Sponsor

Section on Statistical Learning and Data Science