Yuping Zhang, Disheng Mao, Zhengqing Ouyang
Recent development of high-throughput biotechnologies, such as Hi-C, have enabled genome-wide measurement of chromosomal conformation. The interaction signals among genomic loci are contaminated with noises. It remains largely unknown how well the underlying chromosomal conformation can be elucidated, based on massive and noisy measurements. We propose a new model-based distance embedding (MDE) framework, to reveal spatial organizations of chromosomes. The proposed framework is a general methodology, which allows us to link accurate probabilistic models, which characterize biological data properties, to efficiently recovering Euclidean distance matrices from noisy observations...
September 2022: Annals of Applied Statistics