Computational Approaches for Predicting DNA Methylation and Constructing Whole-Genome Structures Based on Hi-C Data
- Publication date
- 2026
- Publisher
Abstract
Recently, a biochemistry experiment named methyl-3C was developed to simultaneously capture the chromosomal conformations and DNA methylation levels on individual single cells. However, the number of data sets generated from this experiment is still small in the scientific community compared with the greater amount of single-cell Hi-C data generated from separate single cells. Therefore, a computational tool is needed to predict single-cell methylation levels based on single-cell Hi-C data on the same individual cells. We developed a graph transformer named scHiMe to accurately predict the base-pair-specific (bp-specific) methylation levels based on single-cell Hi-C data and DNA nucleotide sequences. We benchmarked scHiMe for predicting the bp-specific methylation levels on all of the promoters of the human genome, all of the promoter regions together with the corresponding first exon and intron regions, and random regions on the whole genome. Our evaluation showed a high consistency between the predicted and methyl-3C-detected methylation levels. Moreover, the predicted DNA methylation levels resulted in accurate classifications of cells into different cell types, which indicated that our algorithm successfully captured the cell-to-cell variability in the single-cell Hi-C data. scHiMe is freely available at http://dna.cs.miami.edu/scHiMe/. </p