Predicting regulatory variants with composite statistic

Abstract

Motivation: Prediction and prioritization of human noncoding regulatory variants is critical for understanding the regulatory mechanisms of disease pathogenesis and promoting personalized medicine. Existing tools utilize functional genomics data and evolutionary information to evaluate the pathogenicity or regulatory functions of noncoding variants. However, different algorithms lead to inconsistent and even conflicting predictions. Combining multiple methods may increase accuracy in regulatory variant prediction. Results: Here, we compiled an integrative resource for predictions from eight different tools on functional annotation of noncoding variants. We further developed a composite strategy to integrate multiple predictions and computed the composite likelihood of a given variant being regulatory variant. Benchmarked by multiple independent causal variants datasets, we demonstrated that our composite model significantly improves the prediction performance. Availability: We implemented our model and scoring procedure as a tool, named PRVCS, which is freely available to academic and nonprofit usage at http://jjwanglab.org/PRVCS.Statistic

    Similar works