23 research outputs found
EpiGRAPH: user-friendly software for statistical analysis and prediction of (epi)genomic data
EpiGRAPH is a genome-scale data-mining software tool that enables users to identify epigenetic and gene regulatory features in large datasets of genomic regions
A genotypic method for determining HIV-2 coreceptor usage enables epidemiological studies and clinical decision support
Background: CCR5-coreceptor antagonists can be used for treating HIV-2 infected individuals. Before initiating treatment with coreceptor antagonists, viral coreceptor usage should be determined to ensure that the virus can use only the CCR5 coreceptor (R5) and cannot evade the drug by using the CXCR4 coreceptor (X4-capable). However, until now, no online tool for the genotypic identification of HIV-2 coreceptor usage had been available. Furthermore, there is a lack of knowledge on the determinants of HIV-2 coreceptor usage. Therefore, we developed a data-driven web service for the prediction of HIV-2 coreceptor usage from the V3 loop of the HIV-2 glycoprotein and used the tool to identify novel discriminatory features of X4-capable variants. Results: Using 10 runs of tenfold cross validation, we selected a linear support vector machine (SVM) as the model for geno2pheno[coreceptor-hiv2], because it outperformed the other SVMs with an area under the ROC curve (AUC) of 0.95. We found that SVMs were highly accurate in identifying HIV-2 coreceptor usage, attaining sensitivities of 73.5% and specificities of 96% during tenfold nested cross validation. The predictive performance of SVMs was not significantly different (p value 0.37) from an existing rules-based approach. Moreover, geno2pheno[coreceptor-hiv2] achieved a predictive accuracy of 100% and outperformed the existing approach on an independent data set containing nine new isolates with corresponding phenotypic measurements of coreceptor usage. geno2pheno[coreceptor-hiv2] could not only reproduce the established markers of CXCR4-usage, but also revealed novel markers: the substitutions 27K, 15G, and 8S were significantly predictive of CXCR4 usage. Furthermore, SVMs trained on the amino-acid sequences of the V1 and V2 loops were also quite accurate in predicting coreceptor usage (AUCs of 0.84 and 0.65, respectively). Conclusions: In this study, we developed geno2pheno[coreceptor-hiv2], the first online tool for the prediction of HIV-2 coreceptor usage from the V3 loop. Using our method, we identified novel amino-acid markers of X4-capable variants in the V3 loop and found that HIV-2 coreceptor usage is also influenced by the V1/V2 region. The tool can aid clinicians in deciding whether coreceptor antagonists such as maraviroc are a treatment option and enables epidemiological studies investigating HIV-2 coreceptor usage. geno2pheno[coreceptor-hiv2] is freely available at http://coreceptor-hiv2.geno2pheno.org
Predicting the Response to Combination Antiretroviral Therapy: Retrospective Validation of geno2pheno-THEO on a Large Clinical Database
BackgroundExpert-based genotypic interpretation systems are standard methods for guiding treatment selection for patients infected with human immunodeficiency virus type 1. We previously introduced the software pipeline geno2pheno-THEO (g2p-THEO), which on the basis of viral sequence predicts the response to treatment with a combination of antiretroviral compounds by applying methods from statistical learning and the estimated potential of the virus to escape from drug pressure MethodsWe retrospectively validated the statistical model used by g2p-THEO in ∼7600 independent treatment-sequence pairs extracted from the EuResist integrated database, ranging from 1990 to 2007. Results were compared with the 3 most widely used expert-based interpretation systems: Stanford HIVdb, ANRS, and Rega ResultsThe difference in receiver operating characteristic curves between g2p-THEO and expert-based approaches was significant (P<.001; paired Wilcoxon test). Indeed, at 80% specificity, g2p-THEO found 16.2%-19.8% more successful regimens than did the expert-based approaches. The increased performance of g2p-THEO was confirmed in a 2001-2007 data set from which most obsolete therapies had been removed ConclusionFinding drug combinations that increase the chances of therapeutic success is the main reason for using decision support systems. The present analysis of a large data set derived from clinical practice demonstrates that g2p-THEO solves this task significantly better than state-of-the-art expert-based systems. The tool is available at http://www.geno2pheno.or
Comparison of Classifier Fusion Methods for Predicting Response to Anti HIV-1 Therapy
BACKGROUND: Analysis of the viral genome for drug resistance mutations is state-of-the-art for guiding treatment selection for human immunodeficiency virus type 1 (HIV-1)-infected patients. These mutations alter the structure of viral target proteins and reduce or in the worst case completely inhibit the effect of antiretroviral compounds while maintaining the ability for effective replication. Modern anti-HIV-1 regimens comprise multiple drugs in order to prevent or at least delay the development of resistance mutations. However, commonly used HIV-1 genotype interpretation systems provide only classifications for single drugs. The EuResist initiative has collected data from about 18,500 patients to train three classifiers for predicting response to combination antiretroviral therapy, given the viral genotype and further information. In this work we compare different classifier fusion methods for combining the individual classifiers. PRINCIPAL FINDINGS: The individual classifiers yielded similar performance, and all the combination approaches considered performed equally well. The gain in performance due to combining methods did not reach statistical significance compared to the single best individual classifier on the complete training set. However, on smaller training set sizes (200 to 1,600 instances compared to 2,700) the combination significantly outperformed the individual classifiers (p<0.01; paired one-sided Wilcoxon test). Together with a consistent reduction of the standard deviation compared to the individual prediction engines this shows a more robust behavior of the combined system. Moreover, using the combined system we were able to identify a class of therapy courses that led to a consistent underestimation (about 0.05 AUC) of the system performance. Discovery of these therapy courses is a further hint for the robustness of the combined system. CONCLUSION: The combined EuResist prediction engine is freely available at http://engine.euresist.org
DASMI: exchanging, annotating and assessing molecular interaction data
Motivation: Ever increasing amounts of biological interaction data are being accumulated worldwide, but they are currently not readily accessible to the biologist at a single site. New techniques are required for retrieving, sharing and presenting data spread over the Internet
Arevir: A Secure Platform for Designing Personalized Antiretroviral Therapies Against HIV
Despite the availability of antiretroviral combination therapies, success
in drug treatment of HIV-infected patients is limited. One reason for therapy
failure is the development of drug-resistant genetic variants. In principle,
the viral genomic sequence provides resistance information and could thus
guide the selection of an optimal drug combination. In practice however, the
benefit of this procedure is impaired by (1) the difficulty in inferring the clinically
relevant information from the genotype of the virus and (2) the restricted
availability of this information. We have developed a secure platform for
collaborative research aimed at optimizing anti-HIV therapies, called Arevir. A
relational database schema was designed and implemented together with a webbased
user interface. Our system provides a basis for monitoring patients, decision-
support, and computational analyses. Thus, it merges clinical, diagnostic
and bioinformatics efforts to exploit genomic and patient therapy data in clinical
practice