
NORA - Norwegian Open Research Archives

Proteochemometric modeling of HIV protease susceptibility

Author: Eklund Martin
Lapins Maris
Prusis Peteris
Spjuth Ola
Wikberg Jarl ES
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: A major obstacle in the treatment of HIV is the ability of the virus to mutate rapidly into drug-resistant variants. A method for predicting the susceptibility of mutated HIV strains to antiviral agents would provide substantial clinical benefit as well as facilitate the development of new candidate drugs. Therefore, we used proteochemometrics to model the susceptibility of HIV to protease inhibitors in current use, utilizing descriptions of the physico-chemical properties of mutated HIV proteases and 3D structural property descriptions for the protease inhibitors. The descriptions were correlated to the susceptibility data of 828 unique HIV protease variants for seven protease inhibitors in current use; the data set comprised 4792 protease-inhibitor combinations. Results: The model provided excellent predictability (R² = 0.92, Q² = 0.87) and identified general and specific features of drug resistance. The model's predictive ability was verified by external prediction in which the susceptibilities to each one of the seven inhibitors were omitted from the data set, one inhibitor at a time, and the data for the six remaining compounds were used to create new models. This analysis showed that the overall predictive ability for the omitted inhibitors was Q²(inhibitors) = 0.72. Conclusion: Our results show that a proteochemometric approach can provide generalized susceptibility predictions for new inhibitors. Our proteochemometric model can directly analyze inhibitor-protease interactions and facilitate treatment selection based on viral genotype. The model is available for public use at the HIV Drug Research Centre.
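The leave-one-inhibitor-out check described in this abstract can be illustrated with a minimal sketch. Everything below is an assumption for illustration only: the toy records, the one-feature least-squares model, and the helper names (`q2`, `leave_one_group_out`, `fit_line`, `predict_line`) are hypothetical and are not the authors' descriptors, model, or software. Q² is computed here as 1 - PRESS / SS_tot over the pooled held-out predictions, which is one common convention.

```python
def q2(y_true, y_pred):
    """Predictive Q2 = 1 - PRESS / SS_tot over held-out predictions."""
    mean_y = sum(y_true) / len(y_true)
    press = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - press / ss_tot


def fit_line(pairs):
    """Ordinary least squares for one feature (a stand-in for a real model)."""
    n = len(pairs)
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    slope = sxy / sxx
    return slope, my - slope * mx


def predict_line(model, x):
    slope, intercept = model
    return slope * x + intercept


def leave_one_group_out(records, fit, predict):
    """records: (group, x, y) tuples; each group is held out once."""
    y_true, y_pred = [], []
    for held in sorted({g for g, _, _ in records}):
        train = [(x, y) for g, x, y in records if g != held]
        test = [(x, y) for g, x, y in records if g == held]
        model = fit(train)  # the model never sees the held-out group
        for x, y in test:
            y_true.append(y)
            y_pred.append(predict(model, x))
    return q2(y_true, y_pred)


# Hypothetical toy data: three "inhibitors" (A, B, C), with y = 2x + 1 exactly.
records = [
    ("A", 0.0, 1.0), ("A", 1.0, 3.0),
    ("B", 2.0, 5.0), ("B", 3.0, 7.0),
    ("C", 4.0, 9.0), ("C", 5.0, 11.0),
]
q2_loio = leave_one_group_out(records, fit_line, predict_line)
```

Grouping the folds by inhibitor, rather than splitting observations at random, is what makes the estimate speak to compounds the model has never seen.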

Springer - Publisher Connector

Public Library of Science (PLOS)

A Look Inside HIV Resistance through Retroviral Protease Interaction Maps

Author: Aleksejs Kontijevskis
Felikss Mutulis
Ilze Mutule
Jan Komorowski
Jarl E. S Wikberg
John H Elder
Peteris Prusis
Ramona Petrovska
Sviatlana Yahorava
Publication venue: Public Library of Science
Publication date: 01/03/2007

Retroviruses affect a large number of species, from fish and birds to mammals and humans, with negative global socioeconomic impacts. Here the authors report and experimentally validate a novel approach for the analysis of the molecular networks that are involved in the recognition of substrates by retroviral proteases. Using multivariate analysis of the sequence-based physicochemical descriptions of 61 retroviral proteases comprising wild-type proteases, natural mutants, and drug-resistant forms of proteases from nine different viral species in relation to their ability to cleave 299 substrates, the authors mapped the physicochemical properties and cross-dependencies of the amino acids of the proteases and their substrates, which revealed a complex molecular interaction network of substrate recognition and cleavage. The approach allowed a detailed analysis of the molecular–chemical mechanisms involved in substrate cleavage by retroviral proteases.

Polypharmacology Modelling Using Proteochemometrics (PCM): Recent Methodological Developments, Applications to Target Families, and Future Prospects

Author: Ain Qurrat Ul
Bender Andreas
Cortes-Ciriano Isidro
IJzerman Adriaan P.
Lenselink Eelke B.
Malliavin Thérèse E.
Mendez-Lucio Oscar
Prusis Peteris
Subramanian Vigneshwari
van Westen Gerard J. P.
Wohlfahrt Gerd
Publication date: 01/01/2014

Peer reviewed

Leiden University Scholarly Publications

Helsingin yliopiston digitaalinen arkisto

Apollo (Cambridge)

Predictive proteochemometric models for kinases derived from 3D protein field-based descriptors

Author: Prusis Peteris
Subramanian Vigneshwari
Wohlfahrt Gerd
Xhaard Henri
Publication date: 01/01/2016

Proteochemometrics, a method that simultaneously uses protein and ligand description, was used to model the target-ligand interaction space of 95 kinases and 1572 inhibitors. To build models, we applied 3-dimensional field-based description of the receptors, which allows the visualization of receptor and ligand features relevant for activity within the spatial framework of the binding sites. Receptor fields were derived from knowledge-based potentials and Schrodinger's WaterMaps, while ligands were described by different 1D, 2D and 3D descriptors. Besides good interpretability, which is important for inhibitor design, the obtained proteochemometric models also predicted external test sets with active and inactive ligands or additional protein targets for ligands with more than 80% accuracy and AUCs above 0.8. Peer reviewed
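For readers unfamiliar with the two metrics quoted in this abstract, the following is a minimal sketch of how accuracy and AUC can be computed for an active/inactive classifier. The scores, labels, 0.5 threshold, and function names are hypothetical illustrations and are not taken from the paper. AUC is computed here as the fraction of (active, inactive) pairs in which the active ligand receives the higher score, with ties counted as one half.

```python
def accuracy(scores, labels, threshold=0.5):
    """Fraction of ligands whose thresholded score matches the label."""
    hits = sum((s >= threshold) == (l == 1) for s, l in zip(scores, labels))
    return hits / len(labels)


def auc(scores, labels):
    """Area under the ROC curve via pairwise ranking of actives vs inactives."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # active ranked above inactive
            elif p == n:
                wins += 0.5   # tie counts as half
    return wins / (len(pos) * len(neg))


# Hypothetical predictions for four ligands (1 = active, 0 = inactive).
scores = [0.9, 0.8, 0.3, 0.2]
labels = [1, 0, 1, 0]
acc_value = accuracy(scores, labels)
auc_value = auc(scores, labels)
```

Accuracy depends on the chosen threshold, whereas AUC depends only on how the scores rank actives relative to inactives, which is why the two are usually reported together.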

Helsingin yliopiston digitaalinen arkisto

Prediction of indirect interactions in proteins

Author: Maris Lapinsh
Peteris Prusis
Ramona Petrovska
Staffan Uhlén
Publication venue: BMC Bioinformatics

Background: Both direct and indirect interactions determine molecular recognition of ligands by proteins. Indirect interactions can be defined as effects on recognition controlled from distant sites in the proteins, e.g. by changes in protein conformation and mobility, whereas direct interactions occur in close proximity of the protein's amino acids and the ligand. Molecular recognition is traditionally studied using three-dimensional methods, but with such techniques it is difficult to predict the effects caused by mutational changes of amino acids located far away from the ligand-binding site. We recently developed an approach, proteochemometrics, to the study of molecular recognition that models the chemical effects involved in the recognition of ligands by proteins using statistical sampling and mathematical modelling. Results: A proteochemometric model was built, based on the interaction of a statistically designed protein library (melanocortin receptors) with three peptides, and used to predict which amino acids and sequence fragments are involved in direct and indirect ligand interactions. The model predictions were confirmed by directed mutagenesis. The predicted presumed direct interactions were in good agreement with previous three-dimensional studies of ligand recognition. However

CiteSeerX

Unbiased descriptor and parameter selection confirms the potential of proteochemometric modelling

Author: Freyhult Eva
Gustafsson Mats G.
Lapinsh Maris
Moulton Vincent
Prusis Peteris
Wikberg Jarl E. S.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2005

Background: Proteochemometrics is a new methodology that allows prediction of protein function directly from real interaction measurement data without the need for 3D structure information. Several reported proteochemometric models of ligand-receptor interactions have already yielded significant insights into various forms of bio-molecular interactions. The proteochemometric models are multivariate regression models that predict binding affinity for a particular combination of features of the ligand and protein. Although proteochemometric models have already offered interesting results in various studies, no detailed statistical evaluation of their average predictive power has been performed. In particular, variable subset selection performed to date has always relied on using all available examples, a situation also encountered in microarray gene expression data analysis. Results: A methodology for an unbiased evaluation of the predictive power of proteochemometric models was implemented and results from applying it to two of the largest proteochemometric data sets yet reported are presented. A double cross-validation loop procedure is used to estimate the expected performance of a given design method. The unbiased performance estimates (P2) obtained for the data sets that we consider confirm that properly designed single proteochemometric models have useful predictive power, but that a standard design based on cross validation may yield models with quite limited performance. The results also show that different commercial software packages employed for the design of proteochemometric models may yield very different and therefore misleading performance estimates. In addition, the differences in the models obtained in the double CV loop indicate that detailed chemical interpretation of a single proteochemometric model is uncertain when data sets are small.
Conclusion: The double CV loop employed offers unbiased performance estimates of a given proteochemometric modelling procedure, making it possible to identify cases where the proteochemometric design does not result in useful predictive models. Chemical interpretations of single proteochemometric models are uncertain and should instead be based on all the models selected in the double CV loop employed here.
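The double cross-validation loop described in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' procedure: the one-feature ridge model, the candidate penalties, the interleaved fold scheme, the toy data, and all function names are hypothetical. The inner loop chooses a model setting using training data only, and the outer loop scores that choice on data never used for selection, giving a P²-style estimate computed as 1 - PRESS / SS_tot.

```python
def k_folds(n, k):
    """Interleaved fold indices: fold i holds items i, i+k, i+2k, ..."""
    return [list(range(i, n, k)) for i in range(k)]


def fit_ridge(pairs, lam):
    """One-feature ridge regression on centred data (illustrative model)."""
    n = len(pairs)
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    slope = sxy / (sxx + lam)
    return slope, my - slope * mx


def predict(model, x):
    slope, intercept = model
    return slope * x + intercept


def cv_press(pairs, lam, k):
    """Inner loop: summed squared held-out error for one penalty value."""
    press = 0.0
    for fold in k_folds(len(pairs), k):
        held = set(fold)
        train = [p for i, p in enumerate(pairs) if i not in held]
        model = fit_ridge(train, lam)
        press += sum((pairs[i][1] - predict(model, pairs[i][0])) ** 2 for i in fold)
    return press


def double_cv(pairs, lams, outer_k, inner_k):
    """Outer loop: select lam on training data only, then score held-out data."""
    y_true, y_pred, chosen = [], [], []
    for fold in k_folds(len(pairs), outer_k):
        held = set(fold)
        train = [p for i, p in enumerate(pairs) if i not in held]
        best = min(lams, key=lambda lam: cv_press(train, lam, inner_k))
        chosen.append(best)
        model = fit_ridge(train, best)
        for i in fold:
            y_true.append(pairs[i][1])
            y_pred.append(predict(model, pairs[i][0]))
    mean_y = sum(y_true) / len(y_true)
    press = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - press / ss_tot, chosen


# Hypothetical noise-free toy data: y = 3x - 2.
data = [(float(x), 3.0 * x - 2.0) for x in range(12)]
p2, chosen = double_cv(data, [0.0, 1.0, 10.0], outer_k=3, inner_k=2)
```

Returning the setting chosen in each outer fold, alongside the performance estimate, makes it possible to see how much the selected models vary, which is the basis for the abstract's caution about interpreting any single model.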

Springer - Publisher Connector