Search CORE

249 research outputs found

The utility of geometrical and chemical restraint information extracted from predicted ligand-binding sites in protein structure refinement

Author: Brylinski Michal
Lee Seung Yup
Skolnick Jeffrey
Zhou Hongyi
Publication venue: LSU Digital Commons
Publication date: 01/03/2011
Field of study

Exhaustive exploration of molecular interactions at the level of complete proteomes requires efficient and reliable computational approaches to protein function inference. Ligand docking and ranking techniques show considerable promise in their ability to quantify the interactions between proteins and small molecules. Despite the advances in the development of docking approaches and scoring functions, the genome-wide application of many ligand docking/screening algorithms is limited by the quality of the binding sites in theoretical receptor models constructed by protein structure prediction. In this study, we describe a new template-based method for the local refinement of ligand-binding regions in protein models using remotely related templates identified by threading. We designed a Support Vector Regression (SVR) model that selects correct binding site geometries in a large ensemble of multiple receptor conformations. The SVR model employs several scoring functions that impose geometrical restraints on the Cα positions, account for the specific chemical environment within a binding site and optimize the interactions with putative ligands. The SVR score is well correlated with the RMSD from the native structure; in 47% (70%) of the cases, the Pearson\u27s correlation coefficient is \u3e0.5 (\u3e0.3). When applied to weakly homologous models, the average heavy atom, local RMSD from the native structure of the top-ranked (best of top five) binding site geometries is 3.1. Å (2.9. Å) for roughly half of the targets; this represents a 0.1 (0.3). Å average improvement over the original predicted structure. Focusing on the subset of strongly conserved residues, the average heavy atom RMSD is 2.6. Å (2.3. Å). Furthermore, we estimate the upper bound of template-based binding site refinement using only weakly related proteins to be ∼2.6. Å RMSD. This value also corresponds to the plasticity of the ligand-binding regions in distant homologues. The Binding Site Refinement (BSR) approach is available to the scientific community as a web server that can be accessed at http://cssb.biology.gatech.edu/bsr/. © 2010 Elsevier Inc

PubMed Central

Louisiana State University

One Decade of Development and Evolution of MicroRNA Target Prediction Algorithms

Author: Alexiou
Altuvia
Baek
Bandyopadhyay
Barreau
Bartel
Bartel
Betel
Betel
Cai
Chandra
Chi
Didiano
Elisa Ficarra
Enright
Friedman
Friedman
Gaidatzis
Garcia
Griffiths-Jones
Grimson
Guo
Hafner
Hsu
Jacobsen
Ji
Jin
John
Kertesz
Khan
Kim
Kiriakidou
Kruger
Kumar
Lagos-Quintana
Lall
Lee
Lee
Lewis
Lim
Lund
Maragkakis
Mendes
Min
Miranda
Muckstein
Pandey
Papadopoulos
Paula H. Reyes∼Herrera
Reyes∼Herrera
Saetrom
Sandberg
Schmidt
Selbach
Stefani
Sturm
Tan
Thomas
Thomson
Vergoulis
Wang
Watanabe
Wen
Witkos
Xiao
Yan
Yang
Yang
Yousef
Zhao
Publication venue: Elsevier BV:PO Box 211, 1000 AE Amsterdam Netherlands:011 31 20 4853757, 011 31 20 4853642, 011 31 20 4853641, EMAIL: [email protected], INTERNET: http://www.elsevier.nl, Fax: 011 31 20 4853598
Publication date: 01/01/2012
Field of study

Nearly two decades have passed since the publication of the first study reporting the discovery of microRNAs (miRNAs). The key role of miRNAs in post-transcriptional gene regulation led to the performance of an increasing number of studies focusing on origins, mechanisms of action and functionality of miRNAs. In order to associate each miRNA to a specific functionality it is essential to unveil the rules that govern miRNA action. Despite the fact that there has been significant improvement exposing structural characteristics of the miRNA-mRNA interaction, the entire physical mechanism is not yet fully understood. In this respect, the development of computational algorithms for miRNA target prediction becomes increasingly important. This manuscript summarizes the research done on miRNA target prediction. It describes the experimental data currently available and used in the field and presents three lines of computational approaches for target prediction. Finally, the authors put forward a number of considerations regarding current challenges and future direction

Elsevier - Publisher Connector

Crossref

PubMed Central

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

PORTO Publications Open Repository TOrino

Prediction of Promiscuous P-Glycoprotein Inhibition Using a Novel Machine Learning Scheme

Author: A Golbraikh
A Palmeira
A Ravna
A Ward
AH Schinkel
Arto Urtti
AT Fojo
B Greiner
C Chang
C Chang
C Globisch
C Klein
C Martin
C-N Chen
DA Evans
DAP Gutmann
DB Solit
E Benfenati
E Crowley
F Thiebaut
G Chang
G Jansen
G Szakacs
GD Leonard
GF Ecker
H Li
H Zhou
HM Coley
HM Zalloum
Hong-Bin Chen
I Kolossvary
I Pajeva
IK Pajeva
J Zou
JE Penzotti
JH Lin
K Pleban
L Chen
MA Biamonte
Max K. Leong
MK Leong
MK Leong
MK Leong
MK Leong
MM Gottesman
MVS Varma
N Foloppe
P Bahadduri
P Chiba
P Gramatica
P Labrie
P Labrie
P Mistry
PP Roy
PW Sprague
R Hiessböck
RB Kim
S Ambudkar
S Doppenschmitt
S Ekins
S Ekins
S Ekins
S Ekins
S Shukla
SG Aller
SV Ambudkar
T Bansal
T Langer
T Stockner
T Taldone
TA Halgren
TW Loo
W Löscher
W-X Li
WC Still
Y Kurogi
Y-H Wang
YS Kim
Yu-Hsuan Shih
Publication venue: Public Library of Science
Publication date: 16/03/2012
Field of study

BACKGROUND: P-glycoprotein (P-gp) is an ATP-dependent membrane transporter that plays a pivotal role in eliminating xenobiotics by active extrusion of xenobiotics from the cell. Multidrug resistance (MDR) is highly associated with the over-expression of P-gp by cells, resulting in increased efflux of chemotherapeutical agents and reduction of intracellular drug accumulation. It is of clinical importance to develop a P-gp inhibition predictive model in the process of drug discovery and development. METHODOLOGY/PRINCIPAL FINDINGS: An in silico model was derived to predict the inhibition of P-gp using the newly invented pharmacophore ensemble/support vector machine (PhE/SVM) scheme based on the data compiled from the literature. The predictions by the PhE/SVM model were found to be in good agreement with the observed values for those structurally diverse molecules in the training set (n = 31, r(2) = 0.89, q(2) = 0.86, RMSE = 0.40, s = 0.28), the test set (n = 88, r(2) = 0.87, RMSE = 0.39, s = 0.25) and the outlier set (n = 11, r(2) = 0.96, RMSE = 0.10, s = 0.05). The generated PhE/SVM model also showed high accuracy when subjected to those validation criteria generally adopted to gauge the predictivity of a theoretical model. CONCLUSIONS/SIGNIFICANCE: This accurate, fast and robust PhE/SVM model that can take into account the promiscuous nature of P-gp can be applied to predict the P-gp inhibition of structurally diverse compounds that otherwise cannot be done by any other methods in a high-throughput fashion to facilitate drug discovery and development by designing drug candidates with better metabolism profile

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

MHC2SKpan: a novel kernel based approach for pan-specific MHC class II peptide binding prediction

Author
Publication venue: BioMed Central
Publication date: 16/10/2013
Field of study

Springer - Publisher Connector

A pipeline for improved QSAR analysis of peptides: physiochemical property parameter selection via BMSF, near-neighbor sample selection via semivariogram, and weighted SVR regression and prediction

Author: Bai Lianyang
Chen Yuan
Dai Zhijun
Wang Haiyan
Wang Lifeng
Yuan Zheming
Publication venue
Publication date: 01/01/2014
Field of study

In this paper, we present a pipeline to perform improved QSAR analysis of peptides. The modeling involves a double selection procedure that first performs feature selection and then conducts sample selection before the final regression analysis. Five hundred and thirty-one physicochemical property parameters of amino acids were used as descriptors to characterize the structure of peptides. These high-dimensional descriptors then go through a feature selection process given by the Binary Matrix Shuffling Filter (BMSF) to obtain a set of important low dimensional features. Each descriptor that passed the BMSF filtering also receives a weight defined through its contribution to reduce the estimation error. These selected features were served as the predictors for subsequent sample selection and modeling. Based on the weighted Euclidean distances between samples, a common range was determined with high-dimensional semivariogram and then used as a threshold to select the near-neighbor samples from the training set. For each sample to be predicted, the QSAR model was established using SVR with the weighted, selected features based on the exclusive set of near-neighbor training samples. Prediction was conducted for each test sample accordingly. The performances of this pipeline are tested with the QSAR analysis of angiotensin-converting enzyme (ACE) inhibitors and HLA-A*0201 data sets. Improved prediction accuracy was obtained in both applications. This pipeline can optimize the QSAR modeling from both the feature selection and sample selection perspectives. This leads to improved accuracy over single selection methods. We expect this pipeline to have extensive application prospect in the field of regression prediction

K-State Research Exchange

Machine Learning Small Molecule Properties in Drug Discovery

Author: Arroniz Carlos
De Fabritiis Gianni
Majewski Maciej
Schapin Nikolai
Varela Alejandro
Publication venue
Publication date: 02/08/2023
Field of study

Machine learning (ML) is a promising approach for predicting small molecule properties in drug discovery. Here, we provide a comprehensive overview of various ML methods introduced for this purpose in recent years. We review a wide range of properties, including binding affinities, solubility, and ADMET (Absorption, Distribution, Metabolism, Excretion, and Toxicity). We discuss existing popular datasets and molecular descriptors and embeddings, such as chemical fingerprints and graph-based neural networks. We highlight also challenges of predicting and optimizing multiple properties during hit-to-lead and lead optimization stages of drug discovery and explore briefly possible multi-objective optimization techniques that can be used to balance diverse properties while optimizing lead candidates. Finally, techniques to provide an understanding of model predictions, especially for critical decision-making in drug discovery are assessed. Overall, this review provides insights into the landscape of ML models for small molecule property predictions in drug discovery. So far, there are multiple diverse approaches, but their performances are often comparable. Neural networks, while more flexible, do not always outperform simpler models. This shows that the availability of high-quality training data remains crucial for training accurate models and there is a need for standardized benchmarks, additional performance metrics, and best practices to enable richer comparisons between the different techniques and models that can shed a better light on the differences between the many techniques.Comment: 46 pages, 1 figur

arXiv.org e-Print Archive