7,292 research outputs found

Nonparametric inference of doubly stochastic Poisson process data via the kernel method

Author: Kou S. C.
Zhang Tingting
Publication venue: Institute of Mathematical Statistics
Publication date: 06/01/2011

Doubly stochastic Poisson processes, also known as Cox processes, frequently occur in various scientific fields. In this article, motivated primarily by analyzing Cox process data in biophysics, we propose a nonparametric kernel-based inference method. We conduct a detailed study, including an asymptotic analysis, of the proposed method, and provide guidelines for its practical use, introducing a fast and stable regression method for bandwidth selection. We apply our method to real photon arrival data from recent single-molecule biophysical experiments, investigating proteins' conformational dynamics. Our results show that conformational fluctuation is widely present in protein systems, and that the fluctuation covers a broad range of time scales, highlighting the dynamic and complex nature of proteins' structure. Comment: Published at http://dx.doi.org/10.1214/10-AOAS352 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)
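
As a rough illustration of the kernel idea in the abstract above, the following minimal sketch estimates an arrival rate from a list of event times with a Gaussian kernel. It is a generic kernel intensity estimator, not the authors' method: the abstract does not state their kernel, their estimator, or their regression-based bandwidth selection, so the function name `kernel_intensity`, the Gaussian kernel, and the hand-picked `bandwidth` are all assumptions made for illustration.

```python
import math


def kernel_intensity(arrival_times, t, bandwidth):
    """Gaussian-kernel estimate of the event rate at time t.

    Hypothetical helper: each observed arrival contributes one unit of
    mass, smeared over a Gaussian of standard deviation `bandwidth`.
    """
    if bandwidth <= 0:
        raise ValueError("bandwidth must be positive")
    norm = 1.0 / (bandwidth * math.sqrt(2.0 * math.pi))
    return sum(
        norm * math.exp(-0.5 * ((t - s) / bandwidth) ** 2)
        for s in arrival_times
    )


if __name__ == "__main__":
    # Made-up arrival times (arbitrary units), denser near t = 1.
    arrivals = [0.2, 0.9, 1.0, 1.1, 1.3, 2.8]
    for t in (0.0, 1.0, 2.0, 3.0):
        print(t, kernel_intensity(arrivals, t, bandwidth=0.3))
```

Because each kernel integrates to one, the estimated rate integrates to the number of observed arrivals. A smaller bandwidth follows fast fluctuations but is noisier, which is why bandwidth selection matters in practice.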

arXiv.org e-Print Archive

Crossref

Confab - Systematic generation of diverse low-energy conformers

Author: A Smellie
Anita R Maguire
CH Schwab
Christopher J Flynn
D Lagorce
DD Beusen
DL Theobald
Geoffrey R Hutchison
J Sadowski
KS Watts
MA Miteva
MJ Vainio
N Huang
N Sauton
NM O'Boyle
Noel M O'Boyle
O Sperandio
P Alfke
PCD Hawkins
S Makino
S Renner
SJ Weiner
TA Halgren
Tim Vandermeersch
W Kabsch
YV Borodina
Publication venue: BioMed Central
Publication date: 01/01/2011

BACKGROUND: Many computational chemistry analyses require the generation of conformers, either on the fly or in advance. We present Confab, an open source command-line application for the systematic generation of low-energy conformers according to a diversity criterion. RESULTS: Confab generates conformations using the 'torsion driving approach', which involves iterating systematically through a set of allowed torsion angles for each rotatable bond. Energy is assessed using the MMFF94 force field. Diversity is measured using the heavy-atom root-mean-square deviation (RMSD) relative to conformers already stored. We investigated the recovery of crystal structures for a dataset of 1000 ligands from the Protein Data Bank with fewer than 1 million conformations. Confab can recover 97% of the molecules to within 1.5 Å at a diversity level of 1.5 Å and an energy cutoff of 50 kcal/mol. CONCLUSIONS: Confab is available from http://confab.googlecode.com.
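
The diversity criterion described in the abstract above (keep a conformer only if its RMSD to every stored conformer exceeds a threshold, subject to an energy cutoff) can be sketched as follows. This is not Confab's code. It assumes conformers are already superimposed lists of heavy-atom coordinates, that the energy cutoff is measured relative to the lowest energy in the set, and that candidates are visited in order of increasing energy; the names `rmsd` and `select_diverse` are hypothetical.

```python
import math


def rmsd(a, b):
    """RMSD between two equally sized lists of (x, y, z) coordinates.

    Assumption: the two conformers are already aligned; no superposition
    is performed here.
    """
    if len(a) != len(b) or not a:
        raise ValueError("conformers must be non-empty and the same size")
    sq = sum(
        (x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2
        for (x1, y1, z1), (x2, y2, z2) in zip(a, b)
    )
    return math.sqrt(sq / len(a))


def select_diverse(conformers, energies, rmsd_cutoff, energy_window):
    """Return indices of conformers kept by a greedy diversity filter.

    A conformer is kept if its energy is within `energy_window` of the
    lowest energy and its RMSD to every already kept conformer is at
    least `rmsd_cutoff`.
    """
    if not conformers:
        return []
    e_min = min(energies)
    kept = []
    for idx in sorted(range(len(conformers)), key=lambda i: energies[i]):
        if energies[idx] - e_min > energy_window:
            break  # remaining candidates are even higher in energy
        if all(rmsd(conformers[idx], conformers[k]) >= rmsd_cutoff for k in kept):
            kept.append(idx)
    return kept


if __name__ == "__main__":
    # Toy two-atom "conformers" with made-up energies (kcal/mol).
    confs = [
        [(0, 0, 0), (1, 0, 0)],
        [(0, 0, 0), (1, 0.1, 0)],   # near-duplicate of the first
        [(0, 0, 0), (0, 3, 0)],     # clearly different
        [(0, 0, 0), (-3, 0, 0)],    # different but too high in energy
    ]
    print(select_diverse(confs, [0.0, 1.0, 2.0, 100.0], 1.5, 50.0))
```

With the values quoted in the abstract, the call would use `rmsd_cutoff=1.5` and `energy_window=50.0`. A larger RMSD cutoff yields fewer, more distinct conformers.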

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Cork Open Research Archive

Representation of target-bound drugs by computed conformers: implications for conformational libraries

Author: A Goede
A Kramer
A Smellie
Andrean Goede
AY Yang
BR Brooks
C Gille
C Oostenbrink
C Svensson
CA Lipinski
Christian Senger
D Bailey
DEJ Koshland
E Michalsky
E Perola
Elke Michalsky
GR Stockwell
HJ Woo
HM Berman
HM Vinkers
IL Alberts
J Bostrom
K Bernacki
L Parsons
M Feher
M Fullbeck
M Gerstein
M Recanatini
M Vieth
MA Penny
MC Nicklaus
MJ Betts
PC Weber
R Di Santo
Robert Preissner
S Gunther
S Lorenzen
Stefan Günther
T Klabunde
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: The increasing number of known protein structures provides valuable information about pharmaceutical targets. Drug binding sites are identifiable and suitable lead compounds can be proposed. The flexibility of ligands is a critical point for the selection of potential drugs. Since computed 3D structures of millions of compounds are available, the knowledge of their binding conformations would be a great benefit for the development of efficient screening methods. RESULTS: Integration of two public databases allowed superposition of conformers for 193 approved drugs with 5507 crystallised target-bound counterparts. The generation of 9600 drug conformers using an atomic force field was carried out to obtain an optimal coverage of the conformational space. Bioactive conformations are best described by a conformational ensemble: half of all drugs exhibit multiple active states, distributed over the entire range of the reachable energy and conformational space. A number of up to 100 conformers per drug enabled us to reproduce the bound states within a similarity threshold of 1.0 Å in 70% of all cases. This fraction rises to about 90% for smaller or average sized drugs. CONCLUSION: Single drugs adopt multiple bioactive conformations if they interact with different target proteins. Due to the structural diversity of binding sites they adopt conformations that are distributed over a broad conformational space and wide energy range. Since the majority of drugs is well represented by a predefined low number of conformers (up to 100) this procedure is a valuable method to compare compounds by three-dimensional features or for fast similarity searches starting with pharmacophores. The underlying 9600 generated drug conformers are downloadable from the Super Drug Web site [1]. All superpositions are visualised at the same source. Additional conformers (110,000) of 2400 classified WHO-drugs are also available.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

D2P2: database of disordered protein predictions

Author: Dosztányi Zsuzsanna
Ishida Takashi
Oates Matt E.
Romero Pedro
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2013

We present the Database of Disordered Protein Prediction (D2P2), available at http://d2p2.pro (including website source code). A battery of disorder predictors and their variants, VL-XT, VSL2b, PrDOS, PV2, Espritz and IUPred, were run on all protein sequences from 1765 complete proteomes (to be updated as more genomes are completed). Integrated with these results are all of the predicted (mostly structured) SCOP domains using the SUPERFAMILY predictor. These disorder/structure annotations together enable comparison of the disorder predictors with each other and examination of the overlap between disordered predictions and SCOP domains on a large scale. D2P2 will increase our understanding of the interplay between disorder and structure, the genomic distribution of disorder, and its evolutionary history. The parsed data are made available in a unified format for download as flat files or SQL tables either by genome, by predictor, or for the complete set. An interactive website provides a graphical view of each protein annotated with the SCOP domains and disordered regions from all predictors overlaid (or shown as a consensus). There are statistics and tools for browsing and comparing genomes and their disorder within the context of their position on the tree of life. © The Author(s) 2012. Published by Oxford University Press

Repository of the Academy's Library

Advances in image processing for single-particle analysis by electron cryomicroscopy and challenges ahead

Author: Acton S. T.
Carazo J. M.
Jiménez-Moreno A.
Majtner T.
Maluenda Niubó David
Mota J.
Sorzano C. O. S.
Tabassum N.
Vilas J. L.
Publication venue: Elsevier BV
Publication date: 16/06/2021

Electron cryomicroscopy (cryo-EM) is essential for the study and functional understanding of non-crystalline macromolecules such as proteins. These molecules cannot be imaged using X-ray crystallography or other popular methods. Cryo-EM has been successfully used to visualize molecules such as ribosomes, viruses, and ion channels. Obtaining structural models of these at various conformational states leads to insight into how these molecules function. Recent advances in imaging technology have given cryo-EM a scientific rebirth. Because of imaging improvements, image processing and analysis of the resultant images have increased the resolution such that molecular structures can be resolved at the atomic level. Cryo-EM is ripe with stimulating image processing challenges. In this article, we will touch on those most essential to building an accurate structural three-dimensional model from noisy projection images. Traditional approaches, such as k-means clustering for class averaging, will be provided as background. With this review, however, we will highlight fresh approaches from new and varied angles for each image processing sub-problem, including a 3D reconstruction method for asymmetric molecules using just two projection images and deep learning algorithms for automated particle picking.
Keywords: Cryo-electron microscopy, Single Particle Analysis, Image processing algorithms

Diposit Digital de la Universitat de Barcelona

A^2-Net: Molecular Structure Estimation from Cryo-EM Density Volumes

Author: Li Hongsheng
Shi Jiangping
Wang Zhe
Xu Kui
Zhang Qiangfeng Cliff
Publication date: 12/02/2019

Constructing molecular structural models from Cryo-Electron Microscopy (Cryo-EM) density volumes is the critical last step of structure determination by Cryo-EM technologies. Methods have evolved from manual construction by structural biologists to performing 6D translation-rotation searches, which are extremely compute-intensive. In this paper, we propose a learning-based method and formulate this problem as a vision-inspired 3D detection and pose estimation task. We develop a deep learning framework for amino acid determination in a 3D Cryo-EM density volume. We also design a sequence-guided Monte Carlo Tree Search (MCTS) to thread over the candidate amino acids to form the molecular structure. This framework achieves 91% coverage on our newly proposed dataset and takes only a few minutes for a typical structure with a thousand amino acids. Our method is hundreds of times faster and several times more accurate than existing automated solutions without any human intervention. Comment: 8 pages, 5 figures, 4 tables

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Using structural bioinformatics to investigate the impact of non synonymous SNPs and disease mutations: scope and limitations

Author: Reumers Joke
Rousseau Fréderic
Schymkowitz Joost
Publication venue: BioMed Central
Publication date: 01/01/2009

BACKGROUND: Linking structural effects of mutations to functional outcomes is a major issue in structural bioinformatics, and many tools and studies have shown that specific structural properties such as stability and residue burial can be used to distinguish neutral variations and disease associated mutations. RESULTS: We have investigated 39 structural properties on a set of SNPs and disease mutations from the Uniprot Knowledge Base that could be mapped on high quality crystal structures and show that none of these properties can be used as a sole classification criterion to separate the two data sets. Furthermore, we have reviewed the annotation process from mutation to result and identified the liabilities in each step. CONCLUSION: Although excellent annotation results of various research groups underline the great potential of using structural bioinformatics to investigate the mechanisms underlying disease, the interpretation of such annotations cannot always be extrapolated to proteome wide variation studies. Difficulties for large-scale studies can be found both on the technical level, i.e. the scarcity of data and the incompleteness of the structural tool suites, and on the conceptual level, i.e. the correct interpretation of the results in a cellular context.

Lirias

CiteSeerX

Crossref

Springer - Publisher Connector

PubMed Central

Do Deep Learning Methods Really Perform Better in Molecular Conformation Generation?

Author: Gao Zhifeng
Ke Guolin
Wei Zhewei
Zheng Hang
Zhou Gengmo
Publication date: 14/02/2023

Molecular conformation generation (MCG) is a fundamental and important problem in drug discovery. Many traditional methods have been developed to solve the MCG problem, such as systematic searching, model-building, random searching, distance geometry, molecular dynamics, and Monte Carlo methods. However, they have some limitations depending on the molecular structures. Recently, many deep learning based MCG methods have appeared, which claim to largely outperform the traditional methods. However, to our surprise, we design a simple and cheap (parameter-free) algorithm based on the traditional methods and find that it is comparable to or even outperforms deep learning based MCG methods on the widely used GEOM-QM9 and GEOM-Drugs benchmarks. In particular, our algorithm is simply the clustering of RDKit-generated conformations. We hope our findings can help the community to revise the deep learning methods for MCG. The code of the proposed algorithm can be found at https://gist.github.com/ZhouGengmo/5b565f51adafcd911c0bc115b2ef027c

arXiv.org e-Print Archive

Sequence-based prediction for vaccine strain selection and identification of antigenic variability in foot-and-mouth disease virus

Author: A Bastos
A Samuel
A Thomas
AD Bastos
ADS Bastos
AFY Poon
AJ Drummond
B Baxt
B Shapiro
Belinda Blignaut
C Bolwell
D Paton
Daniel T. Haydon
DJ Smith
E Beck
Elizabeth E. Fry
Elizabeth Rieder
F Yates
Francois F. Maree
Hester G. O'Neill
HG van Rensburg
J Crowther
J Felsenstein
J Holland
J Kitson
Jacques Theron
Jan J. Esterhuysen
JC Saiz
Louise Matthews
M Lee
M Rweyemamu
M Suchard
M-S Lee
Mark M. Tanaka
MG Mateu
N Knowles
N Mattion
P Barnett
Pamela Opperman
R Boom
R Garten
RA Fisher
Richard Reeve
S Holm
S Lea
S Parida
Tjaart A. P. de Beer
W Vosloo
Wilna Vosloo
Y-C Liao
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2010

Identifying when past exposure to an infectious disease will protect against newly emerging strains is central to understanding the spread and the severity of epidemics, but the prediction of viral cross-protection remains an important unsolved problem. For foot-and-mouth disease virus (FMDV) research in particular, improved methods for predicting this cross-protection are critical for predicting the severity of outbreaks within endemic settings where multiple serotypes and subtypes commonly co-circulate, as well as for deciding whether appropriate vaccine(s) exist and how much they could mitigate the effects of any outbreak. To identify antigenic relationships and their predictors, we used linear mixed effects models to account for variation in pairwise cross-neutralization titres using only viral sequences and structural data. We identified those substitutions in surface-exposed structural proteins that are correlates of loss of cross-reactivity. These allowed prediction of both the best vaccine match for any single virus and the breadth of coverage of new vaccine candidates from their capsid sequences as effectively as or better than serology. Sub-sequences chosen by the model-building process all contained sites that are known epitopes on other serotypes. Furthermore, for the SAT1 serotype, for which epitopes have never previously been identified, we provide strong evidence - by controlling for phylogenetic structure - for the presence of three epitopes across a panel of viruses and quantify the relative significance of some individual residues in determining cross-neutralization. Identifying and quantifying the importance of sites that predict viral strain cross-reactivity not just for single viruses but across entire serotypes can help in the design of vaccines with better targeting and broader coverage. These techniques can be generalized to any infectious agents where cross-reactivity assays have been carried out. 
As the parameterization uses pre-existing datasets, this approach quickly and cheaply increases both our understanding of antigenic relationships and our power to control disease.

Public Library of Science (PLOS)

Crossref

North-West University Institutional Repository

Directory of Open Access Journals

PubMed Central

Enlighten

UPSpace at the University of Pretoria