4,694 research outputs found
Protein-Protein Docking with F2Dock 2.0 and GB-Rerank
Rezaul Chowdhury is with UT Austin; Muhibur Rasheed is with UT Austin; Maysam Moussalem is with UT Austin; Donald Keidel is with The Scripps Research Institute; Arthur Olson is with The Scripps Research Institute; Michel Sanner is with The Scripps Research Institute; Chandrajit Bajaj is with The Scripps Research Institute.Motivation -- Computational simulation of protein-protein docking can expedite the process of molecular modeling and drug discovery. This paper reports on our new F2 Dock protocol which improves the state of the art in initial stage rigid body exhaustive docking search, scoring and ranking by introducing improvements in the shape-complementarity and electrostatics affinity functions, a new knowledge-based interface propensity term with FFT formulation, a set of novel knowledge-based filters and finally a solvation energy (GBSA) based reranking technique. Our algorithms are based on highly efficient data structures including the dynamic packing grids and octrees which significantly speed up the computations and also provide guaranteed bounds on approximation error. Results -- The improved affinity functions show superior performance compared to their traditional counterparts in finding correct docking poses at higher ranks. We found that the new filters and the GBSA based reranking individually and in combination significantly improve the accuracy of docking predictions with only minor increase in computation time. We compared F2 Dock 2.0 with ZDock 3.0.2 and found improvements over it, specifically among 176 complexes in ZLab Benchmark 4.0, F2 Dock 2.0 finds a near-native solution as the top prediction for 22 complexes; where ZDock 3.0.2 does so for 13 complexes. F2 Dock 2.0 finds a near-native solution within the top 1000 predictions for 106 complexes as opposed to 104 complexes for ZDock 3.0.2. However, there are 17 and 15 complexes where F2 Dock 2.0 finds a solution but ZDock 3.0.2 does not and vice versa; which indicates that the two docking protocols can also complement each other. Availability -- The docking protocol has been implemented as a server with a graphical client (TexMol) which allows the user to manage multiple docking jobs, and visualize the docked poses and interfaces. Both the server and client are available for download. Server: http://www.cs.utexas.edu/~bajaj/cvc/soft​ware/f2dock.shtml. Client: http://www.cs.utexas.edu/~bajaj/cvc/soft​ware/f2dockclient.shtml.The research of C.B., R.C., M.M., and M.R. of University of Texas, was supported in part by National Science Foundation (NSF) grant CNS-0540033, and grants from the National Institutes of Health (NIH) R01-GM074258, R01-GM073087, R01-EB004873. The research of M.M. was additionally supported by an NSF Graduate Research Fellowship. The research of M.S. and A.O. of TSRI was supported in part by a subcontract on NIH grant R01-GM073087. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.Computer Science
HARES: an efficient method for first-principles electronic structure calculations of complex systems
We discuss our new implementation of the Real-space Electronic Structure
method for studying the atomic and electronic structure of infinite periodic as
well as finite systems, based on density functional theory. This improved
version which we call HARES (for High-performance-fortran Adaptive grid
Real-space Electronic Structure) aims at making the method widely applicable
and efficient, using high performance Fortran on parallel architectures. The
scaling of various parts of a HARES calculation is analyzed and compared to
that of plane-wave based methods. The new developments that lead to enhanced
performance, and their parallel implementation, are presented in detail. We
illustrate the application of HARES to the study of elemental crystalline
solids, molecules and complex crystalline materials, such as blue bronze and
zeolites.Comment: 17 two-column pages, including 9 figures, 5 tables. To appear in
Computer Physics Communications. Several minor revisions based on feedbac
Recommended from our members
Predicting multibody assembly of proteins
textThis thesis addresses the multi-body assembly (MBA) problem in the context of protein assemblies. [...] In this thesis, we chose the protein assembly domain because accurate and reliable computational modeling, simulation and prediction of such assemblies would clearly accelerate discoveries in understanding of the complexities of metabolic pathways, identifying the molecular basis for normal health and diseases, and in the designing of new drugs and other therapeutics. [...] [We developed] F²Dock (Fast Fourier Docking) which includes a multi-term function which includes both a statistical thermodynamic approximation of molecular free energy as well as several of knowledge-based terms. Parameters of the scoring model were learned based on a large set of positive/negative examples, and when tested on 176 protein complexes of various types, showed excellent accuracy in ranking correct configurations higher (F² Dock ranks the correcti solution as the top ranked one in 22/176 cases, which is better than other unsupervised prediction software on the same benchmark). Most of the protein-protein interaction scoring terms can be expressed as integrals over the occupied volume, boundary, or a set of discrete points (atom locations), of distance dependent decaying kernels. We developed a dynamic adaptive grid (DAG) data structure which computes smooth surface and volumetric representations of a protein complex in O(m log m) time, where m is the number of atoms assuming that the smallest feature size h is [theta](r[subscript max]) where r[subscript max] is the radius of the largest atom; updates in O(log m) time; and uses O(m)memory. We also developed the dynamic packing grids (DPG) data structure which supports quasi-constant time updates (O(log w)) and spherical neighborhood queries (O(log log w)), where w is the word-size in the RAM. DPG and DAG together results in O(k) time approximation of scoring terms where k << m is the size of the contact region between proteins. [...] [W]e consider the symmetric spherical shell assembly case, where multiple copies of identical proteins tile the surface of a sphere. Though this is a restricted subclass of MBA, it is an important one since it would accelerate development of drugs and antibodies to prevent viruses from forming capsids, which have such spherical symmetry in nature. We proved that it is possible to characterize the space of possible symmetric spherical layouts using a small number of representative local arrangements (called tiles), and their global configurations (tiling). We further show that the tilings, and the mapping of proteins to tilings on arbitrary sized shells is parameterized by 3 discrete parameters and 6 continuous degrees of freedom; and the 3 discrete DOF can be restricted to a constant number of cases if the size of the shell is known (in terms of the number of protein n). We also consider the case where a coarse model of the whole complex of proteins are available. We show that even when such coarse models do not show atomic positions, they can be sufficient to identify a general location for each protein and its neighbors, and thereby restricts the configurational space. We developed an iterative refinement search protocol that leverages such multi-resolution structural data to predict accurate high resolution model of protein complexes, and successfully applied the protocol to model gp120, a protein on the spike of HIV and currently the most feasible target for anti-HIV drug design.Computer Science
The impact of accelerator processors for high-throughput molecular modeling and simulation
Accepted versio
Three real-space discretization techniques in electronic structure calculations
A characteristic feature of the state-of-the-art of real-space methods in
electronic structure calculations is the diversity of the techniques used in
the discretization of the relevant partial differential equations. In this
context, the main approaches include finite-difference methods, various types
of finite-elements and wavelets. This paper reports on the results of several
code development projects that approach problems related to the electronic
structure using these three different discretization methods. We review the
ideas behind these methods, give examples of their applications, and discuss
their similarities and differences.Comment: 39 pages, 10 figures, accepted to a special issue of "physica status
solidi (b) - basic solid state physics" devoted to the CECAM workshop "State
of the art developments and perspectives of real-space electronic structure
techniques in condensed matter and molecular physics". v2: Minor stylistic
and typographical changes, partly inspired by referee comment
O(N) methods in electronic structure calculations
Linear scaling methods, or O(N) methods, have computational and memory
requirements which scale linearly with the number of atoms in the system, N, in
contrast to standard approaches which scale with the cube of the number of
atoms. These methods, which rely on the short-ranged nature of electronic
structure, will allow accurate, ab initio simulations of systems of
unprecedented size. The theory behind the locality of electronic structure is
described and related to physical properties of systems to be modelled, along
with a survey of recent developments in real-space methods which are important
for efficient use of high performance computers. The linear scaling methods
proposed to date can be divided into seven different areas, and the
applicability, efficiency and advantages of the methods proposed in these areas
is then discussed. The applications of linear scaling methods, as well as the
implementations available as computer programs, are considered. Finally, the
prospects for and the challenges facing linear scaling methods are discussed.Comment: 85 pages, 15 figures, 488 references. Resubmitted to Rep. Prog. Phys
(small changes
- …