374 research outputs found
p3d – Python module for structural bioinformatics
<p>Abstract</p> <p>Background</p> <p>High-throughput bioinformatic analysis tools are needed to mine the large amount of structural data via knowledge based approaches. The development of such tools requires a robust interface to access the structural data in an easy way. For this the Python scripting language is the optimal choice since its philosophy is to write an understandable source code.</p> <p>Results</p> <p>p3d is an object oriented Python module that adds a simple yet powerful interface to the Python interpreter to process and analyse three dimensional protein structure files (PDB files). p3d's strength arises from the combination of a) very fast spatial access to the structural data due to the implementation of a binary space partitioning (BSP) tree, b) set theory and c) functions that allow to combine a and b and that use human readable language in the search queries rather than complex computer language. All these factors combined facilitate the rapid development of bioinformatic tools that can perform quick and complex analyses of protein structures.</p> <p>Conclusion</p> <p>p3d is the perfect tool to quickly develop tools for structural bioinformatics using the Python scripting language.</p
Quantitative test of the barrier nucleosome model for statistical positioning of nucleosomes up- and downstream of transcription start sites
The positions of nucleosomes in eukaryotic genomes determine which parts of
the DNA sequence are readily accessible for regulatory proteins and which are
not. Genome-wide maps of nucleosome positions have revealed a salient pattern
around transcription start sites, involving a nucleosome-free region (NFR)
flanked by a pronounced periodic pattern in the average nucleosome density.
While the periodic pattern clearly reflects well-positioned nucleosomes, the
positioning mechanism is less clear. A recent experimental study by Mavrich et
al. argued that the pattern observed in S. cerevisiae is qualitatively
consistent with a `barrier nucleosome model', in which the oscillatory pattern
is created by the statistical positioning mechanism of Kornberg and Stryer. On
the other hand, there is clear evidence for intrinsic sequence preferences of
nucleosomes, and it is unclear to what extent these sequence preferences affect
the observed pattern. To test the barrier nucleosome model, we quantitatively
analyze yeast nucleosome positioning data both up- and downstream from NFRs.
Our analysis is based on the Tonks model of statistical physics which
quantifies the interplay between the excluded-volume interaction of nucleosomes
and their positional entropy. We find that although the typical patterns on the
two sides of the NFR are different, they are both quantitatively described by
the same physical model, with the same parameters, but different boundary
conditions. The inferred boundary conditions suggest that the first nucleosome
downstream from the NFR (the +1 nucleosome) is typically directly positioned
while the first nucleosome upstream is statistically positioned via a
nucleosome-repelling DNA region. These boundary conditions, which can be
locally encoded into the genome sequence, significantly shape the statistical
distribution of nucleosomes over a range of up to ~1000 bp to each side.Comment: includes supporting materia
Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures
DNA-binding proteins such as transcription factors use DNA-binding domains (DBDs) to bind to specific sequences in the genome to initiate many important biological functions. Accurate prediction of such target sequences, often represented by position weight matrices (PWMs), is an important step to understand many biological processes. Recent studies have shown that knowledge-based potential functions can be applied on protein-DNA co-crystallized structures to generate PWMs that are considerably consistent with experimental data. However, this success has not been extended to DNA-binding proteins lacking co-crystallized structures. This study aims at investigating the possibility of predicting the DNA sequences bound by DNA-binding proteins from the proteins' unbound structures (structures of the unbound state). Given an unbound query protein and a template complex, the proposed method first employs structure alignment to generate synthetic protein-DNA complexes for the query protein. Once a complex is available, an atomic-level knowledge-based potential function is employed to predict PWMs characterizing the sequences to which the query protein can bind. The evaluation of the proposed method is based on seven DNA-binding proteins, which have structures of both DNA-bound and unbound forms for prediction as well as annotated PWMs for validation. Since this work is the first attempt to predict target sequences of DNA-binding proteins from their unbound structures, three types of structural variations that presumably influence the prediction accuracy were examined and discussed. Based on the analyses conducted in this study, the conformational change of proteins upon binding DNA was shown to be the key factor. This study sheds light on the challenge of predicting the target DNA sequences of a protein lacking co-crystallized structures, which encourages more efforts on the structure alignment-based approaches in addition to docking- and homology modeling-based approaches for generating synthetic complexes
From Nonspecific DNA–Protein Encounter Complexes to the Prediction of DNA–Protein Interactions
©2009 Gao, Skolnick. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.doi:10.1371/journal.pcbi.1000341DNA–protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA–protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA–protein interaction modes without knowing its specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA–protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA–protein interaction modes exhibit some similarity to specific DNA–protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests, it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that our method performs satisfactorily on protein models whose root-mean-square Ca deviation from native is up to 5 Å from their native structures. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence. The similarity between the specific DNA–protein interaction mode and nonspecific interaction modes may reflect an important sampling step in search of its specific DNA targets by a DNA-binding protein
Gauge symmetry and W-algebra in higher derivative systems
The problem of gauge symmetry in higher derivative Lagrangian systems is
discussed from a Hamiltonian point of view. The number of independent gauge
parameters is shown to be in general {\it{less}} than the number of independent
primary first class constraints, thereby distinguishing it from conventional
first order systems. Different models have been considered as illustrative
examples. In particular we show a direct connection between the gauge symmetry
and the W-algebra for the rigid relativistic particle.Comment: 1+22 pages, 1 figure, LaTeX, v2; title changed, considerably expanded
version with new results, to appear in JHE
A mathematical and computational review of Hartree-Fock SCF methods in Quantum Chemistry
We present here a review of the fundamental topics of Hartree-Fock theory in
Quantum Chemistry. From the molecular Hamiltonian, using and discussing the
Born-Oppenheimer approximation, we arrive to the Hartree and Hartree-Fock
equations for the electronic problem. Special emphasis is placed in the most
relevant mathematical aspects of the theoretical derivation of the final
equations, as well as in the results regarding the existence and uniqueness of
their solutions. All Hartree-Fock versions with different spin restrictions are
systematically extracted from the general case, thus providing a unifying
framework. Then, the discretization of the one-electron orbitals space is
reviewed and the Roothaan-Hall formalism introduced. This leads to a exposition
of the basic underlying concepts related to the construction and selection of
Gaussian basis sets, focusing in algorithmic efficiency issues. Finally, we
close the review with a section in which the most relevant modern developments
(specially those related to the design of linear-scaling methods) are commented
and linked to the issues discussed. The whole work is intentionally
introductory and rather self-contained, so that it may be useful for non
experts that aim to use quantum chemical methods in interdisciplinary
applications. Moreover, much material that is found scattered in the literature
has been put together here to facilitate comprehension and to serve as a handy
reference.Comment: 64 pages, 3 figures, tMPH2e.cls style file, doublesp, mathbbol and
subeqn package
Azimuthal anisotropy and correlations at large transverse momenta in and Au+Au collisions at = 200 GeV
Results on high transverse momentum charged particle emission with respect to
the reaction plane are presented for Au+Au collisions at =
200 GeV. Two- and four-particle correlations results are presented as well as a
comparison of azimuthal correlations in Au+Au collisions to those in at
the same energy. Elliptic anisotropy, , is found to reach its maximum at
GeV/c, then decrease slowly and remain significant up to
-- 10 GeV/c. Stronger suppression is found in the back-to-back
high- particle correlations for particles emitted out-of-plane compared to
those emitted in-plane. The centrality dependence of at intermediate
is compared to simple models based on jet quenching.Comment: 4 figures. Published version as PRL 93, 252301 (2004
Azimuthal anisotropy in Au+Au collisions at sqrtsNN = 200 GeV
The results from the STAR Collaboration on directed flow (v_1), elliptic flow
(v_2), and the fourth harmonic (v_4) in the anisotropic azimuthal distribution
of particles from Au+Au collisions at sqrtsNN = 200 GeV are summarized and
compared with results from other experiments and theoretical models. Results
for identified particles are presented and fit with a Blast Wave model.
Different anisotropic flow analysis methods are compared and nonflow effects
are extracted from the data. For v_2, scaling with the number of constituent
quarks and parton coalescence is discussed. For v_4, scaling with v_2^2 and
quark coalescence is discussed.Comment: 26 pages. As accepted by Phys. Rev. C. Text rearranged, figures
modified, but data the same. However, in Fig. 35 the hydro calculations are
corrected in this version. The data tables are available at
http://www.star.bnl.gov/central/publications/ by searching for "flow" and
then this pape
- …