107 research outputs found
Detecting DNA-binding helix–turn–helix structural motifs using sequence and structure information
In this work, we analyse the potential for using structural knowledge to improve the detection of the DNA-binding helix–turn–helix (HTH) motif from sequence. Starting from a set of DNA-binding protein structures that include a functional HTH motif and have no apparent sequence similarity to each other, two different libraries of hidden Markov models (HMMs) were built. One library included sequence models of whole DNA-binding domains, which incorporate the HTH motif, the second library included shorter models of ‘partial’ domains, representing only the fraction of the domain that corresponds to the functionally relevant HTH motif itself. The libraries were scanned against a dataset of protein sequences, some containing the HTH motifs, others not. HMM predictions were compared with the results obtained from a previously published structure-based method and subsequently combined with it. The combined method proved more effective than either of the single-featured approaches, showing that information carried by motif sequences and motif structures are to some extent complementary and can successfully be used together for the detection of DNA-binding HTHs in proteins of unknown function
Modeling Amyloid Beta Peptide Insertion into Lipid Bilayers
Inspired by recent suggestions that the Alzheimer's amyloid beta peptide (A
beta) can insert into cell membranes and form harmful ion channels, we model
insertion of the 40 and 42 residue forms of the peptide into cell membranes
using a Monte Carlo code which is specific at the amino acid level. We examine
insertion of the regular A-beta peptide as well as mutants causing familial
Alzheimer's disease, and find that all but one of the mutants change the
insertion behavior by causing the peptide to spend more simulation steps in
only one leaflet of the bilayer. We also find that A-beta 42, because of the
extra hydrophobic residues relative to A-beta 40, is more likely to adopt this
conformation than A-beta 40 in both wild-type and mutant forms. We argue
qualitatively why these effects happen. Here, we present our results and
develop the hypothesis that this partial insertion increases the probability of
harmful channel formation. This hypothesis can partly explain why these
mutations are neurotoxic simply due to peptide insertion behavior. We further
apply this model to various artificial A-beta mutants which have been examined
experimentally, and offer testable experimental predictions contrasting the
roles of aggregation and insertion with regard to toxicity of A-beta mutants.
These can be used through further experiments to test our hypothesis.Comment: 14 pages; 8 figures; 2nd revisio
Transcriptome bioinformatic analysis identifies potential therapeutic mechanism of pentylenetetrazole in down syndrome
<p>Abstract</p> <p>Background</p> <p>Pentylenetetrazole (PTZ) has recently been found to ameliorate cognitive impairment in rodent models of Down syndrome (DS). The mechanism underlying PTZ's therapeutic effect in DS is however not clear. Microarray profiling has previously reported differential expression, both up- and down-regulation, of genes in DS. Given this, transcriptomic data related to PTZ treatment, if available, could be used to understand the drug's therapeutic mechanism in DS. No such mammalian data however exists. Nevertheless, a <it>Drosophila </it>model inspired by PTZ induced kindling plasticity in rodents has recently been described. Microarray profiling has shown PTZ's downregulatory effect on gene expression in the fly heads.</p> <p>Methods</p> <p>In a comparative transcriptomics approach, I have analyzed the available microarray data in order to identify potential therapeutic mechanism of PTZ in DS. In the analysis, summary data of up- and down-regulated genes reported in human DS studies and of down-regulated genes reported in the <it>Drosophila </it>model has been used.</p> <p>Results</p> <p>I find that transcriptomic correlate of chronic PTZ in <it>Drosophila </it>counteracts that of DS. Genes downregulated by PTZ significantly over-represent genes upregulated in DS and under-represent genes downregulated in DS. Further, the genes which are common in the downregulated and upregulated DS set show enrichment for MAP kinase pathway.</p> <p>Conclusion</p> <p>My analysis suggests that downregulation of MAP kinase pathway may mediate therapeutic effect of PTZ in DS. Existing evidence implicating MAP kinase pathway in DS supports this observation.</p
Critical assessment of methods of protein structure prediction—Round VII
This paper is an introduction to the supplemental issue of the journal PROTEINS, dedicated to the seventh CASP experiment to assess the state of the art in protein structure prediction. The paper describes the conduct of the experiment, the categories of prediction included, and outlines the evaluation and assessment procedures. Highlights are improvements in model accuracy relative to that obtainable from knowledge of a single best template structure; convergence of the accuracy of models produced by automatic servers toward that produced by human modeling teams; the emergence of methods for predicting the quality of models; and rapidly increasing practical applications of the methods
3V: cavity, channel and cleft volume calculator and extractor
As larger macromolecular structures become available, there is a growing need to understand their ‘internal’ volumes—such as deep clefts, channels and cavities—as these often play critical roles in their function. The 3V web server can automatically extract and comprehensively analyze all the internal volumes from input RNA and protein structures. It rapidly finds internal volumes by taking the difference between two rolling-probe solvent-excluded surfaces, one with as large as possible a probe radius and the other with a solvent radius (typically 1.5 Å for water). The outputs are volumetric representations, both as images and downloadable files, which can be used for further analysis
Extending CATH: increasing coverage of the protein structure universe and linking structure with function
CATH version 3.3 (class, architecture, topology, homology) contains 128 688 domains, 2386 homologous superfamilies and 1233 fold groups, and reflects a major focus on classifying structural genomics (SG) structures and transmembrane proteins, both of which are likely to add structural novelty to the database and therefore increase the coverage of protein fold space within CATH. For CATH version 3.4 we have significantly improved the presentation of sequence information and associated functional information for CATH superfamilies. The CATH superfamily pages now reflect both the functional and structural diversity within the superfamily and include structural alignments of close and distant relatives within the superfamily, annotated with functional information and details of conserved residues. A significantly more efficient search function for CATH has been established by implementing the search server Solr (http://lucene.apache.org/solr/). The CATH v3.4 webpages have been built using the Catalyst web framework
Boosting the prediction and understanding of DNA-binding domains from sequence
DNA-binding proteins perform vital functions related to transcription, repair and replication. We have developed a new sequence-based machine learning protocol to identify DNA-binding proteins. We compare our method with an extensive benchmark of previously published structure-based machine learning methods as well as a standard sequence alignment technique, BLAST. Furthermore, we elucidate important feature interactions found in a learned model and analyze how specific rules capture general mechanisms that extend across DNA-binding motifs. This analysis is carried out using the malibu machine learning workbench available at http://proteomics.bioengr.uic.edu/malibu and the corresponding data sets and features are available at http://proteomics.bioengr.uic.edu/dna
RHYTHM—a server to predict the orientation of transmembrane helices in channels and membrane-coils
RHYTHM is a web server that predicts buried versus exposed residues of helical membrane proteins. Starting from a given protein sequence, secondary and tertiary structure information is calculated by RHYTHM within only a few seconds. The prediction applies structural information from a growing data base of precalculated packing files and evolutionary information from sequence patterns conserved in a representative dataset of membrane proteins (‘Pfam-domains’). The program uses two types of position specific matrices to account for the different geometries of packing in channels and transporters (‘channels’) or other membrane proteins (‘membrane-coils’). The output provides information on the secondary structure and topology of the protein and specifically on the contact type of each residue and its conservation. This information can be downloaded as a graphical file for illustration, a text file for analysis and statistics and a PyMOL file for modeling purposes. The server can be freely accessed at: URL: http://proteinformatics.de/rhyth
Visual cavity analysis in molecular simulations
Molecular surfaces provide a useful mean for analyzing interactions between biomolecules; such as identification and characterization of ligand binding sites to a host macromolecule. We present a novel technique, which extracts potential binding sites, represented by cavities, and characterize them by 3D graphs and by amino acids. The binding sites are extracted using an implicit function sampling and graph algorithms. We propose an advanced cavity exploration technique based on the graph parameters and associated amino acids. Additionally, we interactively visualize the graphs in the context of the molecular surface. We apply our method to the analysis of MD simulations of Proteinase 3, where we verify the previously described cavities and suggest a new potential cavity to be studied
Installing hydrolytic activity into a completely <i>de novo </i>protein framework
The design of enzyme-like catalysts tests our understanding of sequence-to-structure/function relationships in proteins. Here we install hydrolytic activity predictably into a completely de novo and thermostable α-helical barrel, which comprises seven helices arranged around an accessible channel. We show that the lumen of the barrel accepts 21 mutations to functional polar residues. The resulting variant, which has cysteine–histidine–glutamic acid triads on each helix, hydrolyses p-nitrophenyl acetate with catalytic efficiencies that match the most-efficient redesigned hydrolases based on natural protein scaffolds. This is the first report of a functional catalytic triad engineered into a de novo protein framework. The flexibility of our system also allows the facile incorporation of unnatural side chains to improve activity and probe the catalytic mechanism. Such a predictable and robust construction of truly de novo biocatalysts holds promise for applications in chemical and biochemical synthesis
- …