42,764 research outputs found

    The Phyre2 web portal for protein modeling, prediction and analysis

    Get PDF
    Phyre2 is a suite of tools available on the web to predict and analyze protein structure, function and mutations. The focus of Phyre2 is to provide biologists with a simple and intuitive interface to state-of-the-art protein bioinformatics tools. Phyre2 replaces Phyre, the original version of the server for which we previously published a paper in Nature Protocols. In this updated protocol, we describe Phyre2, which uses advanced remote homology detection methods to build 3D models, predict ligand binding sites and analyze the effect of amino acid variants (e.g., nonsynonymous SNPs (nsSNPs)) for a user's protein sequence. Users are guided through results by a simple interface at a level of detail they determine. This protocol will guide users from submitting a protein sequence to interpreting the secondary and tertiary structure of their models, their domain composition and model quality. A range of additional available tools is described to find a protein structure in a genome, to submit large number of sequences at once and to automatically run weekly searches for proteins that are difficult to model. The server is available at http://www.sbg.bio.ic.ac.uk/phyre2. A typical structure prediction will be returned between 30 min and 2 h after submission

    Comparative genomics of Burkholderia multivorans, a ubiquitous pathogen with a highly conserved genomic structure

    Get PDF
    The natural environment serves as a reservoir of opportunistic pathogens. A well-established method for studying the epidemiology of such opportunists is multilocus sequence typing, which in many cases has defined strains predisposed to causing infection. Burkholderia multivorans is an important pathogen in people with cystic fibrosis (CF) and its epidemiology suggests that strains are acquired from non-human sources such as the natural environment. This raises the central question of whether the isolation source (CF or environment) or the multilocus sequence type (ST) of B. multivorans better predicts their genomic content and functionality. We identified four pairs of B. multivorans isolates, representing distinct STs and consisting of one CF and one environmental isolate each. All genomes were sequenced using the PacBio SMRT sequencing technology, which resulted in eight high-quality B. multivorans genome assemblies. The present study demonstrated that the genomic structure of the examined B. multivorans STs is highly conserved and that the B. multivorans genomic lineages are defined by their ST. Orthologous protein families were not uniformly distributed among chromosomes, with core orthologs being enriched on the primary chromosome and ST-specific orthologs being enriched on the second and third chromosome. The ST-specific orthologs were enriched in genes involved in defense mechanisms and secondary metabolism, corroborating the strain-specificity of these virulence characteristics. Finally, the same B. multivorans genomic lineages occur in both CF and environmental samples and on different continents, demonstrating their ubiquity and evolutionary persistence

    FFAS server: novel features and applications.

    Get PDF
    The Fold and Function Assignment System (FFAS) server [Jaroszewski et al. (2005) FFAS03: a server for profile-profile sequence alignments. Nucleic Acids Research, 33, W284-W288] implements the algorithm for protein profile-profile alignment introduced originally in [Rychlewski et al. (2000) Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Science: a Publication of the Protein Society, 9, 232-241]. Here, we present updates, changes and novel functionality added to the server since 2005 and discuss its new applications. The sequence database used to calculate sequence profiles was enriched by adding sets of publicly available metagenomic sequences. The profile of a user's protein can now be compared with ∼20 additional profile databases, including several complete proteomes, human proteins involved in genetic diseases and a database of microbial virulence factors. A newly developed interface uses a system of tabs, allowing the user to navigate multiple results pages, and also includes novel functionality, such as a dotplot graph viewer, modeling tools, an improved 3D alignment viewer and links to the database of structural similarities. The FFAS server was also optimized for speed: running times were reduced by an order of magnitude. The FFAS server, http://ffas.godziklab.org, has no log-in requirement, albeit there is an option to register and store results in individual, password-protected directories. Source code and Linux executables for the FFAS program are available for download from the FFAS server

    Of bits and bugs

    Get PDF
    Pur-α is a nucleic acid-binding protein involved in cell cycle control, transcription, and neuronal function. Initially no prediction of the three-dimensional structure of Pur-α was possible. However, recently we solved the X-ray structure of Pur-α from the fruitfly Drosophila melanogaster and showed that it contains a so-called PUR domain. Here we explain how we exploited bioinformatics tools in combination with X-ray structure determination of a bacterial homolog to obtain diffracting crystals and the high-resolution structure of Drosophila Pur-α. First, we used sensitive methods for remote-homology detection to find three repetitive regions in Pur-α. We realized that our lack of understanding how these repeats interact to form a globular domain was a major problem for crystallization and structure determination. With our information on the repeat motifs we then identified a distant bacterial homolog that contains only one repeat. We determined the bacterial crystal structure and found that two of the repeats interact to form a globular domain. Based on this bacterial structure, we calculated a computational model of the eukaryotic protein. The model allowed us to design a crystallizable fragment and to determine the structure of Drosophila Pur-α. Key for success was the fact that single repeats of the bacterial protein self-assembled into a globular domain, instructing us on the number and boundaries of repeats to be included for crystallization trials with the eukaryotic protein. This study demonstrates that the simpler structural domain arrangement of a distant prokaryotic protein can guide the design of eukaryotic crystallization constructs. Since many eukaryotic proteins contain multiple repeats or repeating domains, this approach might be instructive for structural studies of a range of proteins

    Rampant exchange of the structure and function of extramembrane domains between membrane and water soluble proteins.

    Get PDF
    Of the membrane proteins of known structure, we found that a remarkable 67% of the water soluble domains are structurally similar to water soluble proteins of known structure. Moreover, 41% of known water soluble protein structures share a domain with an already known membrane protein structure. We also found that functional residues are frequently conserved between extramembrane domains of membrane and soluble proteins that share structural similarity. These results suggest membrane and soluble proteins readily exchange domains and their attendant functionalities. The exchanges between membrane and soluble proteins are particularly frequent in eukaryotes, indicating that this is an important mechanism for increasing functional complexity. The high level of structural overlap between the two classes of proteins provides an opportunity to employ the extensive information on soluble proteins to illuminate membrane protein structure and function, for which much less is known. To this end, we employed structure guided sequence alignment to elucidate the functions of membrane proteins in the human genome. Our results bridge the gap of fold space between membrane and water soluble proteins and provide a resource for the prediction of membrane protein function. A database of predicted structural and functional relationships for proteins in the human genome is provided at sbi.postech.ac.kr/emdmp

    The Borrelia afzelii outer membrane protein BAPKO_0422 binds human Factor-H and is predicted to form a membrane-spanning beta-barrel

    Get PDF
    The deep evolutionary history of the Spirochetes places their branch point early in the evolution of the diderms, before the divergence of the present day Proteobacteria. As a Spirochete, the morphology of the Borrelia cell envelope shares characteristics of both Gram-positive and Gram-negative bacteria. A thin layer of peptidoglycan, tightly associated with the cytoplasmic membrane is surrounded by a more labile outer membrane (OM). This OM is rich in lipoproteins but with few known integral membrane proteins. The OmpA domain is an eight-stranded membrane-spanning β-barrel, highly conserved among the Proteobacteria but so far unknown in the Spirochetes. In the present work we describe the identification of four novel OmpA-like β-barrels from Borrelia afzelii, the most common cause of erythema migrans rash in Europe. Structural characterisation of one these proteins (BAPKO_0422) by small angle X-ray scattering (SAXS) and circular dichroism indicate a compact globular structure rich in β-strand consistent with a monomeric β-barrel. Ab initio molecular envelopes calculated from the scattering profile are consistent with homology models and demonstrate that BAPKO_0422 adopts a peanut shape with dimensions 25 x 45 Å. Deviations from the standard C-terminal signature sequence are apparent; in particular the C-terminal Phe residue commonly found in Proteobacterial OM proteins is replaced by Ile/Leu or Asn. BAPKO_0422 is demonstrated to bind human factor-H and therefore may contribute to immune evasion by inhibition of the complement response. Encoded by chromosomal genes, these proteins are highly conserved between Borrelia subspecies and may be of diagnostic or therapeutic value

    AIP1 is a novel Agenet/Tudor domain protein from Arabidopsis that interacts with regulators of DNA replication, transcription and chromatin remodeling

    Get PDF
    Background: DNA replication and transcription are dynamic processes regulating plant development that are dependent on the chromatin accessibility. Proteins belonging to the Agenet/Tudor domain family are known as histone modification "readers" and classified as chromatin remodeling proteins. Histone modifications and chromatin remodeling have profound effects on gene expression as well as on DNA replication, but how these processes are integrated has not been completely elucidated. It is clear that members of the Agenet/Tudor family are important regulators of development playing roles not well known in plants. Methods: Bioinformatics and phylogenetic analyses of the Agenet/Tudor Family domain in the plant kingdom were carried out with sequences from available complete genomes databases. 3D structure predictions of Agenet/Tudor domains were calculated by I-TASSER server. Protein interactions were tested in two-hybrid, GST pulldown, semi-in vivo pulldown and Tandem Affinity Purification assays. Gene function was studied in a T-DNA insertion GABI-line. Results: In the present work we analyzed the family of Agenet/Tudor domain proteins in the plant kingdom and we mapped the organization of this family throughout plant evolution. Furthermore, we characterized a member from Arabidopsis thaliana named AIP1 that harbors Agenet/Tudor and DUF724 domains. AIP1 interacts with ABAP1, a plant regulator of DNA replication licensing and gene transcription, with a plant histone modification "reader" (LHP1) and with non modified histones. AIP1 is expressed in reproductive tissues and its down-regulation delays flower development timing. Also, expression of ABAP1 and LHP1 target genes were repressed in flower buds of plants with reduced levels of AIP1. Conclusions: AIP1 is a novel Agenet/Tudor domain protein in plants that could act as a link between DNA replication, transcription and chromatin remodeling during flower development

    Structural genomics analysis of uncharacterized protein families overrepresented in human gut bacteria identifies a novel glycoside hydrolase.

    Get PDF
    BackgroundBacteroides spp. form a significant part of our gut microbiome and are well known for optimized metabolism of diverse polysaccharides. Initial analysis of the archetypal Bacteroides thetaiotaomicron genome identified 172 glycosyl hydrolases and a large number of uncharacterized proteins associated with polysaccharide metabolism.ResultsBT_1012 from Bacteroides thetaiotaomicron VPI-5482 is a protein of unknown function and a member of a large protein family consisting entirely of uncharacterized proteins. Initial sequence analysis predicted that this protein has two domains, one on the N- and one on the C-terminal. A PSI-BLAST search found over 150 full length and over 90 half size homologs consisting only of the N-terminal domain. The experimentally determined three-dimensional structure of the BT_1012 protein confirms its two-domain architecture and structural analysis of both domains suggests their specific functions. The N-terminal domain is a putative catalytic domain with significant similarity to known glycoside hydrolases, the C-terminal domain has a beta-sandwich fold typically found in C-terminal domains of other glycosyl hydrolases, however these domains are typically involved in substrate binding. We describe the structure of the BT_1012 protein and discuss its sequence-structure relationship and their possible functional implications.ConclusionsStructural and sequence analyses of the BT_1012 protein identifies it as a glycosyl hydrolase, expanding an already impressive catalog of enzymes involved in polysaccharide metabolism in Bacteroides spp. Based on this we have renamed the Pfam families representing the two domains found in the BT_1012 protein, PF13204 and PF12904, as putative glycoside hydrolase and glycoside hydrolase-associated C-terminal domain respectively
    corecore