17,459 research outputs found

    Revealing evolutionary constraints on proteins through sequence analysis

    Full text link
    Statistical analysis of alignments of large numbers of protein sequences has revealed "sectors" of collectively coevolving amino acids in several protein families. Here, we show that selection acting on any functional property of a protein, represented by an additive trait, can give rise to such a sector. As an illustration of a selected trait, we consider the elastic energy of an important conformational change within an elastic network model, and we show that selection acting on this energy leads to correlations among residues. For this concrete example and more generally, we demonstrate that the main signature of functional sectors lies in the small-eigenvalue modes of the covariance matrix of the selected sequences. However, secondary signatures of these functional sectors also exist in the extensively-studied large-eigenvalue modes. Our simple, general model leads us to propose a principled method to identify functional sectors, along with the magnitudes of mutational effects, from sequence data. We further demonstrate the robustness of these functional sectors to various forms of selection, and the robustness of our approach to the identification of multiple selected traits.Comment: 37 pages, 28 figure

    Computational approaches to identify genetic interactions for cancer therapeutics

    Get PDF
    The development of improved cancer therapies is frequently cited as an urgent unmet medical need. Here we describe how genetic interactions are being therapeutically exploited to identify novel targeted treatments for cancer. We discuss the current methodologies that use ‘omics data to identify genetic interactions, in particular focusing on synthetic sickness lethality (SSL) and synthetic dosage lethality (SDL). We describe the experimen- tal and computational approaches undertaken both in humans and model organisms to identify these interac- tions. Finally we discuss some of the identified targets with licensed drugs, inhibitors in clinical trials or with compounds under development

    Blueprint for a high-performance biomaterial: full-length spider dragline silk genes.

    Get PDF
    Spider dragline (major ampullate) silk outperforms virtually all other natural and manmade materials in terms of tensile strength and toughness. For this reason, the mass-production of artificial spider silks through transgenic technologies has been a major goal of biomimetics research. Although all known arthropod silk proteins are extremely large (>200 kiloDaltons), recombinant spider silks have been designed from short and incomplete cDNAs, the only available sequences. Here we describe the first full-length spider silk gene sequences and their flanking regions. These genes encode the MaSp1 and MaSp2 proteins that compose the black widow's high-performance dragline silk. Each gene includes a single enormous exon (>9000 base pairs) that translates into a highly repetitive polypeptide. Patterns of variation among sequence repeats at the amino acid and nucleotide levels indicate that the interaction of selection, intergenic recombination, and intragenic recombination governs the evolution of these highly unusual, modular proteins. Phylogenetic footprinting revealed putative regulatory elements in non-coding flanking sequences. Conservation of both upstream and downstream flanking sequences was especially striking between the two paralogous black widow major ampullate silk genes. Because these genes are co-expressed within the same silk gland, there may have been selection for similarity in regulatory regions. Our new data provide complete templates for synthesis of recombinant silk proteins that significantly improve the degree to which artificial silks mimic natural spider dragline fibers

    Local Function Conservation in Sequence and Structure Space

    Get PDF
    We assess the variability of protein function in protein sequence and structure space. Various regions in this space exhibit considerable difference in the local conservation of molecular function. We analyze and capture local function conservation by means of logistic curves. Based on this analysis, we propose a method for predicting molecular function of a query protein with known structure but unknown function. The prediction method is rigorously assessed and compared with a previously published function predictor. Furthermore, we apply the method to 500 functionally unannotated PDB structures and discuss selected examples. The proposed approach provides a simple yet consistent statistical model for the complex relations between protein sequence, structure, and function. The GOdot method is available online (http://godot.bioinf.mpi-inf.mpg.de)

    Structural basis of IL-23 antagonism by an Alphabody protein scaffold

    Get PDF
    Protein scaffolds can provide a promising alternative to antibodies for various biomedical and biotechnological applications, including therapeutics. Here we describe the design and development of the Alphabody, a protein scaffold featuring a single-chain antiparallel triple-helix coiled-coil fold. We report affinity-matured Alphabodies with favourable physicochemical properties that can specifically neutralize human interleukin (IL)-23, a pivotal therapeutic target in autoimmune inflammatory diseases such as psoriasis and multiple sclerosis. The crystal structure of human IL-23 in complex with an affinity-matured Alphabody reveals how the variable interhelical groove of the scaffold uniquely targets a large epitope on the p19 subunit of IL-23 to harness fully the hydrophobic and hydrogen-bonding potential of tryptophan and tyrosine residues contributed by p19 and the Alphabody, respectively. Thus, Alphabodies are suitable for targeting protein-protein interfaces of therapeutic importance and can be tailored to interrogate desired design and binding-mode principles via efficient selection and affinity-maturation strategies

    The genome of Romanomermis culicivorax:revealing fundamental changes in the core developmental genetic toolkit in Nematoda

    Get PDF
    Background: The genetics of development in the nematode Caenorhabditis elegans has been described in exquisite detail. The phylum Nematoda has two classes: Chromadorea (which includes C. elegans) and the Enoplea. While the development of many chromadorean species resembles closely that of C. elegans, enoplean nematodes show markedly different patterns of early cell division and cell fate assignment. Embryogenesis of the enoplean Romanomermis culicivorax has been studied in detail, but the genetic circuitry underpinning development in this species has not been explored. Results: We generated a draft genome for R. culicivorax and compared its gene content with that of C. elegans, a second enoplean, the vertebrate parasite Trichinella spiralis, and a representative arthropod, Tribolium castaneum. This comparison revealed that R. culicivorax has retained components of the conserved ecdysozoan developmental gene toolkit lost in C. elegans. T. spiralis has independently lost even more of this toolkit than has C. elegans. However, the C. elegans toolkit is not simply depauperate, as many novel genes essential for embryogenesis in C. elegans are not found in, or have only extremely divergent homologues in R. culicivorax and T. spiralis. Our data imply fundamental differences in the genetic programmes not only for early cell specification but also others such as vulva formation and sex determination. Conclusions: Despite the apparent morphological conservatism, major differences in the molecular logic of development have evolved within the phylum Nematoda. R. culicivorax serves as a tractable system to contrast C. elegans and understand how divergent genomic and thus regulatory backgrounds nevertheless generate a conserved phenotype. The R. culicivorax draft genome will promote use of this species as a research model

    Secondary metabolite gene clusters in the entomopathogen fungus Metarhizium anisopliae : genome identification and patterns of expression in a cuticle infection model

    Get PDF
    Background: The described species from the Metarhizium genus are cosmopolitan fungi that infect arthropod hosts. Interestingly, while some species infect a wide range of hosts (host-generalists), other species infect only a few arthropods (host-specialists). This singular evolutionary trait permits unique comparisons to determine how pathogens and virulence determinants emerge. Among the several virulence determinants that have been described, secondary metabolites (SMs) are suggested to play essential roles during fungal infection. Despite progress in the study of pathogen-host relationships, the majority of genes related to SM production in Metarhizium spp. are uncharacterized, and little is known about their genomic organization, expression and regulation. To better understand how infection conditions may affect SM production in Metarhizium anisopliae, we have performed a deep survey and description of SM biosynthetic gene clusters (BGCs) in M. anisopliae, analyzed RNA-seq data from fungi grown on cattle-tick cuticles, evaluated the differential expression of BGCs, and assessed conservation among the Metarhizium genus. Furthermore, our analysis extended to the construction of a phylogeny for the following three BGCs: a tropolone/citrinin-related compound (MaPKS1), a pseurotin-related compound (MaNRPS-PKS2), and a putative helvolic acid (MaTERP1). Results: Among 73 BGCs identified in M. anisopliae, 20 % were up-regulated during initial tick cuticle infection and presumably possess virulence-related roles. These up-regulated BGCs include known clusters, such as destruxin, NG39x and ferricrocin, together with putative helvolic acid and, pseurotin and tropolone/citrinin-related compound clusters as well as uncharacterized clusters. Furthermore, several previously characterized and putative BGCs were silent or down-regulated in initial infection conditions, indicating minor participation over the course of infection. Interestingly, several up-regulated BGCs were not conserved in host-specialist species from the Metarhizium genus, indicating differences in the metabolic strategies employed by generalist and specialist species to overcome and kill their host. These differences in metabolic potential may have been partially shaped by horizontal gene transfer (HGT) events, as our phylogenetic analysis provided evidence that the putative helvolic acid cluster in Metarhizium spp. originated from an HGT event. Conclusions: Several unknown BGCs are described, and aspects of their organization, regulation and origin are discussed, providing further support for the impact of SM on the Metarhizium genus lifestyle and infection process
    corecore