14,147 research outputs found

    An analysis of the Sargasso Sea resource and the consequences for database composition

    Get PDF
    Background: The environmental sequencing of the Sargasso Sea has introduced a huge new resource of genomic information. Unlike the protein sequences held in the current searchable databases, the Sargasso Sea sequences originate from a single marine environment and have been sequenced from species that are not easily obtainable by laboratory cultivation. The resource also contains very many fragments of whole protein sequences, a side effect of the shotgun sequencing method.These sequences form a significant addendum to the current searchable databases but also present us with some intrinsic difficulties. While it is important to know whether it is possible to assign function to these sequences with the current methods and whether they will increase our capacity to explore sequence space, it is also interesting to know how current bioinformatics techniques will deal with the new sequences in the resource.Results: The Sargasso Sea sequences seem to introduce a bias that decreases the potential of current methods to propose structure and function for new proteins. In particular the high proportion of sequence fragments in the resource seems to result in poor quality multiple alignments.Conclusion: These observations suggest that the new sequences should be used with care, especially if the information is to be used in large scale analyses. On a positive note, the results may just spark improvements in computational and experimental methods to take into account the fragments generated by environmental sequencing techniques

    Blueprint for a high-performance biomaterial: full-length spider dragline silk genes.

    Get PDF
    Spider dragline (major ampullate) silk outperforms virtually all other natural and manmade materials in terms of tensile strength and toughness. For this reason, the mass-production of artificial spider silks through transgenic technologies has been a major goal of biomimetics research. Although all known arthropod silk proteins are extremely large (>200 kiloDaltons), recombinant spider silks have been designed from short and incomplete cDNAs, the only available sequences. Here we describe the first full-length spider silk gene sequences and their flanking regions. These genes encode the MaSp1 and MaSp2 proteins that compose the black widow's high-performance dragline silk. Each gene includes a single enormous exon (>9000 base pairs) that translates into a highly repetitive polypeptide. Patterns of variation among sequence repeats at the amino acid and nucleotide levels indicate that the interaction of selection, intergenic recombination, and intragenic recombination governs the evolution of these highly unusual, modular proteins. Phylogenetic footprinting revealed putative regulatory elements in non-coding flanking sequences. Conservation of both upstream and downstream flanking sequences was especially striking between the two paralogous black widow major ampullate silk genes. Because these genes are co-expressed within the same silk gland, there may have been selection for similarity in regulatory regions. Our new data provide complete templates for synthesis of recombinant silk proteins that significantly improve the degree to which artificial silks mimic natural spider dragline fibers

    Dramatic expansion of the black widow toxin arsenal uncovered by multi-tissue transcriptomics and venom proteomics.

    Get PDF
    BackgroundAnimal venoms attract enormous interest given their potential for pharmacological discovery and understanding the evolution of natural chemistries. Next-generation transcriptomics and proteomics provide unparalleled, but underexploited, capabilities for venom characterization. We combined multi-tissue RNA-Seq with mass spectrometry and bioinformatic analyses to determine venom gland specific transcripts and venom proteins from the Western black widow spider (Latrodectus hesperus) and investigated their evolution.ResultsWe estimated expression of 97,217 L. hesperus transcripts in venom glands relative to silk and cephalothorax tissues. We identified 695 venom gland specific transcripts (VSTs), many of which BLAST and GO term analyses indicate may function as toxins or their delivery agents. ~38% of VSTs had BLAST hits, including latrotoxins, inhibitor cystine knot toxins, CRISPs, hyaluronidases, chitinase, and proteases, and 59% of VSTs had predicted protein domains. Latrotoxins are venom toxins that cause massive neurotransmitter release from vertebrate or invertebrate neurons. We discovered ≥ 20 divergent latrotoxin paralogs expressed in L. hesperus venom glands, significantly increasing this biomedically important family. Mass spectrometry of L. hesperus venom identified 49 proteins from VSTs, 24 of which BLAST to toxins. Phylogenetic analyses showed venom gland specific gene family expansions and shifts in tissue expression.ConclusionsQuantitative expression analyses comparing multiple tissues are necessary to identify venom gland specific transcripts. We present a black widow venom specific exome that uncovers a trove of diverse toxins and associated proteins, suggesting a dynamic evolutionary history. This justifies a reevaluation of the functional activities of black widow venom in light of its emerging complexity

    The fate of Arabidopsis thaliana homeologous CNSs and their motifs in the Paleohexaploid Brassica rapa.

    Get PDF
    Following polyploidy, duplicate genes are often deleted, and if they are not, then duplicate regulatory regions are sometimes lost. By what mechanism is this loss and what is the chance that such a loss removes function? To explore these questions, we followed individual Arabidopsis thaliana-A. thaliana conserved noncoding sequences (CNSs) into the Brassica ancestor, through a paleohexaploidy and into Brassica rapa. Thus, a single Brassicaceae CNS has six potential orthologous positions in B. rapa; a single Arabidopsis CNS has three potential homeologous positions. We reasoned that a CNS, if present on a singlet Brassica gene, would be unlikely to lose function compared with a more redundant CNS, and this is the case. Redundant CNSs go nondetectable often. Using this logic, each mechanism of CNS loss was assigned a metric of functionality. By definition, proved deletions do not function as sequence. Our results indicated that CNSs that go nondetectable by base substitution or large insertion are almost certainly still functional (redundancy does not matter much to their detectability frequency), whereas those lost by inferred deletion or indels are approximately 75% likely to be nonfunctional. Overall, an average nondetectable, once-redundant CNS more than 30 bp in length has a 72% chance of being nonfunctional, and that makes sense because 97% of them sort to a molecular mechanism with deletion in its description, but base substitutions do cause loss. Similarly, proved-functional G-boxes go undetectable by deletion 82% of the time. Fractionation mutagenesis is a procedure that uses polyploidy as a mutagenic agent to genetically alter RNA expression profiles, and then to construct testable hypotheses as to the function of the lost regulatory site. We show fractionation mutagenesis to be a deletion machine in the Brassica lineage

    Signal Transduction and Pathogenic Modifications at the Melanocortin-4 Receptor: A Structural Perspective

    Get PDF
    The melanocortin-4 receptor (MC4R) can be endogenously activated by binding of melanocyte-stimulating hormones (MSH), which mediates anorexigenic effects. In contrast, the agouti-related peptide (AgRP) acts as an endogenous inverse agonist and suppresses ligand-independent basal signaling activity (orexigenic effects). Binding of ligands to MC4R leads to the activation of different G-protein subtypes or arrestin and concomitant signaling pathways. This receptor is a key protein in the hypothalamic regulation of food intake and energy expenditure and naturally-occurring inactivating MC4R variants are the most frequent cause of monogenic obesity. In general, obesity is a growing problem on a global scale and is of social, medical, and economic relevance. A significant goal is to develop optimized pharmacological tools targeting MC4R without adverse effects. To date, this has not been achieved because of inter alia non-selective ligands across the five functionally different MCR subtypes (MC1-5R). This motivates further investigation of (i) the three-dimensional MC4R structure, (ii) binding mechanisms of various ligands, and (iii) the molecular transfer process of signal transduction, with the aim of understanding how structural features are linked with functional-physiological aspects. Unfortunately, experimentally elucidated structural information is not yet available for theMC receptors, a group of class A G-protein coupled receptors (GPCRs). We, therefore, generated MC4R homology models and complexes with interacting partners to describe approximate structural properties associated with signaling mechanisms. In addition, molecular insights from pathogenic mutations were incorporated to discriminate more precisely their individual malfunction of the signal transfer mechanism

    Structural basis for activation of trimeric Gi proteins by multiple growth factor receptors via GIV/Girdin.

    Get PDF
    A long-standing issue in the field of signal transduction is to understand the cross-talk between receptor tyrosine kinases (RTKs) and heterotrimeric G proteins, two major and distinct signaling hubs that control eukaryotic cell behavior. Although stimulation of many RTKs leads to activation of trimeric G proteins, the molecular mechanisms behind this phenomenon remain elusive. We discovered a unifying mechanism that allows GIV/Girdin, a bona fide metastasis-related protein and a guanine-nucleotide exchange factor (GEF) for Gαi, to serve as a direct platform for multiple RTKs to activate Gαi proteins. Using a combination of homology modeling, protein-protein interaction, and kinase assays, we demonstrate that a stretch of ∼110 amino acids within GIV C-terminus displays structural plasticity that allows folding into a SH2-like domain in the presence of phosphotyrosine ligands. Using protein-protein interaction assays, we demonstrated that both SH2 and GEF domains of GIV are required for the formation of a ligand-activated ternary complex between GIV, Gαi, and growth factor receptors and for activation of Gαi after growth factor stimulation. Expression of a SH2-deficient GIV mutant (Arg 1745→Leu) that cannot bind RTKs impaired all previously demonstrated functions of GIV-Akt enhancement, actin remodeling, and cell migration. The mechanistic and structural insights gained here shed light on the long-standing questions surrounding RTK/G protein cross-talk, set a novel paradigm, and characterize a unique pharmacological target for uncoupling GIV-dependent signaling downstream of multiple oncogenic RTKs
    • …
    corecore