296 research outputs found

    An ATP-binding cassette-type cysteine transporter in Campylobacter jejuni inferred from the structure of an extracytoplasmic solute receptor protein

    Campylobacter jejuni is a Gram-negative food-borne pathogen associated with gastroenteritis in humans as well as cases of the autoimmune disease Guillain-Barré syndrome. C. jejuni is asaccharolytic because it lacks an active glycolytic pathway for the use of sugars as a carbon source. This suggests an increased reliance on amino acids as nutrients, and indeed the genome sequence of this organism indicates the presence of a number of amino acid uptake systems. Cj0982, also known as CjaA, is a putative extracytoplasmic solute receptor for one such uptake system as well as a major surface antigen and vaccine candidate. The crystal structure of Cj0982 reveals a two-domain protein with density in the enclosed cavity between the domains that clearly defines the presence of a bound cysteine ligand. Fluorescence titration experiments were used to demonstrate that Cj0982 binds cysteine tightly and specifically, with a Kd of approximately 10^-7 M, consistent with a role as a receptor for a high-affinity transporter. These data imply that Cj0982 is the binding protein component of an ABC-type cysteine transporter system and that cysteine uptake is important in the physiology of C. jejuni.

    Quasi-radial growth of metal tube on Si nanowires template

    This article reports that Si nanowires can be employed as a positive template for the controllable electrochemical deposition of noble metal tubes. The deposited tubes exhibit good crystallinity. Scanning electron microscope and transmission electron microscope characterizations are conducted to reveal the growth process of the metal tubes, showing that the metal grows quasi-radially on the wall of the Si nanowire. This quasi-radial growth enables the fabrication of metal tubes of defined thickness by changing the deposition time. Metal tubes of defined inner diameter are achieved by choosing Si nanowires of the desired diameter as a template. Metal tubes with inner diameters ranging from 1 μm to sub-50 nm are fabricated.

    A new family of periplasmic-binding proteins that sense arsenic oxyanions

    Arsenic contamination of drinking water affects more than 140 million people worldwide. While toxic to humans, inorganic forms of arsenic (arsenite and arsenate) can be used as energy sources for microbial respiration. AioX and its orthologues (ArxX and ArrX) represent the first members of a new sub-family of periplasmic-binding proteins that serve as the first component of a signal transduction system whose role is to positively regulate expression of arsenic metabolism enzymes. As determined by X-ray crystallography for AioX, arsenite binding only requires subtle conformational changes in protein structure, providing insights into protein-ligand interactions. The binding pocket of all orthologues is conserved, but this alone is not sufficient for oxyanion selectivity, with proteins selectively binding either arsenite or arsenate. Phylogenetic evidence clearly demonstrates that the regulatory proteins evolved together early in prokaryotic evolution and had a separate origin from the metabolic enzymes whose expression they regulate.

    Gene fusions and gene duplications: relevance to genomic annotation and functional analysis

    BACKGROUND: Escherichia coli, a model organism, provides information for annotation of other genomes. Our analysis of its genome has shown that proteins encoded by fused genes need special attention. Such composite (multimodular) proteins consist of two or more components (modules) encoding distinct functions. Multimodular proteins have been found to complicate both annotation and generation of sequence-similar groups. Previous work overstated the number of multimodular proteins in E. coli. This work corrects the identification of modules by including sequence information from proteins in 50 sequenced microbial genomes. RESULTS: Multimodular E. coli K-12 proteins were identified from sequence similarities between their component modules and non-fused proteins in 50 genomes and from the literature. We found 109 multimodular proteins in E. coli containing either two or three modules. Most modules had standalone sequence relatives in other genomes. The separated modules together with all the single (un-fused) proteins constitute the sum of all unimodular proteins of E. coli. Pairwise sequence relationships among all E. coli unimodular proteins generated 490 sequence-similar, paralogous groups. Groups ranged in size from 92 to 2 members and had varying degrees of relatedness among their members. Some E. coli enzyme groups were compared to homologs in other bacterial genomes. CONCLUSION: The deleterious effects of multimodular proteins on annotation and on the formation of groups of paralogs are emphasized. To improve annotation results, all multimodular proteins in an organism should be detected and, when known, each function should be connected with its location in the sequence of the protein. When transferring functions by sequence similarity, alignment locations must be noted, particularly when alignments cover only part of the sequences, in order to enable transfer of the correct function. Separating multimodular proteins into module units makes it possible to generate protein groups related by both sequence and function, avoiding mixing of unrelated sequences. Organisms differ in sizes of groups of sequence-related proteins. A sample comparison of orthologs to selected E. coli paralogous groups correlates with known physiological and taxonomic relationships between the organisms.

    N-gram analysis of 970 microbial organisms reveals presence of biological language models

    BACKGROUND: It has been suggested previously that genome and proteome sequences show characteristics typical of natural-language texts such as "signature-style" word usage indicative of authors or topics, and that the algorithms originally developed for natural language processing may therefore be applied to genome sequences to draw biologically relevant conclusions. Following this approach of 'biological language modeling', statistical n-gram analysis has been applied for comparative analysis of whole proteome sequences of 44 organisms. It has been shown that a few particular amino acid n-grams are found in abundance in one organism but occur very rarely in other organisms, thereby serving as genome signatures. At that time proteomes of only 44 organisms were available, thereby limiting the generalization of this hypothesis. Today nearly 1,000 genome sequences and corresponding translated sequences are available, making it feasible to test the existence of biological language models over the evolutionary tree. RESULTS: We studied whole proteome sequences of 970 microbial organisms using n-gram frequencies and cross-perplexity employing the Biological Language Modeling Toolkit and Patternix Revelio toolkit. Genus-specific signatures were observed even in a simple unigram distribution. By taking the statistical n-gram model of one organism as reference and computing cross-perplexity of all other microbial proteomes with it, cross-perplexity was found to be predictive of branch distance of the phylogenetic tree. For example, a 4-gram model from the proteome of Shigellae flexneri 2a, which belongs to the Gammaproteobacteria class, showed a self-perplexity of 15.34 while the cross-perplexity of other organisms was in the range of 15.59 to 29.5 and was proportional to their branching distance in the evolutionary tree from S. flexneri. The organisms of this genus, which happen to be pathotypes of E. coli, also have the closest perplexity values with E. coli. CONCLUSION: Whole proteome sequences of microbial organisms have been shown to contain particular n-gram sequences in abundance in one organism but occurring very rarely in other organisms, thereby serving as proteome signatures. Further, it has also been shown that perplexity, a statistical measure of similarity of n-gram composition, can be used to predict evolutionary distance within a genus in the phylogenetic tree.
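
    The cross-perplexity idea in the abstract above can be illustrated with a minimal sketch. This is not the Biological Language Modeling Toolkit or the authors' procedure: the function names, the add-alpha smoothing, the 20-letter alphabet and the toy sequences are all assumptions made for illustration. An n-gram model is trained on a reference proteome, and its perplexity is then measured on the reference itself and on another proteome.

```python
import math
from collections import Counter

# Assumption: the 20 standard amino acids form the vocabulary.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def train_ngram_model(sequences, n=2, alpha=1.0):
    """Count n-grams and their (n-1)-residue contexts in a set of protein sequences."""
    ngram_counts = Counter()
    context_counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - n + 1):
            gram = seq[i:i + n]
            ngram_counts[gram] += 1
            context_counts[gram[:-1]] += 1
    return {"n": n, "alpha": alpha, "ngrams": ngram_counts, "contexts": context_counts}


def perplexity(model, sequences):
    """Perplexity of `sequences` under `model`, using add-alpha smoothing (an assumption)."""
    n, alpha = model["n"], model["alpha"]
    vocab_size = len(AMINO_ACIDS)
    log_prob_sum = 0.0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - n + 1):
            gram = seq[i:i + n]
            # Smoothed conditional probability of the last residue given its context.
            p = (model["ngrams"][gram] + alpha) / (
                model["contexts"][gram[:-1]] + alpha * vocab_size
            )
            log_prob_sum += math.log2(p)
            total += 1
    if total == 0:
        raise ValueError("sequences are shorter than the n-gram order")
    return 2 ** (-log_prob_sum / total)


# Toy, made-up sequences standing in for two proteomes.
reference_proteome = ["MKVLAAGIVGLLLAQ", "MKKLLAAGIVAL"]
other_proteome = ["WWCCHHPPWWCCHH"]

model = train_ngram_model(reference_proteome, n=2)
self_perplexity = perplexity(model, reference_proteome)
cross_perplexity = perplexity(model, other_proteome)
```

    In this toy case the model fits its own training sequences better than the unrelated ones, so `self_perplexity` is lower than `cross_perplexity`. Whether that gap tracks phylogenetic distance is the empirical claim of the abstract; the sketch does not demonstrate it.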

    Crystal structure of human XLF/Cernunnos reveals unexpected differences from XRCC4 with implications for NHEJ

    The recently characterised 299-residue human XLF/Cernunnos protein plays a crucial role in DNA repair by non-homologous end joining (NHEJ) and interacts with the XRCC4–DNA Ligase IV complex. Here, we report the crystal structure of the XLF (1–233) homodimer at 2.3 Å resolution, confirming the predicted structural similarity to XRCC4. The XLF coiled-coil, however, is shorter than that of XRCC4 and undergoes an unexpected reverse in direction giving rise to a short distorted four helical bundle and a C-terminal helical structure wedged between the coiled-coil and head domain. The existence of a dimer as the major species is confirmed by size-exclusion chromatography, analytical ultracentrifugation, small-angle X-ray scattering and other biophysical methods. We show that the XLF structure is not easily compatible with a proposed XRCC4:XLF heterodimer. However, we demonstrate interactions between dimers of XLF and XRCC4 by surface plasmon resonance and analyse these in terms of surface properties, amino-acid conservation and mutations in immunodeficient patients. Our data are most consistent with head-to-head interactions in a 2:2:1 XRCC4:XLF:Ligase IV complex

    Reconstructing an Ancestral Mammalian Immune Supercomplex from a Marsupial Major Histocompatibility Complex

    The first sequenced marsupial genome promises to reveal unparalleled insights into mammalian evolution. We have used the Monodelphis domestica (gray short-tailed opossum) sequence to construct the first map of a marsupial major histocompatibility complex (MHC). The MHC is the most gene-dense region of the mammalian genome and is critical to immunity and reproductive success. The marsupial MHC bridges the phylogenetic gap between the complex MHC of eutherian mammals and the minimal essential MHC of birds. Here we show that the opossum MHC is gene dense and complex, as in humans, but shares more organizational features with non-mammals. The Class I genes have amplified within the Class II region, resulting in a unique Class I/II region. We present a model of the organization of the MHC in ancestral mammals and its elaboration during mammalian evolution. The opossum genome, together with other extant genomes, reveals the existence of an ancestral “immune supercomplex” that contained genes of both types of natural killer receptors together with antigen processing genes and MHC genes.

    Proteome sequence features carry signatures of the environmental niche of prokaryotes

    BACKGROUND: Prokaryotic environmental adaptations occur at different levels within cells to ensure the preservation of genome integrity, proper protein folding and function as well as membrane fluidity. Although specific composition and structure of cellular components suitable for the variety of extreme conditions has already been postulated, a systematic study describing such adaptations has not yet been performed. We therefore explored whether the environmental niche of a prokaryote could be deduced from the sequence of its proteome. Finally, we aimed at finding the precise differences between proteome sequences of prokaryotes from different environments. RESULTS: We analyzed the proteomes of 192 prokaryotes from different habitats. We collected detailed information about the optimal growth conditions of each microorganism. Furthermore, we selected 42 physico-chemical properties of amino acids and computed their values for each proteome. Further, on the same set of features we applied two fundamentally different machine learning methods, Support Vector Machines and Random Forests, to successfully classify between bacteria and archaea, halophiles and non-halophiles, as well as mesophiles, thermophiles and mesothermophiles. Finally, we performed feature selection by using Random Forests. CONCLUSIONS: To our knowledge, this is the first time that three different classification cases (domain of life, halophilicity and thermophilicity) of proteome adaptation are successfully performed with the same set of 42 features. The characteristic features of a specific adaptation constitute a signature that may help understanding the mechanisms of adaptation to extreme environments.

    Correlated Evolution of Nearby Residues in Drosophilid Proteins

    Here we investigate the correlations between coding sequence substitutions as a function of their separation along the protein sequence. We consider both substitutions between the reference genomes of several Drosophilids as well as polymorphisms in a population sample of Zimbabwean Drosophila melanogaster. We find that amino acid substitutions are “clustered” along the protein sequence, that is, the frequency of additional substitutions is strongly enhanced within ≈10 residues of a first such substitution. No such clustering is observed for synonymous substitutions, supporting a “correlation length” associated with selection on proteins as the causative mechanism. Clustering is stronger between substitutions that arose in the same lineage than it is between substitutions that arose in different lineages. We consider several possible origins of clustering, concluding that epistasis (interactions between amino acids within a protein that affect function) and positional heterogeneity in the strength of purifying selection are primarily responsible. The role of epistasis is directly supported by the tendency of nearby substitutions that arose on the same lineage to preserve the total charge of the residues within the correlation length and by the preferential cosegregation of neighboring derived alleles in our population sample. We interpret the observed length scale of clustering as a statistical reflection of the functional locality (or modularity) of proteins: amino acids that are near each other on the protein backbone are more likely to contribute to, and collaborate toward, a common subfunction

    Human-Specific Evolution and Adaptation Led to Major Qualitative Differences in the Variable Receptors of Human and Chimpanzee Natural Killer Cells

    Natural killer (NK) cells serve essential functions in immunity and reproduction. Diversifying these functions within individuals and populations are rapidly-evolving interactions between highly polymorphic major histocompatibility complex (MHC) class I ligands and variable NK cell receptors. Specific to simian primates is the family of Killer cell Immunoglobulin-like Receptors (KIR), which recognize MHC class I and associate with a range of human diseases. Because KIR have considerable species-specificity and are lacking from common animal models, we performed extensive comparison of the systems of KIR and MHC class I interaction in humans and chimpanzees. Although of similar complexity, they differ in genomic organization, gene content, and diversification mechanisms, mainly because of human-specific specialization in the KIR that recognizes the C1 and C2 epitopes of MHC-B and -C. Humans uniquely focused KIR recognition on MHC-C, while losing C1-bearing MHC-B. Reversing this trend, C1-bearing HLA-B46 was recently driven to unprecedented high frequency in Southeast Asia. Chimpanzees have a variety of ancient, avid, and predominantly inhibitory receptors, whereas human receptors are fewer, recently evolved, and combine avid inhibitory receptors with attenuated activating receptors. These differences accompany human-specific evolution of the A and B haplotypes that are under balancing selection and differentially function in defense and reproduction. Our study shows how the qualitative differences that distinguish the human and chimpanzee systems of KIR and MHC class I predominantly derive from adaptations on the human line in response to selective pressures placed on human NK cells by the competing needs of defense and reproduction