50 research outputs found

    GenoQuery: a new querying module for functional annotation in a genomic warehouse

    Get PDF
    Motivation: We have to cope with both a deluge of new genome sequences and a huge amount of data produced by high-throughput approaches used to exploit these genomic features. Crossing and comparing such heterogeneous and disparate data will help improving functional annotation of genomes. This requires designing elaborate integration systems such as warehouses for storing and querying these data

    Analysis of Combinatorial Regulation: Scaling of Partnerships between Regulators with the Number of Governed Targets

    Get PDF
    Through combinatorial regulation, regulators partner with each other to control common targets and this allows a small number of regulators to govern many targets. One interesting question is that given this combinatorial regulation, how does the number of regulators scale with the number of targets? Here, we address this question by building and analyzing co-regulation (co-transcription and co-phosphorylation) networks that describe partnerships between regulators controlling common genes. We carry out analyses across five diverse species: Escherichia coli to human. These reveal many properties of partnership networks, such as the absence of a classical power-law degree distribution despite the existence of nodes with many partners. We also find that the number of co-regulatory partnerships follows an exponential saturation curve in relation to the number of targets. (For E. coli and Bacillus subtilis, only the beginning linear part of this curve is evident due to arrangement of genes into operons.) To gain intuition into the saturation process, we relate the biological regulation to more commonplace social contexts where a small number of individuals can form an intricate web of connections on the internet. Indeed, we find that the size of partnership networks saturates even as the complexity of their output increases. We also present a variety of models to account for the saturation phenomenon. In particular, we develop a simple analytical model to show how new partnerships are acquired with an increasing number of target genes; with certain assumptions, it reproduces the observed saturation. Then, we build a more general simulation of network growth and find agreement with a wide range of real networks. Finally, we perform various down-sampling calculations on the observed data to illustrate the robustness of our conclusions

    Miiuy Croaker Hepcidin Gene and Comparative Analyses Reveal Evidence for Positive Selection

    Get PDF
    Hepcidin antimicrobial peptide (HAMP) is a small cysteine-rich peptide and a key molecule of the innate immune system against bacterial infections. Molecular cloning and genomic characterization of HAMP gene in the miiuy croaker (Miichthys miiuy) were reported in this study. The miiuy croaker HAMP was predicted to encode a prepropeptide of 99 amino acids, a tentative RX(K/R)R cleavage motif and eight characteristic cysteine residues were also identified. The gene organization is also similar to corresponding genes in mammals and fish consisting of three exons and two introns. Sequence polymorphism analysis showed that only two different sequences were identified and encoded two proteins in six individuals. As reported for most other species, the expression level was highest in liver and an up-regulation of transcription was seen in spleen, intestine and kidney examined at 24 h after injection of pathogenic bacteria, Vibrio anguillarum, the expression pattern implied that miiuy croaker HAMP is an important component of the first line defense against invading pathogens. In addition, we report on the underlying mechanism that maintains sequences diversity among fish and mammalian species, respectively. A series of site-model tests implemented in the CODEML program revealed that moderate positive Darwinian selection is likely to cause the molecular evolution in the fish HAMP2 genes and it also showed that the fish HAMP1 genes and HAMP2 genes under different selection pressures

    Phylogenetics and evolution of Su(var)3-9 SET genes in land plants: rapid diversification in structure and function

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Plants contain numerous <it>Su(var)3-9 </it>homologues (<it>SUVH</it>) and related (<it>SUVR</it>) genes, some of which await functional characterization. Although there have been studies on the evolution of plant <it>Su(var)3-9 SET </it>genes, a systematic evolutionary study including major land plant groups has not been reported. Large-scale phylogenetic and evolutionary analyses can help to elucidate the underlying molecular mechanisms and contribute to improve genome annotation.</p> <p>Results</p> <p>Putative orthologs of plant Su(var)3-9 SET protein sequences were retrieved from major representatives of land plants. A novel clustering that included most members analyzed, henceforth referred to as core <it>Su(var)3-9 </it>homologues and related (<it>cSUVHR</it>) gene clade, was identified as well as all orthologous groups previously identified. Our analysis showed that plant Su(var)3-9 SET proteins possessed a variety of domain organizations, and can be classified into five types and ten subtypes. Plant <it>Su(var)3-9 SET </it>genes also exhibit a wide range of gene structures among different paralogs within a family, even in the regions encoding conserved PreSET and SET domains. We also found that the majority of SUVH members were intronless and formed three subclades within the SUVH clade.</p> <p>Conclusions</p> <p>A detailed phylogenetic analysis of the plant <it>Su(var)3-9 SET g</it>enes was performed. A novel deep phylogenetic relationship including most plant <it>Su(var)3-9 SET </it>genes was identified. Additional domains such as SAR, ZnF_C2H2 and WIYLD were early integrated into primordial PreSET/SET/PostSET domain organization. At least three classes of gene structures had been formed before the divergence of <it>Physcomitrella patens </it>(moss) from other land plants. One or multiple retroposition events might have occurred among <it>SUVH </it>genes with the donor genes leading to the V-2 orthologous group. The structural differences among evolutionary groups of plant <it>Su(var)3-9 SET </it>genes with different functions were described, contributing to the design of further experimental studies.</p

    A Global Characterization and Identification of Multifunctional Enzymes

    Get PDF
    Multi-functional enzymes are enzymes that perform multiple physiological functions. Characterization and identification of multi-functional enzymes are critical for communication and cooperation between different functions and pathways within a complex cellular system or between cells. In present study, we collected literature-reported 6,799 multi-functional enzymes and systematically characterized them in structural, functional, and evolutionary aspects. It was found that four physiochemical properties, that is, charge, polarizability, hydrophobicity, and solvent accessibility, are important for characterization of multi-functional enzymes. Accordingly, a combinational model of support vector machine and random forest model was constructed, based on which 6,956 potential novel multi-functional enzymes were successfully identified from the ENZYME database. Moreover, it was observed that multi-functional enzymes are non-evenly distributed in species, and that Bacteria have relatively more multi-functional enzymes than Archaebacteria and Eukaryota. Comparative analysis indicated that the multi-functional enzymes experienced a fluctuation of gene gain and loss during the evolution from S. cerevisiae to H. sapiens. Further pathway analyses indicated that a majority of multi-functional enzymes were well preserved in catalyzing several essential cellular processes, for example, metabolisms of carbohydrates, nucleotides, and amino acids. What’s more, a database of known multi-functional enzymes and a server for novel multi-functional enzyme prediction were also constructed for free access at http://bioinf.xmu.edu.cn/databases/MFEs/index.htm

    Domain architecture evolution of pattern-recognition receptors

    Get PDF
    In animals, the innate immune system is the first line of defense against invading microorganisms, and the pattern-recognition receptors (PRRs) are the key components of this system, detecting microbial invasion and initiating innate immune defenses. Two families of PRRs, the intracellular NOD-like receptors (NLRs) and the transmembrane Toll-like receptors (TLRs), are of particular interest because of their roles in a number of diseases. Understanding the evolutionary history of these families and their pattern of evolutionary changes may lead to new insights into the functioning of this critical system. We found that the evolution of both NLR and TLR families included massive species-specific expansions and domain shuffling in various lineages, which resulted in the same domain architectures evolving independently within different lineages in a process that fits the definition of parallel evolution. This observation illustrates both the dynamics of the innate immune system and the effects of “combinatorially constrained” evolution, where existence of the limited numbers of functionally relevant domains constrains the choices of domain architectures for new members in the family, resulting in the emergence of independently evolved proteins with identical domain architectures, often mistaken for orthologs

    Mutagenesis and Functional Studies with Succinate Dehydrogenase Inhibitors in the Wheat Pathogen Mycosphaerella graminicola

    Get PDF
    A range of novel carboxamide fungicides, inhibitors of the succinate dehydrogenase enzyme (SDH, EC 1.3.5.1) is currently being introduced to the crop protection market. The aim of this study was to explore the impact of structurally distinct carboxamides on target site resistance development and to assess possible impact on fitness

    Symbiodinium Transcriptomes: Genome Insights into the Dinoflagellate Symbionts of Reef-Building Corals

    Get PDF
    Dinoflagellates are unicellular algae that are ubiquitously abundant in aquatic environments. Species of the genus Symbiodinium form symbiotic relationships with reef-building corals and other marine invertebrates. Despite their ecologic importance, little is known about the genetics of dinoflagellates in general and Symbiodinium in particular. Here, we used 454 sequencing to generate transcriptome data from two Symbiodinium species from different clades (clade A and clade B). With more than 56,000 assembled sequences per species, these data represent the largest transcriptomic resource for dinoflagellates to date. Our results corroborate previous observations that dinoflagellates possess the complete nucleosome machinery. We found a complete set of core histones as well as several H3 variants and H2A.Z in one species. Furthermore, transcriptome analysis points toward a low number of transcription factors in Symbiodinium spp. that also differ in the distribution of DNA-binding domains relative to other eukaryotes. In particular the cold shock domain was predominant among transcription factors. Additionally, we found a high number of antioxidative genes in comparison to non-symbiotic but evolutionary related organisms. These findings might be of relevance in the context of the role that Symbiodinium spp. play as coral symbionts
    corecore