355 research outputs found

    Gene & Genome Duplication in Acanthamoeba Polyphaga Mimivirus

    Full text link
    Gene duplication is key to molecular evolution in all three domains of life and may be the first step in the emergence of new gene function. It is a well recognized feature in large DNA viruses, but has not been studied extensively in the largest known virus to date, the recently discovered Acanthamoeba Polyphaga Mimivirus. Here we present a systematic analysis of gene and genome duplication events in the Mimivirus genome. We find that one third of the Mimivirus genes are related to at least one other gene in the Mimivirus genome, either through a large segmental genome duplication event that occurred in the more remote past, either through more recent gene duplication events, which often occur in tandem. This shows that gene and genome duplication played a major role in shaping the Mimivirus genome. Using multiple alignments together with remote homology detection methods based on Hidden Markov Model comparison, we assign putative functions to some of the paralogous gene families. We suggest that a large part of the duplicated Mimivirus gene families are likely to interfere with important host cell processes, such as transcription control, protein degradation, and cell regulatory processes. Our findings support the view that large DNA viruses are complex evolving organisms, possibly deeply rooted within the tree of life, and oppose the paradigm that viral evolution is dominated by lateral gene acquisition, at least in what concerns large DNA viruses

    Mit dem Computer auf der Spur von Mimi, dem Riesenvirus

    Get PDF

    Phydbac "Gene Function Predictor" : a gene annotation tool based on genomic context analysis

    Get PDF
    BACKGROUND: The large amount of completely sequenced genomes allows genomic context analysis to predict reliable functional associations between prokaryotic proteins. Major methods rely on the fact that genes encoding physically interacting partners or members of shared metabolic pathways tend to be proximate on the genome, to evolve in a correlated manner and to be fused as a single sequence in another organism. RESULTS: The new "Gene Function Predictor", linked to the web server Phydbac proposes putative associations between Escherichia coli K-12 proteins derived from a combination of these methods. We show that associations made by this tool are more accurate than linkages found in the other established databases. Predicted assignments to GO categories, based on pre-existing functional annotations of associated proteins are also available. This new database currently holds 9,379 pairwise links at an expected success rate of at least 80%, the 6,466 functional predictions to GO terms derived from these links having a level of accuracy higher than 70%. CONCLUSION: The "Gene Function Predictor" is an automatic tool that aims to help biologists by providing them hypothetical functional predictions out of genomic context characteristics. The "Gene Function predictor" is available at

    Determination of strongly overlapping signaling activity from microarray data

    Get PDF
    BACKGROUND: As numerous diseases involve errors in signal transduction, modern therapeutics often target proteins involved in cellular signaling. Interpretation of the activity of signaling pathways during disease development or therapeutic intervention would assist in drug development, design of therapy, and target identification. Microarrays provide a global measure of cellular response, however linking these responses to signaling pathways requires an analytic approach tuned to the underlying biology. An ongoing issue in pattern recognition in microarrays has been how to determine the number of patterns (or clusters) to use for data interpretation, and this is a critical issue as measures of statistical significance in gene ontology or pathways rely on proper separation of genes into groups. RESULTS: Here we introduce a method relying on gene annotation coupled to decompositional analysis of global gene expression data that allows us to estimate specific activity on strongly coupled signaling pathways and, in some cases, activity of specific signaling proteins. We demonstrate the technique using the Rosetta yeast deletion mutant data set, decompositional analysis by Bayesian Decomposition, and annotation analysis using ClutrFree. We determined from measurements of gene persistence in patterns across multiple potential dimensionalities that 15 basis vectors provides the correct dimensionality for interpreting the data. Using gene ontology and data on gene regulation in the Saccharomyces Genome Database, we identified the transcriptional signatures of several cellular processes in yeast, including cell wall creation, ribosomal disruption, chemical blocking of protein synthesis, and, criticially, individual signatures of the strongly coupled mating and filamentation pathways. CONCLUSION: This works demonstrates that microarray data can provide downstream indicators of pathway activity either through use of gene ontology or transcription factor databases. This can be used to investigate the specificity and success of targeted therapeutics as well as to elucidate signaling activity in normal and disease processes

    metaP-Server: A Web-Based Metabolomics Data Analysis Tool

    Get PDF
    Metabolomics is an emerging field that is based on the quantitative measurement of as many small organic molecules occurring in a biological sample as possible. Due to recent technical advances, metabolomics can now be used widely as an analytical high-throughput technology in drug testing and epidemiological metabolome and genome wide association studies. Analogous to chip-based gene expression analyses, the enormous amount of data produced by modern kit-based metabolomics experiments poses new challenges regarding their biological interpretation in the context of various sample phenotypes. We developed metaP-server to facilitate data interpretation. metaP-server provides automated and standardized data analysis for quantitative metabolomics data, covering the following steps from data acquisition to biological interpretation: (i) data quality checks, (ii) estimation of reproducibility and batch effects, (iii) hypothesis tests for multiple categorical phenotypes, (iv) correlation tests for metric phenotypes, (v) optionally including all possible pairs of metabolite concentration ratios, (vi) principal component analysis (PCA), and (vii) mapping of metabolites onto colored KEGG pathway maps. Graphical output is clickable and cross-linked to sample and metabolite identifiers. Interactive coloring of PCA and bar plots by phenotype facilitates on-line data exploration. For users of commercial metabolomics kits, cross-references to the HMDB, LipidMaps, KEGG, PubChem, and CAS databases are provided. metaP-server is freely accessible at http://metabolomics.helmholtz-muenchen.de/metap2/

    Bipolar disorders in the Arab world: a critical review

    Full text link
    Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/111799/1/nyas12652.pd

    Mimivirus and the emerging concept of "giant" virus

    Full text link
    The recently discovered Acanthamoeba polyphaga Mimivirus is the largest known DNA virus. Its particle size (>400 nm), genome length (1.2 million bp) and large gene repertoire (911 protein coding genes) blur the established boundaries between viruses and parasitic cellular organisms. In addition, the analysis of its genome sequence identified new types of genes not expected to be seen in a virus, such as aminoacyl-tRNA synthetases and other central components of the translation machinery. In this article, we examine how the finding of a giant virus for the first time overlapping with the world of cellular organisms in terms of size and genome complexity might durably influence the way we look at microbial biodiversity, and force us to fundamentally revise our classification of life forms. We propose to introduce the word "girus" to recognize the intermediate status of these giant DNA viruses, the genome complexity of which make them closer to small parasitic prokaryotes than to regular viruses.Comment: Submitted to Virus Researc

    Reductive Genome Evolution from the Mother of Rickettsia

    Get PDF
    The Rickettsia genus is a group of obligate intracellular α-proteobacteria representing a paradigm of reductive evolution. Here, we investigate the evolutionary processes that shaped the genomes of the genus. The reconstruction of ancestral genomes indicates that their last common ancestor contained more genes, but already possessed most traits associated with cellular parasitism. The differences in gene repertoires across modern Rickettsia are mainly the result of differential gene losses from the ancestor. We demonstrate using computer simulation that the propensity of loss was variable across genes during this process. We also analyzed the ratio of nonsynonymous to synonymous changes (Ka/Ks) calculated as an average over large sets of genes to assay the strength of selection acting on the genomes of Rickettsia, Anaplasmataceae, and free-living γ-proteobacteria. As a general trend, Ka/Ks were found to decrease with increasing divergence between genomes. The high Ka/Ks for closely related genomes are probably due to a lag in the removal of slightly deleterious nonsynonymous mutations by natural selection. Interestingly, we also observed a decrease of the rate of gene loss with increasing divergence, suggesting a similar lag in the removal of slightly deleterious pseudogene alleles. For larger divergence (Ks > 0.2), Ka/Ks converge toward similar values indicating that the levels of selection are roughly equivalent between intracellular α-proteobacteria and their free-living relatives. This contrasts with the view that obligate endocellular microorganisms tend to evolve faster as a consequence of reduced effectiveness of selection, and suggests a major role of enhanced background mutation rates on the fast protein divergence in the obligate intracellular α-proteobacteria

    Mendelian inheritance of trimodal CpG methylation sites suggests distal cis-acting genetic effects.

    Get PDF
    Environmentally influenced phenotypes, such as obesity and insulin resistance, can be transmitted over multiple generations. Epigenetic modifications, such as methylation of DNA cytosine-guanine (CpG) pairs, may be carriers of inherited information. At the population level, the methylation state of such "heritable" CpG sites is expected to follow a trimodal distribution, and their mode of inheritance should be Mendelian. Using the Illumina Infinium 450 K DNA methylation array, we determined DNA CpG-methylation in blood cells from a family cohort 123 individuals of Arab ethnicity, including 18 elementary father-mother-child trios, we asked whether Mendelian inheritance of CpG methylation is observed, and most importantly, whether it is independent of any genetic signals. Using 40× whole genome sequencing, we therefore excluded all CpG sites with possibly confounding genetic variants (SNP) within the binding regions of the Illumina probes. We identified a total of 955 CpG sites that displayed a trimodal distribution and confirmed trimodality in a study of 1805 unrelated Caucasians. Of 955 CpG sites, 99.9% observed a strict Mendelian pattern of inheritance and had no SNP within +/-110 nucleotides of the CpG site by design. However, in 97% of these cases a distal cis-acting SNP within a +/-1 Mbp window was found that explained the observed CpG distribution, excluding the hypothesis of epigenetic inheritance for these clear-cut trimodal sites. Using power analysis, we showed that in 46% of all cases, the closest CpG-associated SNP was located more than 1000 bp from the CpG site. Our findings suggest that CpG methylation is maintained over larger genomic distances. Furthermore, nearly half of the SNPs associated with these trimodal sites were also associated with the expression of nearby genes (P = 4.08 × 10(-6)), implying a regulatory effect of these trimodal CpG sites

    Effect of induced hypoglycemia on inflammation and oxidative stress in type 2 diabetes and control subjects

    Get PDF
    Intensive diabetes control has been associated with increased mortality in type 2 diabetes (T2DM); this has been suggested to be due to increased hypoglycemia. We measured hypoglycemia-induced changes in endothelial parameters, oxidative stress markers and inflammation at baseline and after a 24-hour period in type 2 diabetic (T2DM) subjects versus age-matched controls. Case-control study: 10 T2DM and 8 control subjects. Blood glucose was reduced from 5 (90 mg/dl) to hypoglycemic levels of 2.8 mmol/L (50 mg/dl) for 1 hour by incremental hyperinsulinemic clamps using baseline and 24 hour samples. Measures of endothelial parameters, oxidative stress and inflammation at baseline and at 24-hours post hypoglycemia were performed: proteomic (Somalogic) analysis for inflammatory markers complemented by C-reactive protein (hsCRP) measurement, and proteomic markers and urinary isoprostanes for oxidative measures, together with endothelial function. Between baseline and 24 -hours after hypoglycemia, 15 of 140 inflammatory proteins differed in T2DM whilst only 1 of 140 differed in controls; all returned to baseline at 24-hours. However, elevated hsCRP levels were seen at 24-hours in T2DM (2.4 mg/L (1.2–5.4) vs. 3.9 mg/L (1.8–6.1), Baseline vs 24-hours, P < 0.05). In patients with T2DM, between baseline and 24-hour after hypoglycemia, only one of 15 oxidative stress proteins differed and this was not seen in controls. An increase (P = 0.016) from baseline (73.4 ng/mL) to 24 hours after hypoglycemia (91.7 ng/mL) was seen for urinary isoprostanes. Hypoglycemia resulted in inflammatory and oxidative stress markers being elevated in T2DM subjects but not controls 24-hours after the event
    corecore