170 research outputs found

    Knowledge formalization in experience feedback processes : an ontology-based approach

    Get PDF
    Because of the current trend of integration and interoperability of industrial systems, their size and complexity continue to grow making it more difficult to analyze, to understand and to solve the problems that happen in their organizations. Continuous improvement methodologies are powerful tools in order to understand and to solve problems, to control the effects of changes and finally to capitalize knowledge about changes and improvements. These tools involve suitably represent knowledge relating to the concerned system. Consequently, knowledge management (KM) is an increasingly important source of competitive advantage for organizations. Particularly, the capitalization and sharing of knowledge resulting from experience feedback are elements which play an essential role in the continuous improvement of industrial activities. In this paper, the contribution deals with semantic interoperability and relates to the structuring and the formalization of an experience feedback (EF) process aiming at transforming information or understanding gained by experience into explicit knowledge. The reuse of such knowledge has proved to have significant impact on achieving themissions of companies. However, the means of describing the knowledge objects of an experience generally remain informal. Based on an experience feedback process model and conceptual graphs, this paper takes domain ontology as a framework for the clarification of explicit knowledge and know-how, the aim of which is to get lessons learned descriptions that are significant, correct and applicable

    Comprehensive analysis of the base composition around the transcription start site in Metazoa

    Get PDF
    BACKGROUND: The transcription start site of a metazoan gene remains poorly understood, mostly because there is no clear signal present in all genes. Now that several sequenced metazoan genomes have been annotated, we have been able to compare the base composition around the transcription start site for all annotated genes across multiple genomes. RESULTS: The most prominent feature in the base compositions is a significant local variation in G+C content over a large region around the transcription start site. The change is present in all animal phyla but the extent of variation is different between distinct classes of vertebrates, and the shape of the variation is completely different between vertebrates and arthropods. Furthermore, the height of the variation correlates with CpG frequencies in vertebrates but not in invertebrates and it also correlates with gene expression, especially in mammals. We also detect GC and AT skews in all clades (where %G is not equal to %C or %A is not equal to %T respectively) but these occur in a more confined region around the transcription start site and in the coding region. CONCLUSIONS: The dramatic changes in nucleotide composition in humans are a consequence of CpG nucleotide frequencies and of gene expression, the changes in Fugu could point to primordial CpG islands, and the changes in the fly are of a totally different kind and unrelated to dinucleotide frequencies

    Codon usage in vertebrates is associated with a low risk of acquiring nonsense mutations

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Codon usage in genomes is biased towards specific subsets of codons. Codon usage bias affects translational speed and accuracy, and it is associated with the tRNA levels and the GC content of the genome. Spontaneous mutations drive genomes to a low GC content. Active cellular processes are needed to maintain a high GC content, which influences the codon usage of a species. Loss-of-function mutations, such as nonsense mutations, are the molecular basis of many recessive alleles, which can greatly affect the genome of an organism and are the cause of many genetic diseases in humans.</p> <p>Methods</p> <p>We developed an event based model to calculate the risk of acquiring nonsense mutations in coding sequences. Complete coding sequences and genomes of 40 eukaryotes were analyzed for GC and CpG content, codon usage, and the associated risk of acquiring nonsense mutations. We included one species per genus for all eukaryotes with available reference sequence.</p> <p>Results</p> <p>We discovered that the codon usage bias detected in genomes of high GC content decreases the risk of acquiring nonsense mutations (Pearson's <it>r </it>= -0.95; <it>P </it>< 0.0001). In the genomes of all examined vertebrates, including humans, this risk was lower than expected (0.93 ± 0.02; mean ± SD) and lower than the risk in genomes of non-vertebrates (1.02 ± 0.13; <it>P </it>= 0.019).</p> <p>Conclusions</p> <p>While the maintenance of a high GC content is energetically costly, it is associated with a codon usage bias harboring a low risk of acquiring nonsense mutations. The reduced exposure to this risk may contribute to the fitness of vertebrates.</p

    Analysis of CpG methylation sites and CGI among human papillomavirus DNA genomes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Human Papillomavirus (HPV) genome is divided into early and late coding sequences, including 8 open reading frames (ORFs) and a regulatory region (LCR). Viral gene expression may be regulated through epigenetic mechanisms, including cytosine methylation at CpG dinucleotides. We have analyzed the distribution of CpG sites and CpG islands/clusters (CGI) among 92 different HPV genomes grouped in function of their preferential tropism: cutaneous or mucosal. We calculated the proportion of CpG sites (PCS) for each ORF and calculated the expected CpG values for each viral type.</p> <p>Results</p> <p>CpGs are underrepresented in viral genomes. We found a positive correlation between CpG observed and expected values, with mucosal high-risk (HR) virus types showing the smallest O/E ratios. The ranges of the PCS were similar for most genomic regions except <it>E4</it>, where the majority of CpGs are found within islands/clusters. At least one CGI belongs to each <it>E2/E4 </it>region. We found positive correlations between PCS for each viral ORF when compared with the others, except for the LCR against four ORFs and <it>E6 </it>against three other ORFs. The distribution of CpG islands/clusters among HPV groups is heterogeneous and mucosal HR-HPV types exhibit both lower number and shorter island sizes compared to cutaneous and mucosal Low-risk (LR) HPVs (all of them significantly different).</p> <p>Conclusions</p> <p>There is a difference between viral and cellular CpG underrepresentation. There are significant correlations between complete genome PCS and a lack of correlations between several genomic region pairs, especially those involving LCR and <it>E6</it>. <it>L2 </it>and <it>L1 </it>ORF behavior is opposite to that of oncogenes <it>E6 </it>and <it>E7</it>. The first pair possesses relatively low numbers of CpG sites clustered in CGIs while the oncogenes possess a relatively high number of CpG sites not associated to CGIs. In all HPVs, <it>E2/E4 </it>is the only region with at least one CGI and shows a higher content of CpG sites in every HPV type with an identified <it>E4</it>. The mucosal HR-HPVs show either the shortest CGI size, followed by the mucosal LR-HPVs and lastly by the cutaneous viral subgroup, and a trend to the lowest CGI number, followed by the cutaneous viral subgroup and lastly by the mucosal LR-HPVs.</p

    Allele-Specific, Age-Dependent and BMI-Associated DNA Methylation of Human MCHR1

    Get PDF
    Background: Melanin-concentrating hormone receptor 1 (MCHR1) plays a significant role in regulation of energy balance, food intake, physical activity and body weight in humans and rodents. Several association studies for human obesity showed contrary results concerning the SNPs rs133072 (G/A) and rs133073 (T/C), which localize to the first exon of MCHR1. The variations constitute two main haplotypes (GT, AC). Both SNPs affect CpG dinucleotides, whereby each haplotype contains a potential methylation site at one of the two SNP positions. In addition, 15 CpGs in close vicinity of these SNPs constitute a weak CpG island. Here, we studied whether DNA methylation in this sequence context may contribute to population- and age-specific effects of MCHR1 alleles in obesity. \ud Principal Findings: We analyzed DNA methylation of a 315 bp region of MCHR1 encompassing rs133072 and rs133073 and the CpG island in blood samples of 49 individuals by bisulfite sequencing. The AC haplotype shows a significantly higher methylation level than the GT haplotype. This allele-specific methylation is age-dependent. In young individuals (20â\u80\u9330 years) the difference in DNA methylation between haplotypes is significant; whereas in individuals older than 60 years it is not detectable. Interestingly, the GT allele shows a decrease in methylation status with increasing BMI, whereas the methylation of the AC allele is not associated with this phenotype. Heterozygous lymphoblastoid cell lines show the same pattern of allele-specific DNA methylation. The cell line, which exhibits the highest difference in methylation levels between both haplotypes, also shows allele-specific transcription of MCHR1, which can be abolished by treatment with the DNA\ud methylase inhibitor 5-aza-2&apos;-deoxycytidine.\ud Conclusions:We show that DNA methylation at MCHR1 is allele-specific, age-dependent, BMI-associated and affects transcription. Conceivably, this epigenetic regulation contributes to the age- and/or population specific effects reported for MCHR1 in several human obesity studies.\ud \ud doi: 10.1371/journal.pone.0017711\u

    Adaptive Evolution of the Lactose Utilization Network in Experimentally Evolved Populations of Escherichia coli

    Get PDF
    Adaptation to novel environments is often associated with changes in gene regulation. Nevertheless, few studies have been able both to identify the genetic basis of changes in regulation and to demonstrate why these changes are beneficial. To this end, we have focused on understanding both how and why the lactose utilization network has evolved in replicate populations of Escherichia coli. We found that lac operon regulation became strikingly variable, including changes in the mode of environmental response (bimodal, graded, and constitutive), sensitivity to inducer concentration, and maximum expression level. In addition, some classes of regulatory change were enriched in specific selective environments. Sequencing of evolved clones, combined with reconstruction of individual mutations in the ancestral background, identified mutations within the lac operon that recapitulate many of the evolved regulatory changes. These mutations conferred fitness benefits in environments containing lactose, indicating that the regulatory changes are adaptive. The same mutations conferred different fitness effects when present in an evolved clone, indicating that interactions between the lac operon and other evolved mutations also contribute to fitness. Similarly, changes in lac regulation not explained by lac operon mutations also point to important interactions with other evolved mutations. Together these results underline how dynamic regulatory interactions can be, in this case evolving through mutations both within and external to the canonical lactose utilization network

    Genomic analysis and relatedness of P2-like phages of the Burkholderia cepacia complex

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The <it>Burkholderia cepacia </it>complex (BCC) is comprised of at least seventeen Gram-negative species that cause infections in cystic fibrosis patients. Because BCC bacteria are broadly antibiotic resistant, phage therapy is currently being investigated as a possible alternative treatment for these infections. The purpose of our study was to sequence and characterize three novel BCC-specific phages: KS5 (vB_BceM-KS5 or vB_BmuZ-ATCC 17616), KS14 (vB_BceM-KS14) and KL3 (vB_BamM-KL3 or vB_BceZ-CEP511).</p> <p>Results</p> <p>KS5, KS14 and KL3 are myoviruses with the A1 morphotype. The genomes of these phages are between 32317 and 40555 base pairs in length and are predicted to encode between 44 and 52 proteins. These phages have over 50% of their proteins in common with enterobacteria phage P2 and so can be classified as members of the <it>Peduovirinae </it>subfamily and the "P2-like viruses" genus. The BCC phage proteins similar to those encoded by P2 are predominantly structural components involved in virion morphogenesis. As prophages, KS5 and KL3 integrate into an AMP nucleosidase gene and a threonine tRNA gene, respectively. Unlike other P2-like viruses, the KS14 prophage is maintained as a plasmid. The P2 <it>E+E' </it>translational frameshift site is conserved among these three phages and so they are predicted to use frameshifting for expression of two of their tail proteins. The <it>lysBC </it>genes of KS14 and KL3 are similar to those of P2, but in KS5 the organization of these genes suggests that they may have been acquired via horizontal transfer from a phage similar to λ. KS5 contains two sequence elements that are unique among these three phages: an IS<it>Bmu</it>2-like insertion sequence and a reverse transcriptase gene. KL3 encodes an EcoRII-C endonuclease/methylase pair and Vsr endonuclease that are predicted to function during the lytic cycle to cleave non-self DNA, protect the phage genome and repair methylation-induced mutations.</p> <p>Conclusions</p> <p>KS5, KS14 and KL3 are the first BCC-specific phages to be identified as P2-like. As KS14 has previously been shown to be active against <it>Burkholderia cenocepacia in vivo</it>, genomic characterization of these phages is a crucial first step in the development of these and similar phages for clinical use against the BCC.</p

    Orphan CpG Islands Identify Numerous Conserved Promoters in the Mammalian Genome

    Get PDF
    CpG islands (CGIs) are vertebrate genomic landmarks that encompass the promoters of most genes and often lack DNA methylation. Querying their apparent importance, the number of CGIs is reported to vary widely in different species and many do not co-localise with annotated promoters. We set out to quantify the number of CGIs in mouse and human genomes using CXXC Affinity Purification plus deep sequencing (CAP-seq). We also asked whether CGIs not associated with annotated transcripts share properties with those at known promoters. We found that, contrary to previous estimates, CGI abundance in humans and mice is very similar and many are at conserved locations relative to genes. In each species CpG density correlates positively with the degree of H3K4 trimethylation, supporting the hypothesis that these two properties are mechanistically interdependent. Approximately half of mammalian CGIs (>10,000) are “orphans” that are not associated with annotated promoters. Many orphan CGIs show evidence of transcriptional initiation and dynamic expression during development. Unlike CGIs at known promoters, orphan CGIs are frequently subject to DNA methylation during development, and this is accompanied by loss of their active promoter features. In colorectal tumors, however, orphan CGIs are not preferentially methylated, suggesting that cancer does not recapitulate a developmental program. Human and mouse genomes have similar numbers of CGIs, over half of which are remote from known promoters. Orphan CGIs nevertheless have the characteristics of functional promoters, though they are much more likely than promoter CGIs to become methylated during development and hence lose these properties. The data indicate that orphan CGIs correspond to previously undetected promoters whose transcriptional activity may play a functional role during development

    The Impact of Recombination on Nucleotide Substitutions in the Human Genome

    Get PDF
    Unraveling the evolutionary forces responsible for variations of neutral substitution patterns among taxa or along genomes is a major issue for detecting selection within sequences. Mammalian genomes show large-scale regional variations of GC-content (the isochores), but the substitution processes at the origin of this structure are poorly understood. We analyzed the pattern of neutral substitutions in 1 Gb of primate non-coding regions. We show that the GC-content toward which sequences are evolving is strongly negatively correlated to the distance to telomeres and positively correlated to the rate of crossovers (R2 = 47%). This demonstrates that recombination has a major impact on substitution patterns in human, driving the evolution of GC-content. The evolution of GC-content correlates much more strongly with male than with female crossover rate, which rules out selectionist models for the evolution of isochores. This effect of recombination is most probably a consequence of the neutral process of biased gene conversion (BGC) occurring within recombination hotspots. We show that the predictions of this model fit very well with the observed substitution patterns in the human genome. This model notably explains the positive correlation between substitution rate and recombination rate. Theoretical calculations indicate that variations in population size or density in recombination hotspots can have a very strong impact on the evolution of base composition. Furthermore, recombination hotspots can create strong substitution hotspots. This molecular drive affects both coding and non-coding regions. We therefore conclude that along with mutation, selection and drift, BGC is one of the major factors driving genome evolution. Our results also shed light on variations in the rate of crossover relative to non-crossover events, along chromosomes and according to sex, and also on the conservation of hotspot density between human and chimp
    corecore