49 research outputs found

    Effect of Environmental and Spatial Factors on the Phylogenetic and Functional Diversity of the Mediterranean Tree Communities of Europe

    Get PDF
    The tree flora of the Mediterranean Basin contains an outstanding taxonomic richness and a high proportion of endemic taxa. Contrary to other regions of the Mediterranean biome, a comprehensive phylogenetic analysis of the relationship between phylogenetic diversity, trait diversity and environmental factors in a spatial ecological context is lacking. We inferred the first calibrated phylogeny of 203 native tree species occurring in the European Mediterranean Basin based on 12 DNA regions. Using a set of four functional traits, we computed phylogenetic diversity for all 10,042 grid cells of 10 × 10 km spatial resolution to completely cover Mediterranean Europe. Then, we tested the spatial influence of environmental factors on tree diversity. Our results suggest that the nature of the relationship between traits and phylogeny varies among the different studied traits and according to the evolutionary distance considered. Phylogenetic diversity and functional diversity of European Mediterranean trees correlated strongly with species richness. High values of these diversity indices were located in the north of the study area, at high altitude, and minimum temperature of the coldest month. In contrast, the two phylogenetic indices that were not correlated with species richness (Mean Phylogenetic Distance, Phylogenetic Species Variability) were located in the south of the study area and were positively correlated with high altitude, soil organic carbon stock and sand soil texture. Our study provides support for the use of phylogenies in conservation biology to assess ecosystem functioning, and provides insights for the implementation of sustainable forest ecosystem management

    Gene expression signatures in motor neurone disease fibroblasts reveal dysregulation of metabolism, hypoxia-response and RNA processing functions

    Get PDF
    Aims Amyotrophic lateral sclerosis (ALS) and primary lateral sclerosis (PLS) are two syndromic variants within the motor neurone disease spectrum. As PLS and most ALS cases are sporadic (SALS), this limits the availability of cellular models for investigating pathogenic mechanisms and therapeutic targets. The aim of this study was to use gene expression profiling to evaluate fibroblasts as cellular models for SALS and PLS, to establish whether dysregulated biological processes recapitulate those seen in the central nervous system and to elucidate pathways that distinguish the clinically defined variants of SALS and PLS. Methods Microarray analysis was performed on fibroblast RNA and differentially expressed genes identified. Genes in enriched biological pathways were validated by quantitative PCR and functional assays performed to establish the effect of altered RNA levels on the cellular processes. Results Gene expression profiling demonstrated that whilst there were many differentially expressed genes in common between SALS and PLS fibroblasts, there were many more expressed specifically in the SALS fibroblasts, including those involved in RNA processing and the stress response. Functional analysis of the fibroblasts confirmed a significant decrease in miRNA production and a reduced response to hypoxia in SALS fibroblasts. Furthermore, metabolic gene changes seen in SALS, many of which were also evident in PLS fibroblasts, resulted in dysfunctional cellular respiration. Conclusions The data demonstrate that fibroblasts can act as cellular models for ALS and PLS, by establishing the transcriptional changes in known pathogenic pathways that confer subsequent functional effects and potentially highlight targets for therapeutic intervention

    Particle Swarm Optimization with Reinforcement Learning for the Prediction of CpG Islands in the Human Genome

    Get PDF
    BACKGROUND: Regions with abundant GC nucleotides, a high CpG number, and a length greater than 200 bp in a genome are often referred to as CpG islands. These islands are usually located in the 5' end of genes. Recently, several algorithms for the prediction of CpG islands have been proposed. METHODOLOGY/PRINCIPAL FINDINGS: We propose here a new method called CPSORL to predict CpG islands, which consists of a complement particle swarm optimization algorithm combined with reinforcement learning to predict CpG islands more reliably. Several CpG island prediction tools equipped with the sliding window technique have been developed previously. However, the quality of the results seems to rely too much on the choices that are made for the window sizes, and thus these methods leave room for improvement. CONCLUSIONS/SIGNIFICANCE: Experimental results indicate that CPSORL provides results of a higher sensitivity and a higher correlation coefficient in all selected experimental contigs than the other methods it was compared to (CpGIS, CpGcluster, CpGProd and CpGPlot). A higher number of CpG islands were identified in chromosomes 21 and 22 of the human genome than with the other methods from the literature. CPSORL also achieved the highest coverage rate (3.4%). CPSORL is an application for identifying promoter and TSS regions associated with CpG islands in entire human genomic. When compared to CpGcluster, the islands predicted by CPSORL covered a larger region in the TSS (12.2%) and promoter (26.1%) region. If Alu sequences are considered, the islands predicted by CPSORL (Alu) covered a larger TSS (40.5%) and promoter (67.8%) region than CpGIS. Furthermore, CPSORL was used to verify that the average methylation density was 5.33% for CpG islands in the entire human genome

    Responsiveness of genes to manipulation of transcription factors in ES cells is associated with histone modifications and tissue specificity

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In addition to determining static states of gene expression (high vs. low), it is important to characterize their dynamic status. For example, genes with H3K27me3 chromatin marks are not only suppressed but also poised for activation. However, the responsiveness of genes to perturbations has never been studied systematically. To distinguish gene responses to specific factors from responsiveness in general, it is necessary to analyze gene expression profiles of cells responding to a large variety of disturbances, and such databases did not exist before.</p> <p>Results</p> <p>We estimated the responsiveness of all genes in mouse ES cells using our recently published database on expression change after controlled induction of 53 transcription factors (TFs) and other genes. Responsive genes (<it>N </it>= 4746), which were readily upregulated or downregulated depending on the kind of perturbation, mostly have regulatory functions and a propensity to become tissue-specific upon differentiation. Tissue-specific expression was evaluated on the basis of published (GNF) and our new data for 15 organs and tissues. Non-responsive genes (<it>N </it>= 9562), which did not change their expression much following any perturbation, were enriched in housekeeping functions. We found that TF-responsiveness in ES cells is the best predictor known for tissue-specificity in gene expression. Among genes with CpG islands, high responsiveness is associated with H3K27me3 chromatin marks, and low responsiveness is associated with H3K36me3 chromatin, stronger tri-methylation of H3K4, binding of E2F1, and GABP binding motifs in promoters.</p> <p>Conclusions</p> <p>We thus propose the responsiveness of expression to perturbations as a new way to define the dynamic status of genes, which brings new insights into mechanisms of regulation of gene expression and tissue specificity.</p

    Gradual transition from mosaic to global DNA methylation patterns during deuterostome evolution

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>DNA methylation by the Dnmt family occurs in vertebrates and invertebrates, including ascidians, and is thought to play important roles in gene regulation and genome stability, especially in vertebrates. However, the global methylation patterns of vertebrates and invertebrates are distinctive. Whereas almost all CpG sites are methylated in vertebrates, with the exception of those in CpG islands, the ascidian genome contains approximately equal amounts of methylated and unmethylated regions. Curiously, methylation status can be reliably estimated from the local frequency of CpG dinucleotides in the ascidian genome. Methylated and unmethylated regions tend to have few and many CpG sites, respectively, consistent with our knowledge of the methylation status of CpG islands and other regions in mammals. However, DNA methylation patterns and levels in vertebrates and invertebrates have not been analyzed in the same way.</p> <p>Results</p> <p>Using a new computational methodology based on the decomposition of the bimodal distributions of methylated and unmethylated regions, we estimated the extent of the global methylation patterns in a wide range of animals. We then examined the epigenetic changes <it>in silico </it>along the phylogenetic tree. We observed a gradual transition from fractional to global patterns of methylation in deuterostomes, rather than a clear demarcation between vertebrates and invertebrates. When we applied this methodology to six piscine genomes, some of which showed features similar to those of invertebrates.</p> <p>Conclusions</p> <p>The mammalian global DNA methylation pattern was probably not acquired at an early stage of vertebrate evolution, but gradually expanded from that of a more ancient organism.</p

    Comprehensive analysis of the base composition around the transcription start site in Metazoa

    Get PDF
    BACKGROUND: The transcription start site of a metazoan gene remains poorly understood, mostly because there is no clear signal present in all genes. Now that several sequenced metazoan genomes have been annotated, we have been able to compare the base composition around the transcription start site for all annotated genes across multiple genomes. RESULTS: The most prominent feature in the base compositions is a significant local variation in G+C content over a large region around the transcription start site. The change is present in all animal phyla but the extent of variation is different between distinct classes of vertebrates, and the shape of the variation is completely different between vertebrates and arthropods. Furthermore, the height of the variation correlates with CpG frequencies in vertebrates but not in invertebrates and it also correlates with gene expression, especially in mammals. We also detect GC and AT skews in all clades (where %G is not equal to %C or %A is not equal to %T respectively) but these occur in a more confined region around the transcription start site and in the coding region. CONCLUSIONS: The dramatic changes in nucleotide composition in humans are a consequence of CpG nucleotide frequencies and of gene expression, the changes in Fugu could point to primordial CpG islands, and the changes in the fly are of a totally different kind and unrelated to dinucleotide frequencies

    Assessment of clusters of transcription factor binding sites in relationship to human promoter, CpG islands and gene expression

    Get PDF
    BACKGROUND: Gene expression is regulated mainly by transcription factors (TFs) that interact with regulatory cis-elements on DNA sequences. To identify functional regulatory elements, computer searching can predict TF binding sites (TFBS) using position weight matrices (PWMs) that represent positional base frequencies of collected experimentally determined TFBS. A disadvantage of this approach is the large output of results for genomic DNA. One strategy to identify genuine TFBS is to utilize local concentrations of predicted TFBS. It is unclear whether there is a general tendency for TFBS to cluster at promoter regions, although this is the case for certain TFBS. Also unclear is the identification of TFs that have TFBS concentrated in promoters and to what level this occurs. This study hopes to answer some of these questions. RESULTS: We developed the cluster score measure to evaluate the correlation between predicted TFBS clusters and promoter sequences for each PWM. Non-promoter sequences were used as a control. Using the cluster score, we identified a PWM group called PWM-PCP, in which TFBS clusters positively correlate with promoters, and another PWM group called PWM-NCP, in which TFBS clusters negatively correlate with promoters. The PWM-PCP group comprises 47% of the 199 vertebrate PWMs, while the PWM-NCP group occupied 11 percent. After reducing the effect of CpG islands (CGI) against the clusters using partial correlation coefficients among three properties (promoter, CGI and predicted TFBS cluster), we identified two PWM groups including those strongly correlated with CGI and those not correlated with CGI. CONCLUSION: Not all PWMs predict TFBS correlated with human promoter sequences. Two main PWM groups were identified: (1) those that show TFBS clustered in promoters associated with CGI, and (2) those that show TFBS clustered in promoters independent of CGI. Assessment of PWM matches will allow more positive interpretation of TFBS in regulatory regions

    Analysis of CpG methylation sites and CGI among human papillomavirus DNA genomes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Human Papillomavirus (HPV) genome is divided into early and late coding sequences, including 8 open reading frames (ORFs) and a regulatory region (LCR). Viral gene expression may be regulated through epigenetic mechanisms, including cytosine methylation at CpG dinucleotides. We have analyzed the distribution of CpG sites and CpG islands/clusters (CGI) among 92 different HPV genomes grouped in function of their preferential tropism: cutaneous or mucosal. We calculated the proportion of CpG sites (PCS) for each ORF and calculated the expected CpG values for each viral type.</p> <p>Results</p> <p>CpGs are underrepresented in viral genomes. We found a positive correlation between CpG observed and expected values, with mucosal high-risk (HR) virus types showing the smallest O/E ratios. The ranges of the PCS were similar for most genomic regions except <it>E4</it>, where the majority of CpGs are found within islands/clusters. At least one CGI belongs to each <it>E2/E4 </it>region. We found positive correlations between PCS for each viral ORF when compared with the others, except for the LCR against four ORFs and <it>E6 </it>against three other ORFs. The distribution of CpG islands/clusters among HPV groups is heterogeneous and mucosal HR-HPV types exhibit both lower number and shorter island sizes compared to cutaneous and mucosal Low-risk (LR) HPVs (all of them significantly different).</p> <p>Conclusions</p> <p>There is a difference between viral and cellular CpG underrepresentation. There are significant correlations between complete genome PCS and a lack of correlations between several genomic region pairs, especially those involving LCR and <it>E6</it>. <it>L2 </it>and <it>L1 </it>ORF behavior is opposite to that of oncogenes <it>E6 </it>and <it>E7</it>. The first pair possesses relatively low numbers of CpG sites clustered in CGIs while the oncogenes possess a relatively high number of CpG sites not associated to CGIs. In all HPVs, <it>E2/E4 </it>is the only region with at least one CGI and shows a higher content of CpG sites in every HPV type with an identified <it>E4</it>. The mucosal HR-HPVs show either the shortest CGI size, followed by the mucosal LR-HPVs and lastly by the cutaneous viral subgroup, and a trend to the lowest CGI number, followed by the cutaneous viral subgroup and lastly by the mucosal LR-HPVs.</p

    High Sensitivity TSS Prediction: Estimates of Locations Where TSS Cannot Occur

    Get PDF
    Although transcription in mammalian genomes can initiate from various genomic positions (e.g., 3′UTR, coding exons, etc.), most locations on genomes are not prone to transcription initiation. It is of practical and theoretical interest to be able to estimate such collections of non-TSS locations (NTLs). The identification of large portions of NTLs can contribute to better focusing the search for TSS locations and thus contribute to promoter and gene finding. It can help in the assessment of 5′ completeness of expressed sequences, contribute to more successful experimental designs, as well as more accurate gene annotation.Using comprehensive collections of Cap Analysis of Gene Expression (CAGE) and other transcript data from mouse and human genomes, we developed a methodology that allows us, by performing computational TSS prediction with very high sensitivity, to annotate, with a high accuracy in a strand specific manner, locations of mammalian genomes that are highly unlikely to harbor transcription start sites (TSSs). The properties of the immediate genomic neighborhood of 98,682 accurately determined mouse and 113,814 human TSSs are used to determine features that distinguish genomic transcription initiation locations from those that are not likely to initiate transcription. In our algorithm we utilize various constraining properties of features identified in the upstream and downstream regions around TSSs, as well as statistical analyses of these surrounding regions.

    Detailed Analysis of <em>ITPR1 </em>Missense Variants Guides Diagnostics and Therapeutic Design

    Get PDF
    \ua9 2023 The Authors. Movement Disorders published by Wiley Periodicals LLC on behalf of International Parkinson and Movement Disorder Society.Background: The ITPR1 gene encodes the inositol 1,4,5-trisphosphate (IP3) receptor type 1 (IP3R1), a critical player in cerebellar intracellular calcium signaling. Pathogenic missense variants in ITPR1 cause congenital spinocerebellar ataxia type 29 (SCA29), Gillespie syndrome (GLSP), and severe pontine/cerebellar hypoplasia. The pathophysiological basis of the different phenotypes is poorly understood. Objectives: We aimed to identify novel SCA29 and GLSP cases to define core phenotypes, describe the spectrum of missense variation across ITPR1, standardize the ITPR1 variant nomenclature, and investigate disease progression in relation to cerebellar atrophy. Methods: Cases were identified using next-generation sequencing through the Deciphering Developmental Disorders study, the 100,000 Genomes project, and clinical collaborations. ITPR1 alternative splicing in the human cerebellum was investigated by quantitative polymerase chain reaction. Results: We report the largest, multinational case series of 46 patients with 28 unique ITPR1 missense variants. Variants clustered in functional domains of the protein, especially in the N-terminal IP3-binding domain, the carbonic anhydrase 8 (CA8)-binding region, and the C-terminal transmembrane channel domain. Variants outside these domains were of questionable clinical significance. Standardized transcript annotation, based on our ITPR1 transcript expression data, greatly facilitated analysis. Genotype–phenotype associations were highly variable. Importantly, while cerebellar atrophy was common, cerebellar volume loss did not correlate with symptom progression. Conclusions: This dataset represents the largest cohort of patients with ITPR1 missense variants, expanding the clinical spectrum of SCA29 and GLSP. Standardized transcript annotation is essential for future reporting. Our findings will aid in diagnostic interpretation in the clinic and guide selection of variants for preclinical studies. \ua9 2023 The Authors. Movement Disorders published by Wiley Periodicals LLC on behalf of International Parkinson and Movement Disorder Society
    corecore