28 research outputs found

    Evolution of DNA Methylation Patterns in the Brassicaceae is Driven by Differences in Genome Organization

    No full text
    <div><p>DNA methylation is an ancient molecular modification found in most eukaryotes. In plants, DNA methylation is not only critical for transcriptionally silencing transposons, but can also affect phenotype by altering expression of protein coding genes. The extent of its contribution to phenotypic diversity over evolutionary time is, however, unclear, because of limited stability of epialleles that are not linked to DNA mutations. To dissect the relative contribution of DNA methylation to transposon surveillance and host gene regulation, we leveraged information from three species in the Brassicaceae that vary in genome architecture, <i>Capsella rubella</i>, <i>Arabidopsis lyrata</i>, and <i>Arabidopsis thaliana</i>. We found that the lineage-specific expansion and contraction of transposon and repeat sequences is the main driver of interspecific differences in DNA methylation. The most heavily methylated portions of the genome are thus not conserved at the sequence level. Outside of repeat-associated methylation, there is a surprising degree of conservation in methylation at single nucleotides located in gene bodies. Finally, dynamic DNA methylation is affected more by tissue type than by environmental differences in all species, but these responses are not conserved. The majority of DNA methylation variation between species resides in hypervariable genomic regions, and thus, in the context of macroevolution, is of limited phenotypic consequence.</p></div

    Conservation of methylated regions (MR).

    No full text
    <p>A) Annotation of all bases in MRs. B) Fraction of bases in MRs that occur either within or outside of the three-way whole genome alignments. C) Fraction of MR bases found within three-way whole genome alignments that occur in one, two, or three species. D) Conservation of MRs in the absence of sequence alignments. The total number of orthologous genes overlapping an MR in one, two, or three species is given, with location of MR overlap separated by genomic feature. Upstream region was defined as 1 kb before the start codon. Asterisk indicates two or three-way sharing of MRs that exceeds permutation values.</p

    allele_freq

    No full text
    Directory containing genotype data for F2 pools (reduced representation sequencing)

    allele_freq_BSA

    No full text
    Directory containing genotype data for F2 pools (whole genome sequencing)

    Site-level comparison of methylation.

    No full text
    <p>A) Annotation of all cytosines within a species (covered C) compared to the annotation of cytosines found in the three-way whole genome alignments (aligned C). B) Total number of mC by context for aligned site classes. Site classes are as follows: mC - methylated sites within a species. Conserved (3 species) - sites that are methylated in all three species. Gain - sites that are methylated in a single species. Loss - sites that have lost methylation in a single species. C) Total number of conserved mC and non-conserved mC by context. D) Density plot describing the distribution of variable sites in the genome (10 kb windows). For each window the following statistic was calculated: species-specific methylation gains/sum of species-specific methylation gains and losses. E) Windows with a high density of gains have more transposons and repetitive sequences. Density of transposons plotted against density of methylation gains (10 kb window). F) Methylation gains are enriched at the beginning and end of genes. Fraction of mC in each site class is plotted by exon position in a gene.</p

    Genomic distribution of DNA methylation.

    No full text
    <p>A) Circos plots <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1004785#pgen.1004785-Krzywinski1" target="_blank">[74]</a> of <i>C. rubella</i>, <i>A. lyrata</i>, and <i>A. thaliana</i>. Chromosome number is indicated on the inner circle. Data is plotted for 500 kb windows, except for sequencing coverage (100 kb). Gene expression (RPKM) was calculated using the sum of the expression counts from all samples within a species.</p

    Code

    No full text
    Directory containing scripts and associated input files

    Impact of repeat expansion on DNA methylation at genomic features.

    No full text
    <p>A) Feature annotation of all cytosines and methylated cytosines. Annotations are shown for all three contexts. B) Genome average of methylation rates for each genomic feature. Methylation rates are normalized to the outgroup species <i>C. rubella</i>. C) Fraction of intron bases annotated as transposable element or other repeat sequence. D) Total number of intron bases (millions) that are annotated as a particular transposable element class. E) Methylation rate distribution across gene bodies of orthologous genes and flanking sequences (1.5 kb up - and downstream). Orthologs that lacked methylation in both their gene body and flanking sequences were excluded. Distributions are plotted by context.</p

    Centromere loss impacts DNA methylation in <i>A. thaliana</i>.

    No full text
    <p>A) Orthologous genes, anchored on the <i>C. rubella</i> genome, were used to calculate several statistics to investigate the impact of centromere loss on DNA methylation in <i>A. thaliana</i>. <i>Capsella rubella</i> centromeres 2, 4, and 8 (grey boxes) were lost during chromosomal fusion events that occurred on the branch leading to <i>A. thaliana</i>. Gene density, repeat density, and methylation densities were calculated for a 20 Kb window centered on the midpoint of each orthologous gene (10 kb up- and 10 kb downstream). Gene density and repeat density were calculated as fractions of each 20 kb window annotated as either a gene (ATG to STOP) or a repeat. Methylation densities were calculated as fractions of cytosines methylated in each context. Gene body methylation and gene expression (RPKM) were calculated for each ortholog. Gene body methylation was calculated as the fraction of methylated CG sites in a gene (ATG to STOP). Gene expression data from all samples within a species were used to calculate the RPKM values. For each statistic, local linear regression was performed to smooth the data in 250 kb bins. Smoothing parameter was relative to chromosome length.</p

    Species gene expression and mC relationships.

    No full text
    <p>A) Principal component analysis on fitted gene expression values (log<sub>2</sub>) and B) mC rates at aligned methylated positions. All contexts are considered (see <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1004785#pgen-1004785-g006" target="_blank">Fig. 6B,C</a> and <a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1004785#pgen.1004785.s024" target="_blank">Table S10</a> for further description of mC sites).</p
    corecore