Oxytricha trifallax macronuclear IDBA assembly
Contigs in gzipped fasta format
Oxytricha trifallax macronuclear PCAP 2.1.8 assembly
Oxytricha trifallax macronuclear PCAP 2.1.8 assembly
Oxytricha trifallax macronuclear PE-Assembler/SSAKE assembly
Contigs in gzipped fasta format
Oxytricha trifallax macronuclear genome fosmids
Oxytricha trifallax macronuclear genome fosmids
Key features of <i>Oxytricha</i> protein-coding nanochromosomes.
<p>Representative nanochromosome features are not drawn to scale, but their lengths are indicated. UTR, untranslated region; UTS, untranscribed region. 3′ UTRs and the subtelomeric signal overlap. The subtelomeric base composition bias signal found on either end of the nanochromosome is shown above the nanochromosome diagram.</p>
Development of the <i>Oxytricha</i> macronuclear genome from the micronuclear genome.
<p>During conjugation of <i>Oxytricha</i> cells, segments of the micronuclear genome (MDSs) are excised and stitched together to form the nanochromosomes of the new macronuclear genome, and the remainder of the micronuclear genome is eliminated (including the IESs interspersed between MDSs). The old macronuclear genome is also degraded during development. The segments that are stitched together may be either in order (e.g., forming nanochromosome 1, on the left) or out of order or inverted (e.g., forming the two forms of nanochromosome 2), in which case they need to be “unscrambled.” Two rounds of DNA amplification produce nanochromosomes at an average copy number of ~1,900 <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Prescott1" target="_blank">[2]</a>. Alternative fragmentation of DNA during nanochromosome development may also occur, irrespective of unscrambling, giving rise to longer (2a) and shorter (2b) nanochromosome isoforms. The mature nanochromosomes are capped on both ends with telomeres.</p>
Comparison of key ciliate macronuclear genomes.
<p>The phylogeny represents the bootstrap consensus of 100 replicates from PhyML (with the HKY85 substitution model) based on a MUSCLE multiple sequence alignment of 18S rRNA genes from seven ciliates (<i>Oxytricha trifallax</i>: FJ545743; <i>Stylonychia lemnae</i>: AJJRB310497; <i>Euplotes crassus</i>: AJJRB310492; <i>Nyctotherus ovalis</i>: AJ222678; <i>Tetrahymena thermophila</i>: M10932; <i>Ichthyophthirius multifiliis</i>: IMU17354; and <i>Paramecium tetraurelia</i>: AB252009) rooted with two other alveolates (<i>Perkinsus marinus</i>: X75762 and <i>Plasmodium falciparum</i>: NC_004325). All bootstrap values are ≥80, except for the node between <i>Nyctotherus</i> and <i>Oxytricha</i>/<i>Stylonychia</i>/<i>Euplotes</i>, which has a bootstrap value of 60. <i>Euplotes</i> and <i>Nyctotherus</i> both have nanochromosomes, like <i>Oxytricha</i>. Other than the genome statistics for <i>Oxytricha trifallax</i>, which were determined in this study, table statistics were obtained from the following sources: <sup>a</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Prescott1" target="_blank">[2]</a>, <sup>b</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Duerr1" target="_blank">[22]</a>, <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Lipps1" target="_blank">[116]</a>, <sup>c</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Nock1" target="_blank">[117]</a>, <sup>d</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Bender1" target="_blank">[99]</a>, <sup>e</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Ricard1" target="_blank">[94]</a>, <sup>f</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Eisen1" target="_blank">[56]</a> (the number of chromosomes is an estimate), <sup>g</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Coyne3" target="_blank">[118]</a>, <sup>h</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-White1" target="_blank">[119]</a>, <sup>i</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Austerberry1" target="_blank">[120]</a>, <sup>j</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Coyne2" target="_blank">[64]</a> (for a single stage of the <i>Ichthyophthirius</i> life cycle), <sup>k</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Aury1" target="_blank">[121]</a>, <sup>l</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Duret1" target="_blank">[69]</a>, <sup>m</sup> - <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Gardner1" target="_blank">[122]</a>. Table statistics for <i>Perkinsus marinus</i> are for the current assembly deposited in GenBank (GCA_000006405.1).</p>
Length distributions of alternatively and nonalternatively fragmented nanochromosomes.
<p>The shortest nanochromosome isoforms produced from single (directional) alternative fragmentation sites are labeled as “Short isoform.” The histograms show normalized frequencies for 1,587 alternatively fragmented nanochromosomes and 15,219 nonalternatively fragmented nanochromosomes. Alternatively fragmented nanochromosomes have at least one strongly supported (≥10 Illumina reads) alternative fragmentation site >250 bp from either end of the nanochromosome (these nanochromosomes are >500 bp long).</p>
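The classification rule in this caption can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: the function name, the `(position, supporting_reads)` input layout, and the convention that a position is measured in bp from the left end are all hypothetical. Only the two thresholds (≥10 reads, >250 bp from either end) come from the caption.

```python
from typing import Iterable, Tuple

MIN_READS = 10          # "strongly supported" threshold stated in the caption
MIN_END_DISTANCE = 250  # site must be more than this many bp from each end


def is_alternatively_fragmented(length: int,
                                sites: Iterable[Tuple[int, int]]) -> bool:
    """Return True if any candidate site meets both caption criteria.

    `sites` holds (position, supporting_reads) pairs; `position` is assumed
    to be the distance in bp from the left end of the nanochromosome.
    """
    return any(
        reads >= MIN_READS
        and pos > MIN_END_DISTANCE
        and (length - pos) > MIN_END_DISTANCE
        for pos, reads in sites
    )
```

Because a qualifying site must lie more than 250 bp from both ends, any nanochromosome that passes is necessarily longer than 500 bp, which matches the parenthetical remark in the caption.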
Telomere end-binding protein-α paralogs in ciliates.
<p>The phylogeny is an ML tree generated by PhyML <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Guindon1" target="_blank">[123]</a> with a single substitution rate category and the JTT substitution model, optimized for tree topology and branch length. Bootstrap percentages for 1,000 replicates are indicated at the tree nodes. The multiple sequence alignments underlying the phylogeny were produced with MAFFT (v 6.418b <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Katoh1" target="_blank">[124]</a>) (default parameters; BLOSUM 62 substitution matrix) and were trimmed with trimAl 1.2 <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-CapellaGutierrez1" target="_blank">[125]</a> with the “-automated1” parameter to remove excess gaps and poorly aligned regions. GenBank accessions are provided for the taxa unless otherwise indicated. <i>Euplotes crassus</i> is indicated in blue (Q06184 and Q06183), and an additional match from our preliminary <i>Euplotes</i> genome assembly is EUP_contig393834_f1_1. <i>Perkinsus marinus</i> is purple (EER00428) and <i>Oxytricha nova</i> is light green (P29549). <i>Tetrahymena thermophila</i> (salmon color) accessions are from the <i>Tetrahymena</i> genome database <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Stover1" target="_blank">[126]</a> (TTHERM_00378980 and TTHERM_00378990); <i>Paramecium tetraurelia</i>'s TeBP-α protein (pink) is from ParameciumDB <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473-Arnaiz2" target="_blank">[127]</a> (GSPATP00001065001). All the nodes beginning with “Contig” are <i>Oxytricha trifallax</i> TeBP-α paralogs (dark green) and Contig22209.0.g66 is TeBP-α1, the original TeBP-α. 
The tree is rooted at the midpoint of the branch between <i>Arabidopsis thaliana</i> (Pot1a: AAX78213 and Pot1b: AAS99712) and <i>Homo sapiens</i> (Pot1: EAW83616; black) and the rest of the phylogeny. Gene expression levels, given as normalized RNA-seq counts (see <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio.1001473.s059" target="_blank">Text S1</a>; Supporting Materials and Methods) before (“fed”) and during conjugation (0–60 h), are shown for the <i>Oxytricha trifallax</i> TeBP-α paralogs; coding sequence lengths are also indicated (in bp) for each of these paralogs.</p>
Nanochromosomal variant frequencies.
<p>(A) Normalized to form a probability density (cumulative frequency of 1) and (B) unnormalized median nanochromosomal variant frequencies for six increasing ranges of mean SNP heterozygosity. Variant frequencies were determined for nanochromosomes with no non-self matches to the genome assembly (the same nanochromosomes underlying the SNP heterozygosity histogram for “matchless” nanochromosomes in <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001473#pbio-1001473-g004" target="_blank">Figure 4</a>), with variant positions called at the same minimum variant frequency as that used to determine potentially heterozygous sites (5% for sites with ≥20× read coverage). To exclude potentially paralogous mapped reads, we only analyzed nanochromosomes with ≤4 reads mapped to other contigs (using all nanochromosomes does not substantially change the form of the distributions). Variant frequency bins are labeled by their lower bounds. Variant frequencies ≥40 bp from either nanochromosome end were counted to avoid possible incorrect variant calling resulting from telomeric bases that were not masked (due to sequencing errors).</p>
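The per-site filters described in this caption might be expressed roughly as follows. This is a hedged sketch rather than the study's code: the function names, the 0-based position convention, and the 10-percentage-point bin width are assumptions made for illustration. The thresholds themselves (5% minimum variant frequency, ≥20× coverage, ≥40 bp from either end) and the labeling of bins by their lower bounds are taken from the caption.

```python
MIN_FREQ_PERCENT = 5   # minimum variant frequency from the caption (5%)
MIN_COVERAGE = 20      # minimum read coverage from the caption (>=20x)
END_MARGIN = 40        # variants must be >=40 bp from either end


def passes_filters(pos: int, length: int, coverage: int,
                   variant_reads: int) -> bool:
    """Apply the caption's coverage, frequency, and end-distance filters.

    `pos` is assumed to be a 0-based coordinate, so the distance to the
    right end is `length - 1 - pos`.
    """
    if coverage < MIN_COVERAGE:
        return False
    if pos < END_MARGIN or (length - 1 - pos) < END_MARGIN:
        return False
    # Integer comparison avoids floating-point edge cases at exactly 5%.
    return variant_reads * 100 >= MIN_FREQ_PERCENT * coverage


def bin_lower_bound_percent(variant_reads: int, coverage: int,
                            width_percent: int = 10) -> int:
    """Label a variant frequency by the lower bound of its bin, in percent.

    The bin width is a hypothetical choice; the caption does not state it.
    """
    freq_percent = variant_reads * 100 // coverage
    return freq_percent // width_percent * width_percent
```

For instance, a site with 7 variant reads out of 20 has a frequency of 35% and would fall in the bin labeled 30 under the assumed 10-point width.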