10 research outputs found

    Sorghum Genome Sequencing by Methylation Filtration

    Get PDF
    Sorghum bicolor is a close relative of maize and is a staple crop in Africa and much of the developing world because of its superior tolerance of arid growth conditions. We have generated sequence from the hypomethylated portion of the sorghum genome by applying methylation filtration (MF) technology. The evidence suggests that 96% of the genes have been sequence tagged, with an average coverage of 65% across their length. Remarkably, this level of gene discovery was accomplished after generating a raw coverage of less than 300 megabases of the 735-megabase genome. MF preferentially captures exons and introns, promoters, microRNAs, and simple sequence repeats, and minimizes interspersed repeats, thus providing a robust view of the functional parts of the genome. The sorghum MF sequence set is beneficial to research on sorghum and is also a powerful resource for comparative genomics among the grasses and across the entire plant kingdom. Thousands of hypothetical gene predictions in rice and Arabidopsis are supported by the sorghum dataset, and genomic similarities highlight evolutionarily conserved regions that will lead to a better understanding of rice and Arabidopsis

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead

    Annotation of <i>Arabidopsis</i> by Sorghum MF Versus Rice Gene Sequences

    No full text
    <p>Shown are the number of <i>Arabidopsis</i> proteins that are matched in a TBLASTN comparison to the sorghum MF set (blue) versus the rice gene sequences (yellow). The <i>Arabidopsis</i> proteins, after having known repetitive elements removed (see <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.0030013#s3" target="_blank">Materials and Methods</a>), have been categorized as either hypothetical or known based on the definition line. <i>Arabidopsis</i> proteins were considered supported if they matched with an E-value less than or equal to 1 × 10<sup>−8</sup>. Sb, S. bicolor MF set; Osj:seq, <i>Oryza sativa japonica</i> gene sequences.</p

    Methylation Status of <i>tb2</i> and Kafirin Cluster

    No full text
    <div><p>(A and B) Restriction maps of the <i>tb2</i> gene (A) and the kafirin consensus sequences (B) are shown. The relevant restriction sites are indicated vertically and the numbers indicate the distances scale in basepairs. Each CDS is depicted as a blue-shaded arrow, and the region assayed is indicated by a black bar. The circles depict sites that are not present in every kafirin gene, and the color represents the number of genes that do not share the site. The orange circle (5′-most HhaI site) is a site conserved in nine of 11 kafirin genes, and the red circle (3′-most PstI site) is a site present in ten of the 11.</p> <p>(C) Results from a representative methylation analysis of <i>tb2;</i> the inset depicts the template dilution standard curve used to set the threshold for the experiment. Each experiment was performed three times with four on-board replicates per assay point. The results for each of the four differentially treated reactions are depicted with different colors. Red, mock-treated; blue, mcrBC-digested; orange, HhaI-digested; and green, HhaI + mcrBC double-digest. The inset shows the standard dilution control with two replicates at each dilution. The control was used to set the threshold for detection. The specificity of each reaction was confirmed using melt-curve analysis.</p> <p>(D) Results from a representative methylation analysis of the 11 kafirin genes. The results for each of the six differentially treated reactions are the same as in (C), with the following additional digests: pink, PstI-digested; light blue, PstI + mcrBC double-digest. Notice that the mcrBC with and without PstI yields the same Ct, while HhaI + mcrBC (green) yields a higher Ct on average; suggesting additional cleavage.</p></div

    Gene Discovery Rate

    No full text
    <p>Gene discovery rates for sorghum MF (blue), sorghum ESTs (pink), and an <i>Arabidopsis</i> simulation (dotted black) are shown. The gene discovery rates for the MF and ESTs were calculated based on matches to a set of 137 genes annotated on sorghum BAC clones versus the number of MF and EST reads. The <i>Arabidopsis</i> simulation was calculated based on the fold-coverage of chromosome 1, which contains 7,520 genes. The fold coverage was converted into read numbers as detailed in the <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.0030013#s3" target="_blank">Materials and Methods</a>.</p

    Phylogenetic Comparison of Sorghum <i>DREB1</i> Genes

    No full text
    <p>A phylogenetic tree comparing the AP2 domain of the sorghum <i>DREB1</i> genes to those of <i>Arabidopsis</i> and rice was constructed using CLUSTALX [<a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.0030013#pbio-0030013-b61" target="_blank">61</a>]. The genes encoding proteins from <i>Arabidopsis</i> are <i>DREB1A, DREB1B,</i> and <i>DREB1C</i>. Rice genes are <i>OsDREB1A, OsDREB1B, OsDREB1C</i> (nucleotides 142,337–142,981), and <i>OsDREB1D</i> (nucleotides 1,489–2,250). AP2 domains from other <i>Arabidopsis</i> proteins are also included: <i>APETALA2</i> (R2 domain), <i>AtERF-1, LEAFY PETIOLE,</i> and <i>TINY</i>.</p
    corecore