15 research outputs found

    Validation of mapped long-read splice sites.

    No full text
    <p>TopHat2 was used to identify observed splice junctions from the short-read data. The light red and light blue lines show the distribution of distances from Ensembl-annotated splice sites to the experimentally observed splice sites. Both of these lines peak heavily at 0 indicating the degree of agreement between these orthogonal datasets. Splice sites annotated from long-read sequencing (blue and red), also show overall agreement, with a small peak of misidentified splice donor sites within 10 bp of the accurate one, which is possibly due to alignment errors near sites with multiple possible splice donor sites.</p

    Long-Read Sequencing of Chicken Transcripts and Identification of New Transcript Isoforms

    No full text
    <div><p>The chicken has long served as an important model organism in many fields, and continues to aid our understanding of animal development. Functional genomics studies aimed at probing the mechanisms that regulate development require high-quality genomes and transcript annotations. The quality of these resources has improved dramatically over the last several years, but many isoforms and genes have yet to be identified. We hope to contribute to the process of improving these resources with the data presented here: a set of long cDNA sequencing reads, and a curated set of new genes and transcript isoforms not currently represented in the most up-to-date genome annotation currently available to the community of researchers who rely on the chicken genome.</p></div

    Identification of new isoforms and genes.

    No full text
    <p>Shown in this image from the UCSC genome browser are examples of the new genes and isoforms identified from among the short and long-read annotations: A. This region carries two distinct annotations, one for an alternate transcription start site (TSS), and another for a completely new gene that is currently unannotated. B. A heart-specific isoform of the FKBP7 gene. C. Both long-read and short-read data support the existence of transcripts going in opposite directions in this region of chromosome 9.</p

    Broad coverage of existing chicken genes using long-read sequencing.

    No full text
    <p>Along the x-axis representing transcript length is a histogram of the number of RefSeq transcripts within a given range of lengths (grey). A similar histogram is shown for those transcripts that overlap each RefSeq annotation by any amount (red), or by more than 90% of the gene length (dark red).</p

    New isoforms and genes with tissue-specific expression.

    No full text
    <p>The relative expression of (a) the 2,018 differentially expressed isoforms and (b) the 120 new differentially expressed gene annotations are provided across chicken adult brain, cerebellum, heart, kidney, liver and testes datasets, along the scale provided.</p

    Additional file 1: of Normalized long read RNA sequencing in chicken reveals transcriptome complexity similar to human

    No full text
    PacBio sequencing read lengths (pdf format) (a) Read lengths for chicken brain 1-kb size selection. (b) Read lengths for chicken brain 2-kb size selection. (c) Read lengths for chicken embryo 0.8-kb size selection. (d) Read lengths for chicken embryo 2-kb size selection. (PDF 184 kb

    ENCODE-like study using PacBio sequencing

    No full text
    <p>Integrative functional biology of a fungus:  Using PacBio SMRT sequencing to interrogate the genome, epigenome, and transcriptome of Neurospora crassa</p
    corecore