5 research outputs found

    A genome database for a Japanese population of the larvacean Oikopleura dioica

    Get PDF
    The larvacean Oikopleura dioica is a planktonic chordate, and is tunicate that belongs to the closest relatives to vertebrates. Its simple and transparent body, invariant embryonic cell lineages, and short life cycle of five days make it a promising model organism for developmental biology research. The genome browser OikoBase was established in 2013 using Norwegian O. dioica. However, genome information for other populations is not available, even though many researchers have studied local populations. In the present study, we sequenced using Illumina and PacBio RSII technologies the genome of O. dioica from a southwestern Japanese population that was cultured in our laboratory for three years. The genome of Japanese O. dioica was assembled into 576 scaffold sequences with a total length and N50 length of 56.6 Mb and 1.5 Mb, respectively. A total of 18,743 gene models (transcript models) were predicted in the genome assembly, named as OSKA2016. In addition, 19,277 non-redundant transcripts were assembled using RNA-seq data. The OSKA2016 has global sequence similarity of only 86.5% when compared with the OikoBase, highlighting the sequence difference between the two far distant O. dioica populations on the globe. The genome assembly, transcript assembly, and transcript models were incorporated into ANISEED (https://www.aniseed.cnrs.fr/) for genome browsing and blast searches. Moreover, screening of the male-specific scaffolds revealed that over 2.6 Mb of sequence were included in the male-specific Yregion. The genome and transcriptome resources from two distinct populations will be useful datasets for developmental biology, evolutionary biology, and molecular ecology using this model organism

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead
    corecore