11 research outputs found

    Finishing the euchromatic sequence of the human genome

    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.

    Complete Genome Sequence of the Genetically Tractable Hydrogenotrophic Methanogen Methanococcus maripaludis

    The genome sequence of the genetically tractable, mesophilic, hydrogenotrophic methanogen Methanococcus maripaludis contains 1,722 protein-coding genes in a single circular chromosome of 1,661,137 bp. Of the protein-coding genes (open reading frames [ORFs]), 44% were assigned a function, 48% were conserved but had unknown or uncertain functions, and 7.5% (129 ORFs) were unique to M. maripaludis. Of the unique ORFs, 27 were confirmed to encode proteins by the mass spectrometric identification of unique peptides. Genes for most known functions and pathways were identified. For example, a full complement of hydrogenases and methanogenesis enzymes was identified, including eight selenocysteine-containing proteins, with each being paralogous to a cysteine-containing counterpart. At least 59 proteins were predicted to contain iron-sulfur centers, including ferredoxins, polyferredoxins, and subunits of enzymes with various redox functions. Unusual features included the absence of a Cdc6 homolog, implying a variation in replication initiation, and the presence of a bacterial-like RNase HI as well as an RNase HII typical of the Archaea. The presence of alanine dehydrogenase and alanine racemase, which are uniquely present among the Archaea, explained the ability of the organism to use L- and D-alanine as nitrogen sources. Features that contrasted with the related organism Methanocaldococcus jannaschii included the absence of inteins, even though close homologs of most intein-containing proteins were encoded. Although two-thirds of the ORFs had their highest Blastp hits in Methanocaldococcus jannaschii, lateral gene transfer or gene loss has apparently resulted in genes, which are often clustered, with top Blastp hits in more distantly related groups.

    The DNA sequence and biological annotation of human chromosome 1

    The DNA sequence and biological annotation of human chromosome 1.

    The reference sequence for each human chromosome provides the framework for understanding genome function, variation and evolution. Here we report the finished sequence and biological annotation of human chromosome 1. Chromosome 1 is gene-dense, with 3,141 genes and 991 pseudogenes, and many coding sequences overlap. Rearrangements and mutations of chromosome 1 are prevalent in cancer and many other diseases. Patterns of sequence variation reveal signals of recent selection in specific genes that may contribute to human fitness, and also in regions where no function is evident. Fine-scale recombination occurs in hotspots of varying intensity along the sequence, and is enriched near genes. These and other studies of human biology and disease encoded within chromosome 1 are made possible with the highly accurate annotated sequence, as part of the completed set of chromosome sequences that comprise the reference human genome.

    The DNA sequence and biological annotation of human chromosome 1

    The DNA sequence, annotation and analysis of human chromosome 3
