32 research outputs found

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead

    Structural Organization of Virulence-Associated Plasmids of <i>Yersinia pestis</i>

    Full text link
    ABSTRACT The complete nucleotide sequence and gene organization of the three virulence plasmids from Yersinia pestis KIM5 were determined. Plasmid pPCP1 (9,610 bp) has a GC content of 45.3% and encodes two previously known virulence factors, an associated protein, and a single copy of IS 100 . Plasmid pCD1 (70,504 bp) has a GC content of 44.8%. It is known to encode a number of essential virulence determinants, regulatory functions, and a multiprotein secretory system comprising the low-calcium response stimulation that is shared with the other two Yersinia species pathogenic for humans ( Y. pseudotuberculosis and Y. enterocolitica ). A new pseudogene, which occurs as an intact gene in the Y. enterocolitica and Y. pseudotuberculosis -derived analogues, was found in pCD1. It corresponds to that encoding the lipoprotein YlpA. Several intact and partial insertion sequences and/or transposons were also found in pCD1, as well as six putative structural genes with high homology to proteins of unknown function in other yersiniae. The sequences of the genes involved in the replication of pCD1 are highly homologous to those of the cognate plasmids in Y. pseudotuberculosis and Y. enterocolitica , but their localization within the plasmid differs markedly from those of the latter. Plasmid pMT1 (100,984 bp) has a GC content of 50.2%. It possesses two copies of IS 100 , which are located 25 kb apart and in opposite orientations. Adjacent to one of these IS 100 inserts is a partial copy of IS 285 . A single copy of an IS 200 -like element (recently named IS 1541 ) was also located in pMT1. In addition to 5 previously described genes, such as murine toxin, capsule antigen, capsule anchoring protein, etc., 30 homologues to genes of several bacterial species were found in this plasmid, and another 44 open reading frames without homology to any known or hypothetical protein in the databases were predicted. </jats:p
    corecore