60 research outputs found

Genetic Variation in an Individual Human Exome

Author: Axelrod Nelson
Busam Dana A.
Huang Jiaqi
Levy Samuel
Li Kelvin
Ng Pauline C.
Stockwell Timothy B.
Strausberg Robert L.
Venter J. Craig
Walenz Brian P.
Publication venue: Public Library of Science
Publication date: 01/01/2008

There is much interest in characterizing the variation in a human individual, because this may elucidate what contributes significantly to a person's phenotype, thereby enabling personalized genomics. We focus here on the variants in a person's ‘exome,’ which is the set of exons in a genome, because the exome is believed to harbor much of the functional variation. We provide an analysis of the ∼12,500 variants that affect the protein coding portion of an individual's genome. We identified ∼10,400 nonsynonymous single nucleotide polymorphisms (nsSNPs) in this individual, of which ∼15–20% are rare in the human population. We predict ∼1,500 nsSNPs affect protein function and these tend to be heterozygous, rare, or novel. Of the ∼700 coding indels, approximately half tend to have lengths that are a multiple of three, which causes insertions/deletions of amino acids in the corresponding protein, rather than introducing frameshifts. Coding indels also occur frequently at the termini of genes, so even if an indel causes a frameshift, an alternative start or stop site in the gene can still be used to make a functional protein. In summary, we reduced the set of ∼12,500 nonsilent coding variants by ∼8-fold to a set of variants that are most likely to have major effects on their proteins' functions. This is our first glimpse of an individual's exome and a snapshot of the current state of personalized genomics. The majority of coding variants in this individual are common and appear to be functionally neutral. Our results also indicate that some variants can be used to improve the current NCBI human reference genome. As more genomes are sequenced, many rare variants and non-SNP variants will be discovered. We present an approach to analyze the coding variation in humans by proposing multiple bioinformatic methods to home in on possible functional variation.
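The abstract's point about indel lengths can be illustrated with a minimal sketch. It assumes indels are given as reference/alternate allele strings (a VCF-like convention, not something the abstract specifies); the function name `indel_effect` and the toy alleles are hypothetical. A net length change that is a multiple of three adds or removes whole codons, while any other change shifts the reading frame.

```python
def indel_effect(ref: str, alt: str) -> str:
    """Classify a coding indel by its net length change (hypothetical helper)."""
    delta = len(alt) - len(ref)  # > 0 for insertions, < 0 for deletions
    if delta == 0:
        return "substitution"  # same length: not an indel
    # A multiple of three inserts or deletes whole codons and keeps the frame.
    return "in-frame" if delta % 3 == 0 else "frameshift"


# Toy alleles, invented for illustration only.
examples = [("A", "ATGC"), ("ATGC", "A"), ("A", "AT")]
effects = [indel_effect(ref, alt) for ref, alt in examples]
print(effects)  # ['in-frame', 'in-frame', 'frameshift']
```

This only looks at length. It does not model the abstract's second observation, that frameshifts near gene termini may be rescued by alternative start or stop sites.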

CiteSeerX

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

Nanoliter Reactors Improve Multiple Displacement Amplification of Genomes from Single Cells

Author: Aaron L Halpern
Brian P Walenz
Karen Y Beeson
Paul M Richardson
Roger S Lasken
Stephen R Quake
Susanne M. D Goldberg
Thomas Ishoey
Timothy B Stockwell
Yann Marcy
Publication venue: Public Library of Science
Publication date: 01/09/2007

Since only a small fraction of environmental bacteria are amenable to laboratory culture, there is great interest in genomic sequencing directly from single cells. Sufficient DNA for sequencing can be obtained from one cell by the Multiple Displacement Amplification (MDA) method, thereby eliminating the need to develop culture methods. Here we used a microfluidic device to isolate individual Escherichia coli and amplify genomic DNA by MDA in 60-nl reactions. Our results confirm a report that reduced MDA reaction volume lowers nonspecific synthesis that can result from contaminant DNA templates and unfavourable interaction between primers. The quality of the genome amplification was assessed by qPCR and compared favourably to single-cell amplifications performed in standard 50-μl volumes. Amplification bias was greatly reduced in nanoliter volumes, thereby providing a more even representation of all sequences. Single-cell amplicons from both microliter and nanoliter volumes provided high-quality sequence data by high-throughput pyrosequencing, thereby demonstrating a straightforward route to sequencing genomes from single cells.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

One-dimensional fluids with second nearest-neighbor interactions

Author: Brian Walenz (520884)
Hayan Lee (2275462)
James Gurtowski (4260199)
Jason Miller (791094)
Joann Mudge (140254)
Junqi Liu (3864574)
Kevin Silverstein (3580874)
Li Song (34475)
Lyza Maron (4260205)
Michael Schatz (475909)
Namrata Singh (1971532)
Nevin Young (3864571)
Peng Zhou (116747)
Peter Tiffin (104127)
Robert Stupar (3398648)
Roxanne Denny (417309)
Susan McCouch (14331)
Thiruvarangan Ramaraj (3240726)
W. McCombie (4260202)
Publication venue
Publication date: 01/01/2017

As is well known, one-dimensional systems with interactions restricted to first nearest neighbors admit a full analytically exact statistical-mechanical solution. This is essentially due to the fact that the knowledge of the first nearest-neighbor probability distribution function, $p_1(r)$, is enough to determine the structural and thermodynamic properties of the system. On the other hand, if the interaction between second nearest-neighbor particles is turned on, the analytically exact solution is lost. Not only is the knowledge of $p_1(r)$ no longer sufficient, but even its determination becomes a complex many-body problem. In this work we systematically explore different approximate solutions for one-dimensional second nearest-neighbor fluid models. We apply those approximations to the square-well and the attractive two-step pair potentials and compare them with Monte Carlo simulations, finding excellent agreement.

Comment: 26 pages, 12 figures; v2: more references added

arXiv.org e-Print Archive

Crossref

FigShare

Improved reference genome for the domestic horse increases assembly contiguity and composition

Author: Antczak Douglas F.
Bailey Ernest
Bellone Rebecca R.
Brooks Samantha A.
DePriest Michael S., Jr.
Fiddes Ian T.
Finno Carrie J.
Green Richard E.
Hestand Matthew S.
Kalbfleisch Theodore S.
MacLeod James N.
McCue Molly E.
Miller Donald C.
O'Connell Brendan L.
Orlando Ludovic
Petersen Jessica L.
Rice Edward S.
Saremi Nedda F.
Vermeesch Joris R.
Vershinina Alisa O.
Walenz Brian P.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018

Theodore Kalbfleisch et al. present an improved genome assembly for the domestic horse by combining short- and long-read data, as well as proximity ligation data. They improve contiguity of the assembly by 40-fold, with a 10-fold reduction in gaps.

Directory of Open Access Journals

Copenhagen University Research Information System

Improved reference genome for the domestic horse increases assembly contiguity and composition

Author: Antczak Douglas F.
Bailey Ernest
Bellone Rebecca R.
Brooks Samantha A.
DePriest Michael S., Jr.
Fiddes Ian T.
Finno Carrie J.
Green Richard E.
Hestand Matthew S.
Kalbfleisch Theodore S.
MacLeod James N.
McCue Molly E.
Miller Donald C.
O'Connell Brendan L.
Orlando Ludovic
Petersen Jessica L.
Rice Edward S.
Saremi Nedda F.
Vermeesch Joris R.
Vershinina Alisa O.
Walenz Brian P.
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2018

Recent advances in genomic sequencing technology and computational assembly methods have allowed scientists to improve reference genome assemblies in terms of contiguity and composition. EquCab2, a reference genome for the domestic horse, was released in 2007. Although of equal or better quality compared to other first-generation Sanger assemblies, it had many of the shortcomings common to them. In 2014, the equine genomics research community began a project to improve the reference sequence for the horse, building upon the solid foundation of EquCab2 and incorporating new short-read data, long-read data, and proximity ligation data. Here, we present EquCab3. The count of non-N bases in the incorporated chromosomes is improved from 2.33 Gb in EquCab2 to 2.41 Gb in EquCab3. Contiguity has also been improved nearly 40-fold with a contig N50 of 4.5 Mb and scaffold contiguity enhanced to where all but one of the 32 chromosomes consist of a single scaffold.
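For readers unfamiliar with the contig N50 statistic quoted above, here is a minimal sketch of the standard definition: the largest length L such that contigs of length L or longer together cover at least half of the assembled bases. The function name `n50` and the toy lengths are illustrative assumptions and have nothing to do with the EquCab3 data.

```python
def n50(lengths):
    """Return the N50 of a collection of contig (or scaffold) lengths."""
    ordered = sorted(lengths, reverse=True)  # longest contigs first
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first contig that pushes the running total to half defines N50.
        if running >= half:
            return length
    raise ValueError("no contig lengths given")


# Toy assembly, invented for illustration: total 20, so half is 10.
toy_lengths = [8, 5, 4, 2, 1]
print(n50(toy_lengths))  # 5, because 8 + 5 = 13 >= 10
```

Because N50 depends on the length distribution rather than the contig count, joining a few long contigs can raise it sharply, which is how a near 40-fold contiguity gain can come with only a modest increase in non-N bases.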

DigitalCommons@University of Nebraska

Directory of Open Access Journals

Copenhagen University Research Information System

University of Kentucky

eScholarship - University of California

The Diploid Genome Sequence of an Individual Human

Presented here is a genome sequence of an individual human. It was produced from ∼32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2–206 bp), 292,102 heterozygous insertion/deletion events (indels) (1–571 bp), 559,473 homozygous indels (1–82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor; however, these events involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.
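The 22% figure can be checked against the event counts listed in the abstract. The sketch below sums only the five enumerated categories. Segmental duplications and copy number variation regions are described as "numerous" without counts, so leaving them out is an assumption of this check, and the variable names are illustrative.

```python
# Event counts as enumerated in the abstract.
counts = {
    "SNPs": 3_213_401,
    "block substitutions": 53_823,
    "heterozygous indels": 292_102,
    "homozygous indels": 559_473,
    "inversions": 90,
}

total = sum(counts.values())      # all enumerated variant events
non_snp = total - counts["SNPs"]  # everything that is not a SNP
share = non_snp / total

print(total)               # 4118889, consistent with "more than 4.1 million"
print(non_snp)             # 905488
print(round(100 * share))  # 22
```

The 74% share of variant bases cannot be reproduced the same way, because the abstract gives only length ranges per category, not per-event lengths.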

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Diposit Digital de la Universitat de Barcelona

ScholarBank@NUS

The Atlantic salmon genome provides insights into rediploidization

Author: Antony Samy Jeevan Karloss
Baranski Matthew
Caler Lis
Davidson William S
de Jong Pieter J.
Fan Dingding
Genova Alex Di
Gjuvsland Arne Bjørke
Grammes Fabian
Grimholt Unni
Grove Harald
Hermansen Russell A.
Hu Yan
Hvidsten Torgeir Rhoden
Iturra Patricia
Jakobsen Kjetill Sigurd
Jentoft Sissel
Jiang Xuanting
Jonassen Inge
Jones Steven J.M.
Kent Matthew Peter
Koop Ben F
Leong Jong
Liberles David A.
Lien Sigbjørn
Maass Alejandro
Miller Jason R.
Minkley David R.
Moen Thomas
Nederbragt Alexander J.
Nome Torfinn
Omholt Stig William
Palti Yniv
Rondeau Eric
Sandve Simen Rød
Smith Douglas W.
Tooming-Klunderud Ave
Vidal Rodrigo
Vigeland Magnus Dehli
Vik Jon Olav
von Schalburg Kristian R.
Våge Dag Inge
Walenz Brian
Yorke James A.
Zimin Aleksey
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/01/2018

The whole-genome duplication 80 million years ago of the common ancestor of salmonids (salmonid-specific fourth vertebrate whole-genome duplication, Ss4R) provides unique opportunities to learn about the evolutionary fate of a duplicated vertebrate genome in 70 extant lineages. Here we present a high-quality genome assembly for Atlantic salmon (Salmo salar), and show that large genomic reorganizations, coinciding with bursts of transposon-mediated repeat expansions, were crucial for the post-Ss4R rediploidization process. Comparisons of duplicate gene expression patterns across a wide range of tissues with orthologous genes from a pre-Ss4R outgroup unexpectedly demonstrate far more instances of neofunctionalization than subfunctionalization. Surprisingly, we find that genes that were retained as duplicates after the teleost-specific whole-genome duplication 320 million years ago were not more likely to be retained after the Ss4R, and that the duplicate retention was not influenced to a great extent by the nature of the predicted protein interactions of the gene products. Finally, we demonstrate that the Atlantic salmon assembly can serve as a reference sequence for the study of other salmonids for a range of purposes.

University of Bergen

Genomic Insights Into The Ixodes scapularis Tick Vector Of Lyme Disease

Author: Abrudan Jenica L.
Almeida Francisca C.
Arensburger Peter
Ayllon Nieves
Barker Stephen C.
Benton Richard
Bhide Ketaki
Bidwell Shelby
Birren Bruce
Bissinger Brooke W.
Bonzon-Kulichenko Elena
Buckingham Steven D.
Caffrey Daniel R.
Caimano Melissa J.
Caler Elisabet
Collins Frank H.
Croset Vincent
Driscoll Timothy
Fraser Claire M.
Fuente Jose de la
Gilbert Don
Gillespie Joseph J.
Giraldo-Calderon Gloria I.
Grabowski Jeffrey M.
Grimmelikhuijzen Cornelis J. P.
Gulia-Nuss Monika
Hammond Martin P.
Hannick Linda I.
Hauser Frank
Hill Catherine A.
Hostetler Jessica B.
Jiang David
Joardar Vinita S.
Kennedy Ryan C.
Khalil Sayed M. S.
Kim Donghun
Kocan Katherine M.
Koci Juraj
Koren Sergey
Kuhn Richard J.
Kurtti Timothy J.
Kwon Hyeogsun
Lang Emma G.
Lawson Daniel
Lees Kristin
Megy Karine
Meyer Jason M.
Miller Jason R.
Nelson David R.
Nelson Karen E.
Nene Vishvanath M.
Nuss Andrew B.
Park Yoonseong
Pedra Joao H. F.
Perera Rushika
Pietrantonio Patricia V.
Qi Yumin
Radolf Justin D.
Ribeiro Jose M.
Robertson Hugh M.
Roe R. Michael
Rozas Julio
Sakamoto Joyce M.
Sanchez-Gracia Alejandro
Sattelle David B.
Severo Maiara S.
Shao Renfu
Shumway Martin
Silverman Neal
Simo Ladislav
Sonenshine Daniel E.
Sutton Granger
Thiagarajan Mathangi
Thimmapuram Jyothi
Tojo Marta
Tornador Cristian
Tu Zhijian
Tubio Jose M. C.
Unger Maria F.
Van Zee Janice P.
Vazquez Jesus
Vieira Filipe G.
Villar Margarita
Walenz Brian P.
Waterhouse Robert M.
Wespiser Adam R.
Wikel Stephen K.
Wortman Jennifer R.
Yang Yunlong
Young Sarah
Zdobnov Evgeny M.
Zeng Qiandong
Zhu Jiwei
Publication venue: The Research Repository @ WVU
Publication date: 01/01/2016

Ticks transmit more pathogens to humans and animals than any other arthropod. We describe the 2.1 Gbp nuclear genome of the tick, Ixodes scapularis (Say), which vectors pathogens that cause Lyme disease, human granulocytic anaplasmosis, babesiosis and other diseases. The large genome reflects accumulation of repetitive DNA, new lineages of retrotransposons, and gene architecture patterns resembling ancient metazoans rather than pancrustaceans. Annotation of scaffolds representing ∼57% of the genome reveals 20,486 protein-coding genes and expansions of gene families associated with tick–host interactions. We report insights from genome analyses into parasitic processes unique to ticks, including host ‘questing’, prolonged feeding, cuticle synthesis, blood meal concentration, novel methods of haemoglobin digestion, haem detoxification, vitellogenesis and prolonged off-host survival. We identify proteins associated with the agent of human granulocytic anaplasmosis, an emerging disease, and the encephalitis-causing Langat virus, and a population structure correlated to life-history traits and transmission of the Lyme disease agent.

The Research Repository @ WVU (West Virginia University)