60 research outputs found
Recommended from our members
Viral coinfection analysis using a MinHash toolkit
Abstract: Background: Human papillomavirus (HPV) is a common sexually transmitted infection associated with cervical cancer that frequently occurs as a coinfection of types and subtypes. Highly similar sublineages that show over 100-fold differences in cancer risk are not distinguishable in coinfections with current typing methods. Results: We describe an efficient set of computational tools, rkmh, for analyzing complex mixed infections of related viruses based on sequence data. rkmh makes extensive use of MinHash similarity measures, and includes utilities for removing host DNA and classifying reads by type, lineage, and sublineage. We show that rkmh is capable of assigning reads to their HPV type as well as HPV16 lineage and sublineages. Conclusions: Accurate read classification enables estimates of percent composition when there are multiple infecting lineages or sublineages. While we demonstrate rkmh for HPV with multiple sequencing technologies, it is also applicable to other mixtures of related sequences
Submicron Structures Technology and Research
Contains reports on fourteen research projects.Joint Services Electronics Program (Contract DAAG29-83-K-0003)U.S. Navy - Office of Naval Research (Contract N00014-79-C-0908)National Science Foundation (Grant ECS82-05701)Semiconductor Research Corporation (Grant 83-01-033)U.S. Department of Energy (Contract DE-ACO2-82-ER-13019)Lawrence Livermore National Laboratory (Contract 2069209)National Aeronautics and Space Administration (Contract NAS5-27591)Defense Advanced Research Projects Agency (Contract N00014-79-C-0908)National Science Foundation (Grant ECS80-17705)National Aeronautics and Space Administration (Contract NGL22-009-638
Submicron Structures Technology and Research
Contains reports on ten research projects.Joint Services Electronics Program (Contract DAAG29-83-K-0003)Joint Services Electronics Program (Contract DAAL03-86-K-0002)National Science Foundation (Grant ECS82-05701)National Science Foundation (Grant ECS85-06565)Lawrence Livermore Laboratory (Subcontract 2069209)National Science Foundation (Grant ECS85-03443)U.S. Air Force - Office of Scientific Research (Grant AFOSR-85-0154)National Aeronautics and Space Administration (Grant NGL22-009-638)National Science Foundation (through KMS Fusion, Inc.)U.S. Navy - Office of Naval Research (Contract N00014-79-C-0908
An integrated map of structural variation in 2,504 human genomes
© 2015 Macmillan Publishers Limited. All rights reserved. Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association
Submicron Structures Technology and Research
Contains reports on fifteen research projects.Joint Services Electronics Program (Contract DAALO3-86-K-0002)National Science Foundation (Grant ECS 87-09806)Semiconductor Research Corporation (Contract 87-SP-080)National Science Foundation (Grant ECS 85-03443)U.S. Air Force - Office of Scientific Research (Grant AFOSR 85-0376)National Science Foundation (Grant ECS 85-06565)U.S. Air Force - Office of Scientific Research (Grant AFOSR 85-0154)Lawrence Livermore National Laboratory (Subcontract 2069209)National Aeronautics and Space Adminstration (Grant NGL22-009-683)Collaboration with KMS Fusion, Inc
A Comprehensive Map of Mobile Element Insertion Polymorphisms in Humans
As a consequence of the accumulation of insertion events over evolutionary time, mobile elements now comprise nearly half of the human genome. The Alu, L1, and SVA mobile element families are still duplicating, generating variation between individual genomes. Mobile element insertions (MEI) have been identified as causes for genetic diseases, including hemophilia, neurofibromatosis, and various cancers. Here we present a comprehensive map of 7,380 MEI polymorphisms from the 1000 Genomes Project whole-genome sequencing data of 185 samples in three major populations detected with two detection methods. This catalog enables us to systematically study mutation rates, population segregation, genomic distribution, and functional properties of MEI polymorphisms and to compare MEI to SNP variation from the same individuals. Population allele frequencies of MEI and SNPs are described, broadly, by the same neutral ancestral processes despite vastly different mutation mechanisms and rates, except in coding regions where MEI are virtually absent, presumably due to strong negative selection. A direct comparison of MEI and SNP diversity levels suggests a differential mobile element insertion rate among populations
Determining value in health technology assessment: Stay the course or tack away?
The economic evaluation of new health technologies to assess whether the value of the expected health benefits warrants the proposed additional costs has become an essential step in making novel interventions available to patients. This assessment of value is problematic because there exists no natural means to measure it. One approach is to assume that society wishes to maximize aggregate health, measured in terms of quality-adjusted life-years (QALYs). Commonly, a single 'cost-effectiveness' threshold is used to gauge whether the intervention is sufficiently efficient in doing so. This approach has come under fire for failing to account for societal values that favor treating more severe illness and ensuring equal access to resources, regardless of pre-existing conditions or capacity to benefit. Alternatives involving expansion of the measure of benefit or adjusting the threshold have been proposed and some have advocated tacking away from the cost per QALY entirely to implement therapeutic area-specific efficiency frontiers, multicriteria decision analysis or other approaches that keep the dimensions of benefit distinct and value them separately. In this paper, each of these alternative courses is considered, based on the experiences of the authors, with a view to clarifying their implications
Punctuated bursts in human male demography inferred from 1,244 worldwide Y-chromosome sequences
Computational pan-genomics: Status, promises and challenges
Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different Computational methods and paradigms are needed.We will witness the rapid extension of Computational pan-genomics, a new sub-area of research in Computational biology. In this article, we generalize existing definitions and understand a pangenome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a Computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations
Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples
Funder: NCI U24CA211006Abstract: The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts
- …