23 research outputs found

Assembly of non-unique insertion content using next-generation sequencing

Author: Eskin Eleazar
Hormozdiari Farhad
Parrish Nathaniel
Publication venue: BioMed Central
Publication date: 01/01/2011

Recent studies in genomics have highlighted the significance of sequence insertions in determining individual variation. Efforts to discover the content of these sequence insertions have been limited to short insertions and long unique insertions. Much of the inserted sequence in the typical human genome, however, is a mixture of repeated and unique sequence. Current methods are designed to assemble only unique sequence insertions, using reads that do not map to the reference. These methods cannot assemble repeated sequence insertions, because the reads will map to the reference at a different locus.

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

inGAP-sv: a novel scheme to identify and visualize structural variation from paired end mapping data

Author: Abel
Bentley
Campbell
Chen
Conrad
Fangqing Zhao
Feuk
Hajirasouliha
Hormozdiari
Ji Qi
Kidd
Korbel
Lee
Li
McCarroll
Medvedev
Mills
Qi
Sharp
Sindi
Stankiewicz
Tuzun
Wong
Ye
Zeitouni
Publication venue: Oxford University Press

Mining genetic variation from personal genomes is a crucial step towards investigating the relationship between genotype and phenotype. However, compared to the detection of SNPs and small indels, characterizing large and particularly complex structural variation is much more difficult and less intuitive. In this article, we present a new scheme (inGAP-sv) to detect and visualize structural variation from paired-end mapping data. Under this scheme, abnormally mapped read pairs are clustered based on the location of a gap signature. Several important features, including local depth of coverage, mapping quality and associated tandem repeats, are used to evaluate the quality of predicted structural variation. Compared with other approaches, it can detect many more large insertions and complex variants with a lower false discovery rate. Moreover, inGAP-sv, written in the Java programming language, provides a user-friendly interface and runs on multiple operating systems. It can be freely accessed at http://ingap.sourceforge.net/

Crossref

PubMed Central

Mapping Copy Number Variation by Population Scale Genome Sequencing

Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serve as a resource for sequencing-based association studies.

Harvard University - DASH

PAIR: polymorphic Alu insertion recognition

Author: AH Salem
B Halldórsson
Bjarni V Halldórsson
C Papadimitriou
C Stewart
D Hartl
DF Gudbjartsson
E Ullu
F Hormozdiari
F Rivadeneira
H Holm
H Li
H Stefansson
I Hajirasouliha
J Jurka
J Korbel
J Wang
Jón Ingi Sveinbjörnsson
M Batzer
N Siva
P Medvedev
P Sulem
PL Deininger
R Durbin
RE Mills
U Styrkarsdottir
Publication venue: BioMed Central
Publication date: 01/01/2012

Crossref

Springer - Publisher Connector

PubMed Central

Detection of Genomic Structural Variants from Next-Generation Sequencing Data

Author: D'Aurizio Romina
Publication venue: -
Publication date: 01/01/2015

Structural variants are genomic rearrangements larger than 50 bp, accounting for around 1% of the variation among human genomes. They affect phenotypic diversity and play a role in various diseases, including neurological/neurocognitive disorders and cancer development and progression. Dissecting structural variants from next-generation sequencing data presents several challenges, and a number of approaches have been proposed in the literature. In this mini review, we describe and summarize the latest tools, and their underlying algorithms, designed for the analysis of whole-genome sequencing, whole-exome sequencing, custom captures, and amplicon sequencing data, pointing out the major advantages and drawbacks. We also report a summary of the most recent applications of third-generation sequencing platforms. This assessment provides guidance, with particular emphasis on human genetics and copy number variants, for researchers involved in the investigation of these genomic events.

PUblication MAnagement

Detection of Genomic Structural Variants from Next-Generation Sequencing Data

Author: D'Aurizio Romina
Magi Alberto
Tattini Lorenzo
Publication venue: Frontiers Media SA
Publication date: 01/01/2015

Florence Research

Analysis of variable retroduplications in human populations suggests coupling of retrotransposition to cell division

Author: 1000 Genomes Project C.
Abyzov A.
Balasubramanian S.
Gerstein M.
Gokcumen O.
Habegger L.
Iskow R.
Lee C.
Pei B.
Radke D.
Timmermann B.
Publication venue: Cold Spring Harbor Laboratory
Publication date: 01/12/2013

In primates and other animals, reverse transcription of mRNA followed by genomic integration creates retroduplications. Expressed retroduplications are either “retrogenes” coding for functioning proteins, or expressed “processed pseudogenes,” which can function as noncoding RNAs. To date, little is known about the variation in retroduplications in terms of their presence or absence across individuals in the human population. We have developed new methodologies that allow us to identify “novel” retroduplications (i.e., those not present in the reference genome), to find their insertion points, and to genotype them. Using these methods, we catalogued and analyzed 174 retroduplication variants in almost one thousand humans, which were sequenced as part of Phase 1 of The 1000 Genomes Project Consortium. The accuracy of our data set was corroborated by (1) multiple lines of sequencing evidence for retroduplication (e.g., depth of coverage in exons vs. introns), (2) experimental validation, and (3) the fact that we can reconstruct a correct phylogenetic tree of human subpopulations based solely on retroduplications. We also show that parent genes of retroduplication variants tend to be expressed at the M-to-G1 transition in the cell cycle and that M-to-G1 expressed genes have more copies of fixed retroduplications than genes expressed at other times. These findings suggest that cell division is coupled to retrotransposition and, perhaps, is even a requirement for it

The Jackson Laboratory: The Mouseion at the JAXlibrary

PubMed Central

MPG.PuRe