61 research outputs found
Forensic use of Y-chromosome DNA: a general overview
The male-specific part of the human Y chromosome is widely used in forensic DNA analysis, particularly in cases where standard autosomal DNA profiling is not informative. A Y-chromosomal gene fragment is applied for inferring the biological sex of a crime scene trace donor. Haplotypes composed of Y-chromosomal short tandem repeat polymorphisms (Y-STRs) are used to characterise paternal lineages of unknown male trace donors, especially suitable when males and females have contributed to the same trace, such as in sexual assault cases. Y-STR haplotyping applied in crime scene investigation can (i) exclude male suspects from involvement in crime, (ii) identify the paternal lineage of male perpetrators, (iii) highlight multiple male contributors to a trace, and (iv) provide investigative leads for finding unknown male perpetrators. Y-STR haplotype analysis is employed in paternity disputes of male offspring and other types of paternal kinship testing, including historical cases, as well as in special cases of missing person and disaster victim identification involving men. Y-chromosome polymorphisms are applied for inferring the paternal bio-geographic ancestry of unknown trace donors or missing persons, in cases where autosomal DNA profiling is uninformative. In this overview, all different forensic applications of Y-chromosome DNA are described. To illustrate the necessity of forensic Y-chromosome analysis, the investigation of a prominent murder case is described, which initiated two changes in national forensic DNA legislation both covering Y-chromosome use, and was finally solved via an innovative Y-STR dragnet involving thousands of volunteers after 14 years. Finally, expectations for the future of forensic Y-chromosome DNA analysis are discussed
Multiplex genotyping system for efficient inference of matrilineal genetic ancestry with continental resolution
Abstract
Background: In recent years, phylogeographic studies have produced detailed knowledge on the worldwide
distribution of mitochondrial DNA (mtDNA) variants, linking specific clades of the mtDNA phylogeny with certain
geographic areas. However, a multiplex genotyping system for the detection of the mtDNA haplogroups of major
continental distribution that would be desirable for efficient DNA-based bio-geographic ancestry testing in various
applications is still missing.
Results: Three multiplex genotyping assays, based on single-base primer extension technology, were developed
targeting a total of 36 coding-region mtDNA variants that together differentiate 43 matrilineal haplo-/paragroups. These
include the major diagnostic haplogroups for Africa, Western Eurasia, Eastern Eurasia and Native America. The assays
show high sensitivity with respect to the amount of template DNA: successful amplification could still be obtained
when using as little as 4 pg of genomic DNA and the technology is suitable for medium-throughput analyses.
Conclusions: We introduce an efficient and sensitive multiplex genotyping system for bio-geographic ancestry
inference from mtDNA that provides resolution on the continental level. The method can be applied in forensics,
to aid tracing unknown suspects, as well as in population studies, genealogy and personal ancestry testing. For
more complete inferences of overall bio-geographic ancestry from DNA, the mtDNA system provided here can be
combined with multiplex systems for suitable autosomal and, in the case of males, Y-chromosomal ancestrysensitive
DNA markers
CollapsABEL: An R library for detecting compound heterozygote alleles in genome-wide association studies
Background: Compound Heterozygosity (CH) in classical genetics is the presence of two different recessive mutations at a particular gene locus. A relaxed form of CH alleles may account for an essential proportion of the missing heritability, i.e. heritability of phenotypes so far not accounted for by single genetic variants. Methods to detect CH-like effects in genome-wide association studies (GWAS) may facilitate explaining the missing heritability, but to our knowledge no viable software tools for this purpose are currently available. Results: In this work we present the Generalized Compound Double Heterozygosity (GCDH) test and its implementation in the R package CollapsABEL. Time-consuming procedures are optimized for computational efficiency using Java or C++. Intermediate results are stored either in an SQL database or in a so-called big.matrix file to achieve reasonable memory footprint. Our large scale simulation studies show that GCDH is capable of discovering genetic associations due to CH-like interactions with much higher power than a conventional single-SNP approach under various settings, whether the causal genetic variations are available or not. CollapsABEL provides a user-friendly pipeline for genotype collapsing, statistical testing, power estimation, type I error control and graphics generation in the R language. Conclusions: CollapsABEL provides a computationally efficient solution for screening general forms of CH alleles in densely imputed microarray or whole genome sequencing datasets. The GCDH test provides an improved power over single-SNP based methods in detecting the prevalence of CH in human complex phenotypes, offering an opportunity for tackling the missing heritability problem. Binary and source packages of CollapsABEL are available on CRAN (https://cran.r-project.org/web/packages/CollapsABEL) and the website of the GenABEL project (http://www.genabel.org/packages)
GAGA: A New Algorithm for Genomic Inference of Geographic Ancestry Reveals Fine Level Population Substructure in Europeans
Attempts to detect genetic population substructure in humans are troubled by the fact that the vast majority of the total amount of observed genetic variation is present within populations rather than between populations. Here we introduce a new algorithm for transforming a genetic distance matrix that reduces the within-population variation considerably. Extensive computer simulations revealed that the transformed matrix captured the genetic population differentiation better than the original one which was based on the T1 statistic. In an empirical genomic data set comprising 2,457 individuals from 23 different European subpopulations, the proportion of individuals that were determined as a genetic neighbour to another individual from the same sampling location increased from 25% with the original matrix to 52% with the transformed matrix. Similarly, the percentage of genetic variation explained between populations by means of Analysis of Molecular Variance (AMOVA) increased from 1.62% to 7.98%. Furthermore, the first two dimensions of a classical multidimensional scaling (MDS) using the transformed matrix explained 15% of the variance, compared to 0.7% obtained with the original matrix. Application of MDS with Mclust, SPA with Mclust, and GemTools algorithms to the same dataset also showed that the transformed matrix gave a better association of the genetic clusters with the sampling locations, and particularly so when it was used in the AMOVA framework with a genetic algorithm. Overall, the new matrix transformation introduced here substantially reduces the within population genetic differentiation, and can be broadly applied to methods such as AMOVA to enhance their sensitivity to reveal population substructure. We herewith provide a publically available (http://www.erasmusmc.nl/fmb/resources/GAGA) model-free method for improved genetic population substructure detection that can be applied to human as well as any other species data in future studies relevant to evolutionary biology, behavioural ecology, medicine, and forensics
Proportioning whole-genome single-nucleotide-polymorphism diversity for the identification of geographic population structure and genetic ancestry
The identification of geographic population structure and genetic ancestry
on the basis of a minimal set of genetic markers is desirable for a wide
range of applications in medical and forensic sciences. However, the
absence of sharp discontinuities in the neutral genetic diversity among
human populations implies that, in practice, a large number of neutral
markers will be required to identify the genetic ancestry of one
individual. We showed that it is possible to reduce the amount of markers
required for detecting continental population structure to only 10
single-nucleotide polymorphisms (SNPs), by applying a newly developed
ascertainment algorithm to Affymetrix GeneChip Mapping 10K SNP array data
that we obtained from samples of globally dispersed human individuals (the
Y Chromosome Consortium panel). Furthermore, this set of SNPs was able to
recover the genetic ancestry of individuals from all four continents
represented in the original data set when applied to an independent, much
larger, worldwide population data set (Centre d'Etude du Polymorphisme
Humain-Human Genome Diversity Project Cell Line Panel). Finally, we
provide evidence that the unusual patterns of genetic variation we
observed at the respective genomic regions surrounding the five most
informative SNPs is in agreement with local positive selection being the
explanation for the striking SNP allele-frequency differences we found
between continental groups of human populations
Evaluation of the VISAGE Basic Tool for Appearance and Ancestry Prediction Using PowerSeq Chemistry on the MiSeq FGx System
The study of DNA to predict externally visible characteristics (EVCs) and the biogeographical
ancestry (BGA) from unknown samples is gaining relevance in forensic genetics. Technical
developments in Massively Parallel Sequencing (MPS) enable the simultaneous analysis of hundreds of
DNA markers, which improves successful Forensic DNA Phenotyping (FDP). The EU-funded VISAGE
(VISible Attributes through GEnomics) Consortium has developed various targeted MPS-based lab
tools to apply FDP in routine forensic analyses. Here, we present an evaluation of the VISAGE Basic
tool for appearance and ancestry prediction based on PowerSeq chemistry (Promega) on a MiSeq FGx
System (Illumina). The panel consists of 153 single nucleotide polymorphisms (SNPs) that provide
information about EVCs (41 SNPs for eye, hair and skin color from HIrisPlex-S) and continental BGA
(115 SNPs; three overlap with the EVCs SNP set). The assay was evaluated for sensitivity, repeatability
and genotyping concordance, as well as its performance with casework-type samples. This targeted
MPS assay provided complete genotypes at all 153 SNPs down to 125 pg of input DNA and 99.67%
correct genotypes at 50 pg. It was robust in terms of repeatability and concordance and provided
useful results with casework-type samples. The results suggest that this MPS assay is a useful tool
for basic appearance and ancestry prediction in forensic genetics for users interested in applying
PowerSeq chemistry and MiSeq for this purpose
- …