6 research outputs found

    Human Whole-Exome Genotype Data For alzheimer\u27s Disease

    Get PDF
    The heterogeneity of the whole-exome sequencing (WES) data generation methods present a challenge to a joint analysis. Here we present a bioinformatics strategy for joint-calling 20,504 WES samples collected across nine studies and sequenced using ten capture kits in fourteen sequencing centers in the Alzheimer\u27s Disease Sequencing Project. The joint-genotype called variant-called format (VCF) file contains only positions within the union of capture kits. The VCF was then processed specifically to account for the batch effects arising from the use of different capture kits from different studies. We identified 8.2 million autosomal variants. 96.82% of the variants are high-quality, and are located in 28,579 Ensembl transcripts. 41% of the variants are intronic and 1.8% of the variants are with CADD \u3e 30, indicating they are of high predicted pathogenicity. Here we show our new strategy can generate high-quality data from processing these diversely generated WES samples. The improved ability to combine data sequenced in different batches benefits the whole genomics research community

    The early-onset Alzheimer's disease whole-genome sequencing project: Study design and methodology

    No full text
    INTRODUCTION Sequencing efforts to identify genetic variants and pathways underlying Alzheimer's disease (AD) have largely focused on late-onset AD although early-onset AD (EOAD), accounting for ∼10% of cases, is largely unexplained by known mutations, resulting in a lack of understanding of its molecular etiology. METHODS Whole-genome sequencing and harmonization of clinical, neuropathological, and biomarker data of over 5000 EOAD cases of diverse ancestries. RESULTS A publicly available genomics resource for EOAD with extensive harmonized phenotypes. Primary analysis will (1) identify novel EOAD risk loci and druggable targets; (2) assess local-ancestry effects; (3) create EOAD prediction models; and (4) assess genetic overlap with cardiovascular and other traits. DISCUSSION This novel resource complements over 50,000 control and late-onset AD samples generated through the Alzheimer's Disease Sequencing Project (ADSP). The harmonized EOAD/ADSP joint call will be available through upcoming ADSP data releases and will allow for additional analyses across the full onset range. Highlights Sequencing efforts to identify genetic variants and pathways underlying Alzheimer's disease (AD) have largely focused on late-onset AD although early-onset AD (EOAD), accounting for ∼10% of cases, is largely unexplained by known mutations. This results in a significant lack of understanding of the molecular etiology of this devastating form of the disease. The Early-Onset Alzheimer's Disease Whole-genome Sequencing Project is a collaborative initiative to generate a large-scale genomics resource for early-onset Alzheimer's disease with extensive harmonized phenotype data. Primary analyses are designed to (1) identify novel EOAD risk and protective loci and druggable targets; (2) assess local-ancestry effects; (3) create EOAD prediction models; and (4) assess genetic overlap with cardiovascular and other traits. The harmonized genomic and phenotypic data from this initiative will be available through NIAGADS

    Human whole-exome genotype data for Alzheimer's disease

    No full text
    The heterogeneity of the whole-exome sequencing (WES) data generation methods present a challenge to a joint analysis. Here we present a bioinformatics strategy for joint-calling 20,504 WES samples collected across nine studies and sequenced using ten capture kits in fourteen sequencing centers in the Alzheimer's Disease Sequencing Project. The joint-genotype called variant-called format (VCF) file contains only positions within the union of capture kits. The VCF was then processed specifically to account for the batch effects arising from the use of different capture kits from different studies. We identified 8.2 million autosomal variants. 96.82% of the variants are high-quality, and are located in 28,579 Ensembl transcripts. 41% of the variants are intronic and 1.8% of the variants are with CADD > 30, indicating they are of high predicted pathogenicity. Here we show our new strategy can generate high-quality data from processing these diversely generated WES samples. The improved ability to combine data sequenced in different batches benefits the whole genomics research community

    Human whole-exome genotype data for Alzheimer’s disease

    Get PDF
    The heterogeneity of the whole-exome sequencing (WES) data generation methods present a challenge to a joint analysis. Here we present a bioinformatics strategy for joint-calling 20,504 WES samples collected across nine studies and sequenced using ten capture kits in fourteen sequencing centers in the Alzheimer’s Disease Sequencing Project. The joint-genotype called variant-called format (VCF) file contains only positions within the union of capture kits. The VCF was then processed specifically to account for the batch effects arising from the use of different capture kits from different studies. We identified 8.2 million autosomal variants. 96.82% of the variants are high-quality, and are located in 28,579 Ensembl transcripts. 41% of the variants are intronic and 1.8% of the variants are with CADD &gt; 30, indicating they are of high predicted pathogenicity. Here we show our new strategy can generate high-quality data from processing these diversely generated WES samples. The improved ability to combine data sequenced in different batches benefits the whole genomics research community.</p
    corecore