71 research outputs found

    Multiplexed direct genomic selection (MDiGS): a pooled BAC capture approach for highly accurate CNV and SNP/INDEL detection

    Despite declining sequencing costs, few methods are available for cost-effective single-nucleotide polymorphism (SNP), insertion/deletion (INDEL) and copy number variation (CNV) discovery in a single assay. Commercially available methods require a high investment in a specific region and are only cost-effective for large samples. Here, we introduce a novel, flexible approach for multiplexed targeted sequencing and CNV analysis of large genomic regions called multiplexed direct genomic selection (MDiGS). MDiGS combines biotinylated bacterial artificial chromosome (BAC) capture and multiplexed pooled capture for SNP/INDEL and CNV detection of 96 multiplexed samples on a single MiSeq run. MDiGS is advantageous over other methods for CNV detection because pooled sample capture and hybridization to large contiguous BAC baits reduce the sample and probe hybridization variability inherent in other methods. We performed MDiGS capture for three chromosomal regions consisting of ∼550 kb of coding and non-coding sequence with DNA from 253 patients with congenital lower limb disorders. PITX1 nonsense and HOXC11 S191F missense mutations were identified that segregate in clubfoot families. Using a novel pooled-capture reference strategy, we identified recurrent chromosome 17q23.1q23.2 duplications and small HOXC 5′ cluster deletions (51 kb and 12 kb). Given the current interest in coding and non-coding variants in human disease, MDiGS fulfills a niche for comprehensive and low-cost evaluation of CNVs, coding, and non-coding variants across candidate regions of interest.
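
    The abstract does not spell out how the pooled-capture reference is computed. As a rough illustration only, and not the authors' pipeline, a read-depth comparison against a pool-derived reference might look like the sketch below. The window layout, the pseudocount, the thresholds and every function name here are assumptions made for the example.

```python
import math
from statistics import median

PSEUDOCOUNT = 0.5  # assumed value; keeps log2 defined when a window has zero reads


def log2_ratios(counts_by_sample):
    """Return per-window log2(sample / pooled reference) for each sample.

    counts_by_sample maps a sample name to its read counts per window.
    All samples are assumed to share the same windows in the same order.
    """
    fractions = {}
    for sample, counts in counts_by_sample.items():
        padded = [c + PSEUDOCOUNT for c in counts]
        total = sum(padded)
        # Express each window as a fraction of the sample's reads so that
        # samples sequenced to different depths become comparable.
        fractions[sample] = [p / total for p in padded]

    n_windows = len(next(iter(fractions.values())))
    # Hypothetical pooled reference: the median fraction across the pool.
    reference = [
        median(f[i] for f in fractions.values()) for i in range(n_windows)
    ]
    return {
        sample: [math.log2(f[i] / reference[i]) for i in range(n_windows)]
        for sample, f in fractions.items()
    }


def call_windows(ratios, gain=0.4, loss=-0.6):
    """Label each window; the thresholds are illustrative, not published."""
    calls = []
    for r in ratios:
        if r > gain:
            calls.append("gain")
        elif r < loss:
            calls.append("loss")
        else:
            calls.append("neutral")
    return calls
```

    A median across the pool is used here because a rare CNV carried by one or two samples barely shifts it; a real analysis would also need to handle GC bias, window size selection and common CNVs, none of which this sketch attempts.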

    Error-corrected sequencing strategies enable comprehensive detection of leukemic mutations relevant for diagnosis and minimal residual disease monitoring

    BACKGROUND: Pediatric leukemias have a diverse genomic landscape associated with complex structural variants, including gene fusions, insertions and deletions, and single nucleotide variants. Routine karyotype and fluorescence in situ hybridization (FISH) techniques lack sensitivity for smaller genomic alterations. Next-generation sequencing (NGS) assays are being increasingly utilized for assessment of these various lesions. However, standard NGS lacks quantitative sensitivity for minimal residual disease (MRD) surveillance due to an inherently high error rate. METHODS: Primary bone marrow samples from pediatric leukemia (n = 32) and adult leukemia subjects (n = 5), cell line MV4-11, and an umbilical cord sample were utilized for this study. Samples were sequenced using molecular barcoding with targeted DNA and RNA library enrichment techniques based on anchored multiplexed PCR (AMP®) technology, amplicon-based error-corrected sequencing (ECS) or a human cancer transcriptome assay. Computational analyses were performed to quantitatively assess limit of detection (LOD) for various DNA and RNA lesions, which could be systematically used for MRD assays. RESULTS: Matched leukemia patient samples were analyzed at three time points: diagnosis, end of induction (EOI), and relapse. Similar to flow cytometry for ALL MRD, the LOD for point mutations by these sequencing strategies was ≥0.001. For DNA structural variants, FLT3 internal tandem duplication (ITD) positive cell line and patient samples showed a LOD of ≥0.001 in addition to previously unknown copy number losses in leukemia genes. ECS in RNA identified multiple novel gene fusions, including a SPANT-ABL gene fusion in an ALL patient, which could have been used to alter therapy. Collectively, ECS for RNA demonstrated a quantitative and complex landscape of RNA molecules with 12% of the molecules representing gene fusions, 12% exon duplications, 8% exon deletions, and 68% with retained introns. Droplet digital PCR validation of ECS-RNA confirmed results to single mRNA molecule quantities. CONCLUSIONS: Collectively, these assays enable a highly sensitive, comprehensive, and simultaneous analysis of various clonal leukemic mutations, which can be tracked across disease states (diagnosis, EOI, and relapse) with a high degree of sensitivity. The approaches and results presented here highlight the ability to use NGS for MRD tracking.

    The zebrafish xenograft platform-A novel tool for modeling KSHV-associated diseases

    Kaposi's sarcoma-associated herpesvirus (KSHV, also known as human herpesvirus-8) is a gammaherpesvirus that establishes life-long infection in human B lymphocytes. KSHV infection is typically asymptomatic, but immunosuppression can predispose KSHV-infected individuals to primary effusion lymphoma (PEL), a malignancy driven by aberrant proliferation of latently infected B lymphocytes and supported by pro-inflammatory cytokines and angiogenic factors produced by cells that succumb to lytic viral replication. Here, we report the development of the firs

    Germline Sequencing Identifies Rare Variants in Finnish Subjects with Familial Germ Cell Tumors

    Purpose: Pediatric germ cell tumors are rare, representing about 3% of childhood malignancies in children less than 15 years of age, presenting in neonates or adolescents with a greater incidence noted in older adolescents. Aberrations in primordial germ cell proliferation/differentiation can lead to a variety of neoplasms, including teratomas, embryonal carcinoma, choriocarcinoma, and yolk sac tumors. Patients and Methods: Three Finnish families with varying familial germ cell tumors were identified, and whole-genome sequencing was performed using an Illumina sequencing platform. In total, 22 unique subjects across the three families were sequenced. Family 1 proband (female) was affected by malignant ovarian teratoma, Family 2 proband (female) was affected by sacrococcygeal teratoma with yolk sac tumor in the setting of Cornelia de Lange syndrome, and Family 3 proband (male) was affected by malignant testicular teratoma. Rare variants were identified using an autosomal recessive or de novo model of inheritance. Results: For Family 1 proband (female), an autosomal recessive or de novo model of inheritance identified variants of interest in the following genes: CD109, IKBKB, CTNNA3, SUPT6H, MUC5AC, and FRG1. Family 2 proband (female) analysis identified gene variants of interest in the following genes: LONRF2, ANO7, HS6ST1, PRB2, and DNM2. Family 3 proband (male) analysis identified the following potential genes: CRIPAK, KRTAP5-7, and CACNA1B. Conclusion: Leveraging deep pedigrees and next-generation sequencing, rare germline variants were identified that were enriched in three families from Finland with a history of familial germ cell tumors. The data presented support the importance of germline mutations when analyzing complex cancers with a low somatic mutation landscape.

    Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing

    BACKGROUND: Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regions and comparing against unaffected cohorts. However, despite persistent declines in sequencing costs, population-based rare variant detection across large genomic target regions remains cost prohibitive for most investigators. In addition, DNA samples are often precious and hybridization methods typically require large amounts of input DNA. Pooled sample DNA sequencing is a cost- and time-efficient strategy for surveying populations of individuals for rare variants. We set out to 1) create a scalable, multiplexing method for custom capture with or without individual DNA indexing that was amenable to low amounts of input DNA and 2) expand the functionality of the SPLINTER algorithm for calling substitutions, insertions and deletions across either candidate genes or the entire exome by integrating the variant calling algorithm with the dynamic programming aligner, Novoalign. RESULTS: We report methodology for pooled hybridization capture with pre-enrichment, indexed multiplexing of up to 48 individuals or non-indexed pooled sequencing of up to 92 individuals with as little as 70 ng of DNA per person. Modified solid phase reversible immobilization bead purification strategies enable no sample transfers from sonication in 96-well plates through adapter ligation, resulting in 50% less library preparation reagent consumption. Custom Y-shaped adapters containing novel 7 base pair index sequences with a Hamming distance of ≥2 were directly ligated onto fragmented source DNA, eliminating the need for PCR to incorporate indexes, followed by a custom blocking strategy using a single oligonucleotide regardless of index sequence. These results were obtained by aligning raw reads against the entire genome using Novoalign, followed by variant calling of non-indexed pools using SPLINTER or SAMtools for indexed samples. With these pipelines, we find sensitivity and specificity of 99.4% and 99.7% for pooled exome sequencing. Sensitivity, and to a lesser degree specificity, proved to be a function of coverage. For rare variants (≤2% minor allele frequency), we achieved sensitivity and specificity of ≥94.9% and ≥99.99% for custom capture of 2.5 Mb in multiplexed libraries of 22–48 individuals with only ≥5-fold coverage/chromosome, but these parameters improved to ≥98.7% and 100% with 20-fold coverage/chromosome. CONCLUSIONS: This highly scalable methodology enables accurate rare variant detection, with or without individual DNA sample indexing, while reducing the amount of required source DNA and total costs through less hybridization reagent consumption, multi-sample sonication in a standard PCR plate, multiplexed pre-enrichment pooling with a single hybridization and lower sequencing coverage required to obtain high sensitivity.
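
    The abstract mentions 7 base pair index sequences with a Hamming distance of ≥2. A minimum distance of 2 means that a single base error in an index read cannot turn one valid index into another, so such reads can be rejected instead of being assigned to the wrong sample. The sketch below shows how that property could be checked; the three index sequences and all function names are made up for illustration and are not the published indexes.

```python
from itertools import combinations


def hamming(a, b):
    """Count the positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def min_pairwise_distance(indexes):
    """Smallest Hamming distance between any two indexes in the set."""
    return min(hamming(a, b) for a, b in combinations(indexes, 2))


def assign_read(index_read, indexes):
    """Return the matching index on an exact match, otherwise None.

    With a minimum pairwise distance of 2, a read carrying one sequencing
    error matches no index exactly and is left unassigned.
    """
    return index_read if index_read in indexes else None


# Hypothetical 7-base indexes, chosen only to illustrate the check.
EXAMPLE_INDEXES = ["ACGTACG", "ACGTTGG", "TCGAACG"]
```

    Exact matching is the most conservative choice here. Correcting errors instead of merely detecting them would require a minimum distance of at least 3.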