69 research outputs found
The next generation of target capture technologies - large DNA fragment enrichment and sequencing determines regional genomic variation of high complexity
Abstract Background The ability to capture and sequence large contiguous DNA fragments represents a significant advancement towards the comprehensive characterization of complex genomic regions. While emerging sequencing platforms are capable of producing several kilobases-long reads, the fragment sizes generated by current DNA target enrichment technologies remain a limiting factor, producing DNA fragments generally shorter than 1 kbp. The DNA enrichment methodology described herein, Region-Specific Extraction (RSE), produces DNA segments in excess of 20 kbp in length. Coupling this enrichment method to appropriate sequencing platforms will significantly enhance the ability to generate complete and accurate sequence characterization of any genomic region without the need for reference-based assembly. Results RSE is a long-range DNA target capture methodology that relies on the specific hybridization of short (20-25 base) oligonucleotide primers to selected sequence motifs within the DNA target region. These capture primers are then enzymatically extended on the 3’-end, incorporating biotinylated nucleotides into the DNA. Streptavidin-coated beads are subsequently used to pull-down the original, long DNA template molecules via the newly synthesized, biotinylated DNA that is bound to them. We demonstrate the accuracy, simplicity and utility of the RSE method by capturing and sequencing a 4 Mbp stretch of the major histocompatibility complex (MHC). Our results show an average depth of coverage of 164X for the entire MHC. This depth of coverage contributes significantly to a 99.94 % total coverage of the targeted region and to an accuracy that is over 99.99 %. Conclusions RSE represents a cost-effective target enrichment method capable of producing sequencing templates in excess of 20 kbp in length. The utility of our method has been proven to generate superior coverage across the MHC as compared to other commercially available methodologies, with the added advantage of producing longer sequencing templates amenable to DNA sequencing on recently developed platforms. Although our demonstration of the method does not utilize these DNA sequencing platforms directly, our results indicate that the capture of long DNA fragments produce superior coverage of the targeted region
A method for measuring mitochondrial DNA copy number in pediatric populations
The mitochondrion is a multifunctional organelle that modulates multiple systems critical for homeostasis during pathophysiological stress. Variation in mitochondrial DNA (mtDNA) copy number (mtDNAcn), a key mitochondrial change associated with chronic stress, is an emerging biomarker for disease pathology and progression. mtDNAcn can be quantified from whole blood samples using qPCR to determine the ratio of mtDNA to nuclear DNA. However, the collection of blood samples in pediatric populations, particularly in infants and young children, can be technically challenging, yield much smaller volume samples, and can be distressing for the patients and their caregivers. Therefore, we have validated a mtDNAcn assay utilizing DNA from simple buccal swabs (Isohelix SK-2S) and report here it's performance in specimens from infants (age = <12 months). Utilizing qPCR to amplify ∼200 bp regions from two mitochondrial (ND1, ND6) and two nuclear (BECN1, NEB) genes, we demonstrated absolute (100%) concordance with results from low-pass whole genome sequencing (lpWGS). We believe that this method overcomes key obstacles to measuring mtDNAcn in pediatric populations and creates the possibility for development of clinical assays to measure mitochondrial change during pathophysiological stress
CNV Workshop: an integrated platform for high-throughput copy number variation discovery and clinical diagnostics
<p>Abstract</p> <p>Background</p> <p>Recent studies have shown that copy number variations (CNVs) are frequent in higher eukaryotes and associated with a substantial portion of inherited and acquired risk for various human diseases. The increasing availability of high-resolution genome surveillance platforms provides opportunity for rapidly assessing research and clinical samples for CNV content, as well as for determining the potential pathogenicity of identified variants. However, few informatics tools for accurate and efficient CNV detection and assessment currently exist.</p> <p>Results</p> <p>We developed a suite of software tools and resources (CNV Workshop) for automated, genome-wide CNV detection from a variety of SNP array platforms. CNV Workshop includes three major components: detection, annotation, and presentation of structural variants from genome array data. CNV detection utilizes a robust and genotype-specific extension of the Circular Binary Segmentation algorithm, and the use of additional detection algorithms is supported. Predicted CNVs are captured in a MySQL database that supports cohort-based projects and incorporates a secure user authentication layer and user/admin roles. To assist with determination of pathogenicity, detected CNVs are also annotated automatically for gene content, known disease loci, and gene-based literature references. Results are easily queried, sorted, filtered, and visualized via a web-based presentation layer that includes a GBrowse-based graphical representation of CNV content and relevant public data, integration with the UCSC Genome Browser, and tabular displays of genomic attributes for each CNV.</p> <p>Conclusions</p> <p>To our knowledge, CNV Workshop represents the first cohesive and convenient platform for detection, annotation, and assessment of the biological and clinical significance of structural variants. CNV Workshop has been successfully utilized for assessment of genomic variation in healthy individuals and disease cohorts and is an ideal platform for coordinating multiple associated projects.</p> <p>Availability and Implementation</p> <p>Available on the web at: <url>http://sourceforge.net/projects/cnv</url></p
Sequence mining and transcript profiling to explore cyst nematode parasitism
<p>Abstract</p> <p>Background</p> <p>Cyst nematodes are devastating plant parasites that become sedentary within plant roots and induce the transformation of normal plant cells into elaborate feeding cells with the help of secreted effectors, the parasitism proteins. These proteins are the translation products of parasitism genes and are secreted molecular tools that allow cyst nematodes to infect plants.</p> <p>Results</p> <p>We present here the expression patterns of all previously described parasitism genes of the soybean cyst nematode, <it>Heterodera glycines</it>, in all major life stages except the adult male. These insights were gained by analyzing our gene expression dataset from experiments using the Affymetrix Soybean Genome Array GeneChip, which contains probeset sequences for 6,860 genes derived from preparasitic and parasitic <it>H. glycines </it>life stages. Targeting the identification of additional <it>H. glycines </it>parasitism-associated genes, we isolated 633 genes encoding secretory proteins using algorithms to predict secretory signal peptides. Furthermore, because some of the known <it>H. glycines </it>parasitism proteins have strongest similarity to proteins of plants and microbes, we searched for predicted protein sequences that showed their highest similarities to plant or microbial proteins and identified 156 <it>H. glycines </it>genes, some of which also contained a signal peptide. Analyses of the expression profiles of these genes allowed the formulation of hypotheses about potential roles in parasitism. This is the first study combining sequence analyses of a substantial EST dataset with microarray expression data of all major life stages (except adult males) for the identification and characterization of putative parasitism-associated proteins in any parasitic nematode.</p> <p>Conclusion</p> <p>We have established an expression atlas for all known <it>H. glycines </it>parasitism genes. Furthermore, in an effort to identify additional <it>H. glycines </it>genes with putative functions in parasitism, we have reduced the currently known 6,860 <it>H. glycines </it>genes to a pool of 788 most promising candidate genes (including known parasitism genes) and documented their expression profiles. Using our approach to pre-select genes likely involved in parasitism now allows detailed functional analyses in a manner not feasible for larger numbers of genes. The generation of the candidate pool described here is an important enabling advance because it will significantly facilitate the unraveling of fascinating plant-animal interactions and deliver knowledge that can be transferred to other pathogen-host systems. Ultimately, the exploration of true parasitism genes verified from the gene pool delineated here will identify weaknesses in the nematode life cycle that can be exploited by novel anti-nematode efforts.</p
Divergent evolution of arrested development in the dauer stage of Caenorhabditis elegans and the infective stage of Heterodera glycines
The generation and analysis of over 20,000 ESTs allowed the identification and expression profiling of 6,860 predicted genes in the nematode Heterodera glycines. This revealed that gene expression patterns in the dauer stage of Caenorhabditis elegans are not conserved in H. glycines
Mitochondrial genome sequence analysis: A custom bioinformatics pipeline substantially improves Affymetrix MitoChip v2.0 call rate and accuracy
BACKGROUND: Mitochondrial genome sequence analysis is critical to the diagnostic evaluation of mitochondrial disease. Existing methodologies differ widely in throughput, complexity, cost efficiency, and sensitivity of heteroplasmy detection. Affymetrix MitoChip v2.0, which uses a sequencing-by-genotyping technology, allows potentially accurate and high-throughput sequencing of the entire human mitochondrial genome to be completed in a cost-effective fashion. However, the relatively low call rate achieved using existing software tools has limited the wide adoption of this platform for either clinical or research applications. Here, we report the design and development of a custom bioinformatics software pipeline that achieves a much improved call rate and accuracy for the Affymetrix MitoChip v2.0 platform. We used this custom pipeline to analyze MitoChip v2.0 data from 24 DNA samples representing a broad range of tissue types (18 whole blood, 3 skeletal muscle, 3 cell lines), mutations (a 5.8 kilobase pair deletion and 6 known heteroplasmic mutations), and haplogroup origins. All results were compared to those obtained by at least one other mitochondrial DNA sequence analysis method, including Sanger sequencing, denaturing HPLC-based heteroduplex analysis, and/or the Illumina Genome Analyzer II next generation sequencing platform.
RESULTS: An average call rate of 99.75% was achieved across all samples with our custom pipeline. Comparison of calls for 15 samples characterized previously by Sanger sequencing revealed a total of 29 discordant calls, which translates to an estimated 0.012% for the base call error rate. We successfully identified 4 known heteroplasmic mutations and 24 other potential heteroplasmic mutations across 20 samples that passed quality control.
CONCLUSIONS: Affymetrix MitoChip v2.0 analysis using our optimized MitoChip Filtering Protocol (MFP) bioinformatics pipeline now offers the high sensitivity and accuracy needed for reliable, high-throughput and cost-efficient whole mitochondrial genome sequencing. This approach provides a viable alternative of potential utility for both clinical diagnostic and research applications to traditional Sanger and other emerging sequencing technologies for whole mitochondrial genome analysis
Recommended from our members
Characterization of cells from patient-derived fibrovascular membranes in proliferative diabetic retinopathy
Purpose Epiretinal fibrovascular membranes (FVMs) are a hallmark of proliferative diabetic retinopathy (PDR). Surgical removal of FVMs is often indicated to treat tractional retinal detachment. This potentially informative pathological tissue is usually disposed of after surgery without further examination. We developed a method for isolating and characterizing cells derived from FVMs and correlated their expression of specific markers in culture with that in tissue. Methods: FVMs were obtained from 11 patients with PDR during diabetic vitrectomy surgery and were analyzed with electron microscopy (EM), comparative genomic hybridization (CGH), immunohistochemistry, and/or digested with collagenase II for cell isolation and culture. Antibody arrays and enzyme-linked immunosorbent assay (ELISA) were used to profile secreted angiogenesis-related proteins in cell culture supernatants. Results: EM analysis of the FVMs showed abnormal vessels composed of endothelial cells with large nuclei and plasma membrane infoldings, loosely attached perivascular cells, and stromal cells. The cellular constituents of the FVMs lacked major chromosomal aberrations as shown with CGH. Cells derived from FVMs (C-FVMs) could be isolated and maintained in culture. The C-FVMs retained the expression of markers of cell identity in primary culture, which define specific cell populations including CD31-positive, alpha-smooth muscle actin-positive (SMA), and glial fibrillary acidic protein-positive (GFAP) cells. In primary culture, secretion of angiopoietin-1 and thrombospondin-1 was significantly decreased in culture conditions that resemble a diabetic environment in SMA-positive C-FVMs compared to human retinal pericytes derived from a non-diabetic donor. Conclusions: C-FVMs obtained from individuals with PDR can be isolated, cultured, and profiled in vitro and may constitute a unique resource for the discovery of cell signaling mechanisms underlying PDR that extends beyond current animal and cell culture models
Recommended from our members
NMNAT1 Mutations Cause Leber Congenital Amaurosis
Leber congenital amaurosis (LCA) is an infantile-onset form of inherited retinal degeneration characterized by severe vision loss. Two-thirds of LCA cases are caused by mutations in 17 known disease genes (RetNet Retinal Information Network). Using exome sequencing, we identified a homozygous missense mutation (c.25G>A, p.Val9Met) in NMNAT1 as likely disease-causing in two siblings of a consanguineous Pakistani kindred affected by LCA. This mutation segregated with disease in their kindred, including in three other children with LCA. NMNAT1 resides in the previously identified LCA9 locus and encodes the nuclear isoform of nicotinamide mononucleotide adenylyltransferase, a rate-limiting enzyme in nicotinamide adenine dinucleotide biosynthesis. Functional studies showed the p.Val9Met mutation decreased NMNAT1 enzyme activity. Sequencing NMNAT1 in 284 unrelated LCA families identified 14 rare mutations in 13 additional affected individuals. These results are the first to link an NMNAT isoform to disease and indicate that NMNAT1 mutations cause LCA
Resistance to autosomal dominant Alzheimer's disease in an APOE3 Christchurch homozygote: a case report.
We identified a PSEN1 (presenilin 1) mutation carrier from the world's largest autosomal dominant Alzheimer's disease kindred, who did not develop mild cognitive impairment until her seventies, three decades after the expected age of clinical onset. The individual had two copies of the APOE3 Christchurch (R136S) mutation, unusually high brain amyloid levels and limited tau and neurodegenerative measurements. Our findings have implications for the role of APOE in the pathogenesis, treatment and prevention of Alzheimer's disease
Recommended from our members
Panel-based Genetic Diagnostic Testing for Inherited Eye Diseases is Highly Accurate and Reproducible and More Sensitive for Variant Detection Than Exome Sequencing
Purpose Next-generation sequencing (NGS) based methods are being adopted broadly for genetic diagnostic testing, but the performance characteristics of these techniques have not been fully defined with regard to test accuracy and reproducibility. Methods: We developed a targeted enrichment and NGS approach for genetic diagnostic testing of patients with inherited eye disorders, including inherited retinal degenerations, optic atrophy and glaucoma. In preparation for providing this Genetic Eye Disease (GEDi) test on a CLIA-certified basis, we performed experiments to measure the sensitivity, specificity, reproducibility as well as the clinical sensitivity of the test. Results: The GEDi test is highly reproducible and accurate, with sensitivity and specificity for single nucleotide variant detection of 97.9% and 100%, respectively. The sensitivity for variant detection was notably better than the 88.3% achieved by whole exome sequencing (WES) using the same metrics, due to better coverage of targeted genes in the GEDi test compared to commercially available exome capture sets. Prospective testing of 192 patients with IRDs indicated that the clinical sensitivity of the GEDi test is high, with a diagnostic rate of 51%. Conclusion: The data suggest that based on quantified performance metrics, selective targeted enrichment is preferable to WES for genetic diagnostic testing
- …