81 research outputs found

    Dissecting Allele Architecture of Early Onset IBD Using High-Density Genotyping

    Get PDF
    BACKGROUND: The inflammatory bowel diseases (IBD) are common, complex disorders in which genetic and environmental factors are believed to interact leading to chronic inflammatory responses against the gut microbiota. Earlier genetic studies performed in mostly adult population of European descent identified 163 loci affecting IBD risk, but most have relatively modest effect sizes, and altogether explain only ~20% of the genetic susceptibility. Pediatric onset represents about 25% of overall incident cases in IBD, characterized by distinct disease physiology, course and risks. The goal of this study is to compare the allelic architecture of early onset IBD with adult onset in population of European descent. METHODS: We performed a fine mapping association study of early onset IBD using high-density Immunochip genotyping on 1008 pediatric-onset IBD cases (801 Crohn\u27s disease; 121 ulcerative colitis and 86 IBD undetermined) and 1633 healthy controls. Of the 158 SNP genotypes obtained (out of the 163 identified in adult onset), this study replicated 4% (5 SNPs out of 136) of the SNPs identified in the Crohn\u27s disease (CD) cases and 0.8% (1 SNP out of 128) in the ulcerative colitis (UC) cases. Replicated SNPs implicated the well known NOD2 and IL23R. The point estimate for the odds ratio (ORs) for NOD2 was above and outside the confidence intervals reported in adult onset. A polygenic liability score weakly predicted the age of onset for a larger collection of CD cases (p\u3c 0.03, R2= 0.007), but not for the smaller number of UC cases. CONCLUSIONS: The allelic architecture of common susceptibility variants for early onset IBD is similar to that of adult onset. This immunochip genotyping study failed to identify additional common variants that may explain the distinct phenotype that characterize early onset IBD. A comprehensive dissection of genetic loci is necessary to further characterize the genetic architecture of early onset IBD

    Efficient oligonucleotide probe selection for pan-genomic tiling arrays

    Get PDF
    Background: Array comparative genomic hybridization is a fast and cost-effective method for detecting, genotyping, and comparing the genomic sequence of unknown bacterial isolates. This method, as with all microarray applications, requires adequate coverage of probes targeting the regions of interest. An unbiased tiling of probes across the entire length of the genome is the most flexible design approach. However, such a whole-genome tiling requires that the genome sequence is known in advance. For the accurate analysis of uncharacterized bacteria, an array must query a fully representative set of sequences from the species' pan-genome. Prior microarrays have included only a single strain per array or the conserved sequences of gene families. These arrays omit potentially important genes and sequence variants from the pan-genome. Results: This paper presents a new probe selection algorithm (PanArray) that can tile multiple whole genomes using a minimal number of probes. Unlike arrays built on clustered gene families, PanArray uses an unbiased, probe-centric approach that does not rely on annotations, gene clustering, or multi-alignments. Instead, probes are evenly tiled across all sequences of the pangenome at a consistent level of coverage. To minimize the required number of probes, probes conserved across multiple strains in the pan-genome are selected first, and additional probes are used only where necessary to span polymorphic regions of the genome. The viability of the algorithm is demonstrated by array designs for seven different bacterial pan-genomes and, in particular, the design of a 385,000 probe array that fully tiles the genomes of 20 different Listeria monocytogenes strains with overlapping probes at greater than twofold coverage. Conclusion: PanArray is an oligonucleotide probe selection algorithm for tiling multiple genome sequences using a minimal number of probes. It is capable of fully tiling all genomes of a species on a single microarray chip. These unique pan-genome tiling arrays provide maximum flexibility for the analysis of both known and uncharacterized strains.https://doi.org/10.1186/1471-2105-10-29

    Targeted next-generation sequencing of DNA regions proximal to a conserved GXGXXG signaling motif enables systematic discovery of tyrosine kinase fusions in cancer

    Get PDF
    Tyrosine kinase (TK) fusions are attractive drug targets in cancers. However, rapid identification of these lesions has been hampered by experimental limitations. Our in silico analysis of known cancer-derived TK fusions revealed that most breakpoints occur within a defined region upstream of a conserved GXGXXG kinase motif. We therefore designed a novel DNA-based targeted sequencing approach to screen systematically for fusions within the 90 human TKs; it should detect 92% of known TK fusions. We deliberately paired β€˜in-solution’ DNA capture with 454 sequencing to minimize starting material requirements, take advantage of long sequence reads, and facilitate mapping of fusions. To validate this platform, we analyzed genomic DNA from thyroid cancer cells (TPC-1) and leukemia cells (KG-1) with fusions known only at the mRNA level. We readily identified for the first time the genomic fusion sequences of CCDC6-RET in TPC-1 cells and FGFR1OP2-FGFR1 in KG-1 cells. These data demonstrate the feasibility of this approach to identify TK fusions across multiple human cancers in a high-throughput, unbiased manner. This method is distinct from other similar efforts, because it focuses specifically on targets with therapeutic potential, uses only 1.5 ¡g of DNA, and circumvents the need for complex computational sequence analysis

    Comprehensive assessment of sequence variation within the copy number variable defensin cluster on 8p23 by target enriched in-depth 454 sequencing

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In highly copy number variable (CNV) regions such as the human defensin gene locus, comprehensive assessment of sequence variations is challenging. PCR approaches are practically restricted to tiny fractions, and next-generation sequencing (NGS) approaches of whole individual genomes e.g. by the 1000 Genomes Project is confined by an affordable sequence depth. Combining target enrichment with NGS may represent a feasible approach.</p> <p>Results</p> <p>As a proof of principle, we enriched a ~850 kb section comprising the CNV defensin gene cluster DEFB, the invariable DEFA part and 11 control regions from two genomes by sequence capture and sequenced it by 454 technology. 6,651 differences to the human reference genome were found. Comparison to HapMap genotypes revealed sensitivities and specificities in the range of 94% to 99% for the identification of variations.</p> <p>Using error probabilities for rigorous filtering revealed 2,886 unique single nucleotide variations (SNVs) including 358 putative novel ones. DEFB CN determinations by haplotype ratios were in agreement with alternative methods.</p> <p>Conclusion</p> <p>Although currently labor extensive and having high costs, target enriched NGS provides a powerful tool for the comprehensive assessment of SNVs in highly polymorphic CNV regions of individual genomes. Furthermore, it reveals considerable amounts of putative novel variations and simultaneously allows CN estimation.</p

    Transethnic analysis of the human leukocyte antigen region for ulcerative colitis reveals not only shared but also ethnicity-specific disease associations

    Get PDF
    Inflammatory bowel disease (IBD) is a chronic inflammatory disease of the gut. Genetic association studies have identified the highly variable human leukocyte antigen (HLA) region as the strongest susceptibility locus for IBD, and specifically DRB1*01:03 as a determining factor for ulcerative colitis (UC). However, for most of the association signal such a delineation could not be made due to tight structures of linkage disequilibrium within the HLA. The aim of this study was therefore to further characterize the HLA signal using a trans-ethnic approach. We performed a comprehensive fine mapping of single HLA alleles in UC in a cohort of 9,272 individuals with African American, East Asian, Puerto Rican, Indian and Iranian descent and 40,691 previously analyzed Caucasians, additionally analyzing whole HLA haplotypes. We computationally characterized the binding of associated HLA alleles to human self-peptides and analysed the physico-chemical properties of the HLA proteins and predicted self-peptidomes. Highlighting alleles of the HLA-DRB1*15 group and their correlated HLA-DQ-DR haplotypes, we identified consistent associations across different ethnicities but also identified population-specific signals. We observed that DRB1*01:03 is mostly present in individuals of Western European descent and hardly present in non-Caucasian individuals. We found peptides predicted to bind to risk HLA alleles to be rich in positively charged amino acids such. We conclude that the HLA plays an important role for UC susceptibility across different ethnicities. This research further implicates specific features of peptides that are predicted to bind risk and protective HLA proteins

    High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera.</p> <p>Results</p> <p>We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of <it>Eucalyptus </it>from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for <it>E. grandis</it>. A systematic assessment of <it>in silico </it>SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous <it>in silico </it>constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species.</p> <p>SNP reliability was high across nine <it>Eucalyptus </it>species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased.</p> <p>Conclusions</p> <p>This study indicates that the GGGT performs well both within and across species of <it>Eucalyptus </it>notwithstanding its nucleotide diversity β‰₯2%. The development of a much larger array of informative SNPs across multiple <it>Eucalyptus </it>species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in <it>Eucalyptus</it>.</p

    Identification of factors required for meristem function in Arabidopsis using a novel next generation sequencing fast forward genetics approach

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Phenotype-driven forward genetic experiments are powerful approaches for linking phenotypes to genomic elements but they still involve a laborious positional cloning process. Although sequencing of complete genomes now becomes available, discriminating causal mutations from the enormous amounts of background variation remains a major challenge.</p> <p>Method</p> <p>To improve this, we developed a universal two-step approach, named 'fast forward genetics', which combines traditional bulk segregant techniques with targeted genomic enrichment and next-generation sequencing technology</p> <p>Results</p> <p>As a proof of principle we successfully applied this approach to two Arabidopsis mutants and identified a novel factor required for stem cell activity.</p> <p>Conclusion</p> <p>We demonstrated that the 'fast forward genetics' procedure efficiently identifies a small number of testable candidate mutations. As the approach is independent of genome size, it can be applied to any model system of interest. Furthermore, we show that experiments can be multiplexed and easily scaled for the identification of multiple individual mutants in a single sequencing run.</p

    Systematic generation of in vivo G protein-coupled receptor mutants in the rat

    Get PDF
    G-protein-coupled receptors (GPCRs) constitute a large family of cell surface receptors that are involved in a wide range of physiological and pathological processes, and are targets for many therapeutic interventions. However, genetic models in the rat, one of the most widely used model organisms in physiological and pharmacological research, are largely lacking. Here, we applied N-ethyl-N-nitrosourea (ENU)-driven target-selected mutagenesis to generate an in vivo GPCR mutant collection in the rat. A pre-selected panel of 250 human GPCR homologs was screened for mutations in 813 rats, resulting in the identification of 131 non-synonymous mutations. From these, seven novel potential rat gene knockouts were established as well as 45 lines carrying missense mutations in various genes associated with or involved in human diseases. We provide extensive in silico modeling results of the missense mutations and show experimental data, suggesting loss-of-function phenotypes for several models, including Mc4r and Lpar1. Taken together, the approach used resulted not only in a set of novel gene knockouts, but also in allelic series of more subtle amino acid variants, similar as commonly observed in human disease. The mutants presented here may greatly benefit studies to understand specific GPCR function and support the development of novel therapeutic strategies

    Targeted high throughput sequencing in clinical cancer Settings: formaldehyde fixed-paraffin embedded (FFPE) tumor tissues, input amount and tumor heterogeneity

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Massively parallel sequencing technologies have brought an enormous increase in sequencing throughput. However, these technologies need to be further improved with regard to reproducibility and applicability to clinical samples and settings.</p> <p>Methods</p> <p>Using identification of genetic variations in prostate cancer as an example we address three crucial challenges in the field of targeted re-sequencing: Small nucleotide variation (SNV) detection in samples of formalin-fixed paraffin embedded (FFPE) tissue material, minimal amount of input sample and sampling in view of tissue heterogeneity.</p> <p>Results</p> <p>We show that FFPE tissue material can supplement for fresh frozen tissues for the detection of SNVs and that solution-based enrichment experiments can be accomplished with small amounts of DNA with only minimal effects on enrichment uniformity and data variance.</p> <p>Finally, we address the question whether the heterogeneity of a tumor is reflected by different genetic alterations, e.g. different foci of a tumor display different genomic patterns. We show that the tumor heterogeneity plays an important role for the detection of copy number variations.</p> <p>Conclusions</p> <p>The application of high throughput sequencing technologies in cancer genomics opens up a new dimension for the identification of disease mechanisms. In particular the ability to use small amounts of FFPE samples available from surgical tumor resections and histopathological examinations facilitates the collection of precious tissue materials. However, care needs to be taken in regard to the locations of the biopsies, which can have an influence on the prediction of copy number variations. Bearing these technological challenges in mind will significantly improve many large-scale sequencing studies and will - in the long term - result in a more reliable prediction of individual cancer therapies.</p

    Mapping of the Disease Locus and Identification of ADAMTS10 As a Candidate Gene in a Canine Model of Primary Open Angle Glaucoma

    Get PDF
    Primary open angle glaucoma (POAG) is a leading cause of blindness worldwide, with elevated intraocular pressure as an important risk factor. Increased resistance to outflow of aqueous humor through the trabecular meshwork causes elevated intraocular pressure, but the specific mechanisms are unknown. In this study, we used genome-wide SNP arrays to map the disease gene in a colony of Beagle dogs with inherited POAG to within a single 4 Mb locus on canine chromosome 20. The Beagle POAG locus is syntenic to a previously mapped human quantitative trait locus for intraocular pressure on human chromosome 19. Sequence capture and next-generation sequencing of the entire canine POAG locus revealed a total of 2,692 SNPs segregating with disease. Of the disease-segregating SNPs, 54 were within exons, 8 of which result in amino acid substitutions. The strongest candidate variant causes a glycine to arginine substitution in a highly conserved region of the metalloproteinase ADAMTS10. Western blotting revealed ADAMTS10 protein is preferentially expressed in the trabecular meshwork, supporting an effect of the variant specific to aqueous humor outflow. The Gly661Arg variant in ADAMTS10 found in the POAG Beagles suggests that altered processing of extracellular matrix and/or defects in microfibril structure or function may be involved in raising intraocular pressure, offering specific biochemical targets for future research and treatment strategies
    • …
    corecore