14 research outputs found

    Whole exome capture in solution with 3 Gbp of data

    We have developed a solution-based method for targeted DNA capture-sequencing that is directed to the complete human exome. This approach allows the discovery of greater than 95% of all expected heterozygous single base variants, requires as little as 3 Gbp of raw sequence data, and constitutes an effective tool for identifying rare coding alleles in large-scale genomic studies.

    A comprehensive resequence analysis of the KLK15–KLK3–KLK2 locus on chromosome 19q13.33

    Single nucleotide polymorphisms (SNPs) in the KLK3 gene on chromosome 19q13.33 are associated with serum prostate-specific antigen (PSA) levels. Recent genome-wide association studies of prostate cancer have yielded conflicting results for association of the same SNPs with prostate cancer risk. Since the KLK3 gene encodes the PSA protein that forms the basis for a widely used screening test for prostate cancer, it is critical to fully characterize genetic variation in this region and assess its relationship with the risk of prostate cancer. We have conducted a next-generation sequence analysis in 78 individuals of European ancestry to characterize common (minor allele frequency, MAF >1%) genetic variation in a 56 kb region on chromosome 19q13.33 centered on the KLK3 gene (chr19:56,019,829–56,076,043 bp). We identified 555 polymorphic loci in the process, including 116 novel SNPs and 182 novel insertion/deletion polymorphisms (indels). Based on tagging analysis, 144 loci are necessary to tag the region at an r² threshold of 0.8 and MAF of 1% or higher, while 86 loci are required to tag the region at an r² threshold of 0.8 and MAF >5%. Our sequence data augment coverage by 35% and 78% as compared to variants in dbSNP and HapMap, respectively. We observed six non-synonymous amino acid or frameshift changes in the KLK3 gene and three changes in each of the neighboring genes, KLK15 and KLK2. Our study has generated a detailed map of common genetic variation in the genomic region surrounding the KLK3 gene, which should be useful for fine-mapping the association signal as well as determining the contribution of this locus to prostate cancer risk and/or regulation of PSA expression.
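The tagging analysis mentioned in the abstract above can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes a simple greedy selection, approximates r² by the squared Pearson correlation of genotype dosages (0, 1 or 2 copies of the minor allele), and uses made-up locus names and genotypes. The function names `r_squared` and `greedy_tags` are hypothetical.

```python
from itertools import combinations


def r_squared(x, y):
    """Squared Pearson correlation between two genotype dosage vectors.

    Assumption: genotype-level r^2 is used as a stand-in for haplotype r^2.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a monomorphic locus carries no LD information
    return cov * cov / (var_x * var_y)


def greedy_tags(genotypes, threshold=0.8):
    """Pick tag loci so every locus has r^2 >= threshold with some tag."""
    names = list(genotypes)
    # covers[s] holds the loci captured by s, including s itself
    covers = {s: {s} for s in names}
    for a, b in combinations(names, 2):
        if r_squared(genotypes[a], genotypes[b]) >= threshold:
            covers[a].add(b)
            covers[b].add(a)
    untagged = set(names)
    tags = []
    while untagged:
        # choose the locus that captures the most still-untagged loci
        best = max(names, key=lambda s: len(covers[s] & untagged))
        tags.append(best)
        untagged -= covers[best]
    return tags


# Hypothetical toy data: dosages for four loci in six individuals
toy_genotypes = {
    "locus_A": [0, 1, 2, 0, 1, 2],
    "locus_B": [0, 1, 2, 0, 1, 2],
    "locus_C": [2, 1, 0, 2, 1, 0],
    "locus_D": [0, 0, 1, 2, 2, 1],
}
print(greedy_tags(toy_genotypes, threshold=0.8))
```

In this toy set, the first three loci are in perfect LD with one another and the fourth is independent, so two tags suffice. A greedy choice does not guarantee the smallest possible tag set, and the MAF filter described in the abstract would be applied before selection.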

    Fluorescence in situ hybridization with high-complexity repeat-free oligonucleotide probes generated by massively parallel synthesis

    The ability to visualize specific DNA sequences, on chromosomes and in nuclei, by fluorescence in situ hybridization (FISH) is fundamental to many aspects of genetics, genomics and cell biology. Probe selection is currently limited by the availability of DNA clones or the appropriate pool of DNA sequences for PCR amplification. Here, we show that liquid-phase probe pools from sequence capture technology can be adapted to generate fluorescently labelled pools of oligonucleotides that are very effective as repeat-free FISH probes in mammalian cells. As well as detecting small (15 kb) and larger (100 kb) specific loci in both cultured cells and tissue sections, we show that complex oligonucleotide pools can be used as probes to visualize features of nuclear organization. Using this approach, we reveal the disposition of exons around the outside of a chromosome territory core and away from the nuclear periphery.

    Higher-Order Looping and Nuclear Organization of Tcra Facilitate Targeted RAG Cleavage and Regulated Rearrangement in Recombination Centers

    V(D)J recombination is essential for generating a diverse array of B and T cell receptors that can recognize and combat foreign antigens. As with any recombination event, tight control is essential to prevent the occurrence of genetic anomalies that drive cellular transformation. One important aspect of regulation is directed targeting of the RAG recombinase. Indeed, RAG accumulates at the 3′ end of individual antigen receptor loci poised for rearrangement; however, it is not known whether focal binding is involved in regulating cleavage, or what mechanisms lead to enrichment of RAG in this region. Here, we show that monoallelic looping out of the 3′ end of the T cell receptor α (Tcra) locus, coupled with transcription and increased chromatin/nuclear accessibility, is linked to focal RAG binding and ATM-mediated regulation of monoallelic cleavage on looped-out 3′ regions. Our data identify higher-order loop formation as a key determinant of directed RAG targeting and the maintenance of genome stability.

    Genome-wide analysis of allelic expression imbalance in human primary cells by high-throughput transcriptome resequencing

    Many disease-associated variants identified by genome-wide association (GWA) studies are expected to regulate gene expression. Allele-specific expression (ASE) quantifies transcription from both haplotypes using individuals heterozygous at tested SNPs. We performed deep human transcriptome-wide resequencing (RNA-seq) for ASE analysis and expression quantitative trait locus discovery. We resequenced double poly(A)-selected RNA from primary CD4+ T cells (n = 4 individuals, both activated and untreated conditions) and developed tools for paired-end RNA-seq alignment and ASE analysis. We generated an average of 20 million uniquely mapping 45-base reads per sample. We obtained sufficient read depth to test 1371 unique transcripts for ASE. Multiple biases inflate the false discovery rate, which we estimate to be ∼50% for random SNPs. However, after controlling for these biases and considering the subset of SNPs that pass HapMap QC, 4.6% of heterozygous SNP-sample pairs show evidence of imbalance (P < 0.001). We validated four findings by both bacterial cloning and Sanger sequencing assays. We also found convincing evidence for allelic imbalance at multiple reporter exonic SNPs in CD6 for two samples heterozygous at the multiple sclerosis-associated variant rs17824933, linking GWA findings with variation in gene expression. Finally, we show in CD4+ T cells from a further individual that high-throughput sequencing of genomic DNA and RNA-seq following enrichment for targeted gene sequences by sequence capture methods offers an unbiased means to increase the read depth for transcripts of interest, and therefore a method to investigate the regulatory role of many disease-associated genetic variants.
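As an illustration of how allelic imbalance at a heterozygous SNP might be tested, the sketch below applies an exact two-sided binomial test to reference and alternate read counts. This is an assumption made for illustration, not the statistical method reported in the abstract above, which does not name its test. The function name and read counts are hypothetical, and the sketch makes no correction for the mapping and other biases the abstract warns about.

```python
from math import comb


def binomial_two_sided_p(ref_reads, alt_reads, expected_ref_fraction=0.5):
    """Exact two-sided binomial p-value for allelic imbalance.

    Null hypothesis: each read comes from the reference allele with
    probability expected_ref_fraction (0.5 for balanced expression).
    Suitable for modest read depths; very deep coverage would need a
    log-space or library implementation to avoid float overflow.
    """
    n = ref_reads + alt_reads
    if n == 0:
        return 1.0  # no reads, no evidence either way
    p = expected_ref_fraction
    probs = [comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
    observed = probs[ref_reads]
    # sum the probabilities of all outcomes no more likely than the observed one
    total = sum(q for q in probs if q <= observed * (1 + 1e-9))
    return min(1.0, total)


# Hypothetical read counts at three heterozygous SNPs: (reference, alternate)
toy_counts = {"snp_1": (30, 5), "snp_2": (12, 8), "snp_3": (10, 10)}
for snp, (ref, alt) in toy_counts.items():
    p_value = binomial_two_sided_p(ref, alt)
    flag = "imbalanced" if p_value < 0.001 else "not significant"
    print(snp, f"{p_value:.3g}", flag)
```

The 0.001 cutoff mirrors the threshold quoted in the abstract. In practice the expected reference fraction often departs from 0.5 because of reference mapping bias, which is one reason a naive test of this kind can inflate false discoveries.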