15 research outputs found
Genome-wide profiling of Populus small RNAs
<p>Abstract</p> <p>Background</p> <p>Short RNAs, and in particular microRNAs, are important regulators of gene expression both within defined regulatory pathways and at the epigenetic scale. We investigated the short RNA (sRNA) population (18-24 nt) of the transcriptome of green leaves from the sequenced <it>Populus trichocarpa </it>using a concatenation strategy in combination with 454 sequencing.</p> <p>Results</p> <p>The most abundant size class of sRNAs were 24 nt. Long Terminal Repeats were particularly associated with 24 nt sRNAs. Additionally, some repetitive elements were associated with 22 nt sRNAs. We identified an sRNA hot-spot on chromosome 19, overlapping a region containing both the proposed sex-determining locus and a major cluster of <it>NBS-LRR </it>genes. A number of phased siRNA loci were identified, a subset of which are predicted to target PPR and <it>NBS-LRR </it>disease resistance genes, classes of genes that have been significantly expanded in <it>Populus</it>. Additional loci enriched for sRNA production were identified and characterised. We identified 15 novel predicted microRNAs (miRNAs), including miRNA*sequences, and identified a novel locus that may encode a dual miRNA or a miRNA and short interfering RNAs (siRNAs).</p> <p>Conclusions</p> <p>The short RNA population of <it>P. trichocarpa </it>is at least as complex as that of <it>Arabidopsis thaliana</it>. We provide a first genome-wide view of short RNA production for <it>P. trichocarpa </it>and identify new, non-conserved miRNAs.</p
Increased Throughput by Parallelization of Library Preparation for Massive Sequencing
Background: Massively parallel sequencing systems continue to improve on data output, while leaving labor-intensive library preparations a potential bottleneck. Efforts are currently under way to relieve the crucial and time-consuming work to prepare DNA for high-throughput sequencing. Methodology/Principal Findings: In this study, we demonstrate an automated parallel library preparation protocol using generic carboxylic acid-coated superparamagnetic beads and polyethylene glycol precipitation as a reproducible and flexible method for DNA fragment length separation. With this approach the library preparation for DNA sequencing can easily be adjusted to a desired fragment length. The automated protocol, here demonstrated using the GS FLX Titanium instrument, was compared to the standard manual library preparation, showing higher yield, throughput and great reproducibility. In addition, 12 libraries were prepared and uniquely tagged in parallel, and the distribution of sequence reads between these indexed samples could be improved using quantitative PCR-assisted pooling. Conclusions/Significance: We present a novel automated procedure that makes it possible to prepare 36 indexed libraries per person and day, which can be increased to up to 96 libraries processed simultaneously. The yield, speed and robust performance of the protocol constitute a substantial improvement to present manual methods, without the need of extensive equipment investments. The described procedure enables a considerable efficiency increase for small to midsiz
Recommended from our members
Genome-wide profiling of Populus small RNAs
Background: Short RNAs, and in particular microRNAs, are important regulators of gene
expression both within defined regulatory pathways and at the epigenetic scale. We investigated
the short RNA (sRNA) population (18-24 nt) of the transcriptome of green leaves from the
sequenced Populus trichocarpa using a concatenation strategy in combination with 454 sequencing.
Results: The most abundant size class of sRNAs were 24 nt. Long Terminal Repeats were
particularly associated with 24 nt sRNAs. Additionally, some repetitive elements were associated
with 22 nt sRNAs. We identified an sRNA hot-spot on chromosome 19, overlapping a region
containing both the proposed sex-determining locus and a major cluster of NBS-LRR genes. A
number of phased siRNA loci were identified, a subset of which are predicted to target PPR and
NBS-LRR disease resistance genes, classes of genes that have been significantly expanded in Populus.
Additional loci enriched for sRNA production were identified and characterised. We identified 15
novel predicted microRNAs (miRNAs), including miRNA*sequences, and identified a novel locus
that may encode a dual miRNA or a miRNA and short interfering RNAs (siRNAs).
Conclusions: The short RNA population of P. trichocarpa is at least as complex as that of
Arabidopsis thaliana. We provide a first genome-wide view of short RNA production for P.
trichocarpa and identify new, non-conserved miRNAs.Copyrighted by BioMed Central
Evidence that breast cancer risk at the 2q35 locus is mediated through IGFBP5 regulation.
GWAS have identified a breast cancer susceptibility locus on 2q35. Here we report the fine mapping of this locus using data from 101,943 subjects from 50 case-control studies. We genotype 276 SNPs using the 'iCOGS' genotyping array and impute genotypes for a further 1,284 using 1000 Genomes Project data. All but two, strongly correlated SNPs (rs4442975 G/T and rs6721996 G/A) are excluded as candidate causal variants at odds against >100:1. The best functional candidate, rs4442975, is associated with oestrogen receptor positive (ER+) disease with an odds ratio (OR) in Europeans of 0.85 (95% confidence interval=0.84-0.87; P=1.7 × 10(-43)) per t-allele. This SNP flanks a transcriptional enhancer that physically interacts with the promoter of IGFBP5 (encoding insulin-like growth factor-binding protein 5) and displays allele-specific gene expression, FOXA1 binding and chromatin looping. Evidence suggests that the g-allele confers increased breast cancer susceptibility through relative downregulation of IGFBP5, a gene with known roles in breast cell biology
An intergenic risk locus containing an enhancer deletion in 2q35 modulates breast cancer risk by deregulating IGFBP5 expression
Breast cancer is the most diagnosed malignancy and the second leading cause of cancer mortality in females. Previous association studies have identified variants on 2q35 associated with the risk of breast cancer. To identify functional susceptibility loci for breast cancer, we interrogated the 2q35 gene desert for chromatin architecture and functional variation correlated with gene expression. We report a novel intergenic breast cancer risk locus containing an enhancer copy number variation (enCNV; deletion) located approximately 400Kb upstream to IGFBP5, which overlaps an intergenic ERα-bound enhancer that loops to the IGFBP5 promoter. The enCNV is correlated with modified ERα binding and monoallelic-repression of IGFBP5 following estrogen treatment. We investigated the association of enCNV genotype with breast cancer in 1,182 cases and 1,362 controls, and replicate our findings in an independent set of 62,533 cases and 60,966 controls from 41 case control studies and 11 GWAS. We report a dose-dependent inverse association of 2q35 enCNV genotype (percopy OR=0.68 95%CI 0.55-0.83, P=0.0002; replication OR=0.77 95%CI 0.73-0.82, P=2.1x10(-19)) and identify 13 additional linked variants (r(2)>0.8) in the 20Kb linkage block containing the enCNV (P=3.2x10(-15) - 5.6x10(-17)). These associations were independent of previously reported 2q35 variants, rs13387042/rs4442975 and rs16857609, and were stronger for ER-positive than ER-negative disease. Together, these results suggest that 2q35 breast cancer risk loci may be mediating their effect through IGFBP5
Publisher Correction: Evidence that breast cancer risk at the 2q35 locus is mediated through IGFBP5 regulation.
This corrects the article DOI: 10.1038/ncomms5999
Evidence that breast cancer risk at the 2q35 locus is mediated through IGFBP5 regulation
GWAS have identified a breast cancer susceptibility locus on 2q35. Here we report the fine mapping of this locus using data from 101,943 subjects from 50 case-control studies. We genotype 276 SNPs using the 'iCOGS' genotyping array and impute genotypes for a further 1,284 using 1000 Genomes Project data. All but two, strongly correlated SNPs (rs4442975 G/T and rs6721996 G/A) are excluded as candidate causal variants at odds against >100:1. The best functional candidate, rs4442975, is associated with oestrogen receptor positive (ER+) disease with an odds ratio (OR) in Europeans of 0.85 (95% confidence interval=0.84-0.87; P=1.7 × 10-43) per t-allele. This SNP flanks a transcriptional enhancer that physically interacts with the promoter of IGFBP5 (encoding insulin-like growth factor-binding protein 5) and displays allele-specific gene expression, FOXA1 binding and chromatin looping. Evidence suggests that the g-allele confers increased breast cancer susceptibility through relative downregulation of IGFBP5, a gene with known roles in breast cell biology