777 research outputs found
A systematic search for new mammalian noncoding RNAs indicates little conserved intergenic transcription
BACKGROUND: Systematic identification and functional characterization of novel types of noncoding (nc)RNA in genomes is more difficult than it is for protein coding mRNAs, since ncRNAs typically do not possess sequence features such as splicing or translation signals, or long open reading frames. Recent "tiling" microarray studies have reported that a surprisingly larger proportion of mammalian genomes is transcribed than was previously anticipated. However, these non-genic transcripts often appear to be low in abundance, and their functional significance is not known. RESULTS: To systematically search for functional ncRNAs, we designed microarrays to detect 3,478 intergenic and intronic sequences that are conserved between the human, mouse, and rat genomes, and that score highly by other criteria that characterize ncRNAs. We probed these arrays with total RNA isolated from 16 wild-type mouse tissues. Among 55 candidates for highly-expressed novel ncRNAs tested by northern blotting, eight were confirmed as small, highly-and ubiquitously-expressed RNAs in mouse. Of the eight, five were also detected in rat tissues, but none were detected at appreciable levels in human tissues or cultured cells. CONCLUSION: Since the sequence and expression of most known coding transcripts and functional ncRNAs is conserved between human and mouse, the lack of northern-detectable expression in human cells and tissues of the novel mouse and rat ncRNAs that we identified suggests that they are not functional or possibly have rodent-specific functions. Our results confirm that relatively little of the intergenic sequence conserved between human, mouse and rat is transcribed at high levels in mammalian tissues, possibly suggesting a limited role for transcribed intergenic and intronic sequences as independent functional elements
Considerations in the identification of functional RNA structural elements in genomic alignments
BACKGROUND: Accurate identification of novel, functional noncoding (nc) RNA features in genome sequence has proven more difficult than for exons. Current algorithms identify and score potential RNA secondary structures on the basis of thermodynamic stability, conservation, and/or covariance in sequence alignments. Neither the algorithms nor the information gained from the individual inputs have been independently assessed. Furthermore, due to issues in modelling background signal, it has been difficult to gauge the precision of these algorithms on a genomic scale, in which even a seemingly small false-positive rate can result in a vast excess of false discoveries. RESULTS: We developed a shuffling algorithm, shuffle-pair.pl, that simultaneously preserves dinucleotide frequency, gaps, and local conservation in pairwise sequence alignments. We used shuffle-pair.pl to assess precision and recall of six ncRNA search tools (MSARI, QRNA, ddbRNA, RNAz, Evofold, and several variants of simple thermodynamic stability on a test set of 3046 alignments of known ncRNAs. Relative to mononucleotide shuffling, preservation of dinucleotide content in shuffling the alignments resulted in a drastic increase in estimated false-positive detection rates for ncRNA elements, precluding evaluation of higher order alignments, which cannot not be adequately shuffled maintaining both dinucleotides and alignment structure. On pairwise alignments, none of the covariance-based tools performed markedly better than thermodynamic scoring alone. Although the high false-positive rates call into question the veracity of any individual predicted secondary structural element in our analysis, we nevertheless identified intriguing global trends in human genome alignments. The distribution of ncRNA prediction scores in 75-base windows overlapping UTRs, introns, and intergenic regions analyzed using both thermodynamic stability and EvoFold (which has no thermodynamic component) was significantly higher for real than shuffled sequence, while the distribution for coding sequences was lower than that of corresponding shuffles. CONCLUSION: Accurate prediction of novel RNA structural elements in genome sequence remains a difficult problem, and development of an appropriate negative-control strategy for multiple alignments is an important practical challenge. Nonetheless, the general trends we observed for the distributions of predicted ncRNAs across genomic features are biologically meaningful, supporting the presence of secondary structural elements in many 3' UTRs, and providing evidence for evolutionary selection against secondary structures in coding regions
Global analysis of yeast RNA processing identifies new targets of RNase III and uncovers a link between tRNA 5′ end processing and tRNA splicing
We used a microarray containing probes that tile all known yeast noncoding RNAs (ncRNAs) to investigate RNA biogenesis on a global scale. The microarray verified a general loss of Box C/D snoRNAs in the TetO(7)-BCD1 mutant, which had previously been shown for only a handful of snoRNAs. We also monitored the accumulation of improperly processed flank sequences of pre-RNAs in strains depleted for known RNA nucleases, including RNase III, Dbr1p, Xrn1p, Rat1p and components of the exosome and RNase P complexes. Among the hundreds of aberrant RNA processing events detected, two novel substrates of Rnt1p (the RUF1 and RUF3 snoRNAs) were identified. We also identified a relationship between tRNA 5′ end processing and tRNA splicing, processes that were previously thought to be independent. This analysis demonstrates the applicability of microarray technology to the study of global analysis of ncRNA synthesis and provides an extensive directory of processing events mediated by yeast ncRNA processing enzymes
Critical Evaluation of Imprinted Gene Expression by RNA–Seq: A New Perspective
In contrast to existing estimates of approximately 200 murine imprinted genes, recent work based on transcriptome sequencing uncovered parent-of-origin allelic effects at more than 1,300 loci in the developing brain and two adult brain regions, including hundreds present in only males or females. Our independent replication of the embryonic brain stage, where the majority of novel imprinted genes were discovered and the majority of previously known imprinted genes confirmed, resulted in only 12.9% concordance among the novel imprinted loci. Further analysis and pyrosequencing-based validation revealed that the vast majority of the novel reported imprinted loci are false-positives explained by technical and biological variation of the experimental approach. We show that allele-specific expression (ASE) measured with RNA–Seq is not accurately modeled with statistical methods that assume random independent sampling and that systematic error must be accounted for to enable accurate identification of imprinted expression. Application of a robust approach that accounts for these effects revealed 50 candidate genes where allelic bias was predicted to be parent-of-origin–dependent. However, 11 independent validation attempts through a range of allelic expression biases confirmed only 6 of these novel cases. The results emphasize the importance of independent validation and suggest that the number of imprinted genes is much closer to the initial estimates
Systematic Detection of Polygenic cis-Regulatory Evolution
The idea that most morphological adaptations can be attributed to changes in the
cis-regulation of gene expression levels has been gaining
increasing acceptance, despite the fact that only a handful of such cases have
so far been demonstrated. Moreover, because each of these cases involves only
one gene, we lack any understanding of how natural selection may act on
cis-regulation across entire pathways or networks. Here we
apply a genome-wide test for selection on cis-regulation to two
subspecies of the mouse Mus musculus. We find evidence for
lineage-specific selection at over 100 genes involved in diverse processes such
as growth, locomotion, and memory. These gene sets implicate candidate genes
that are supported by both quantitative trait loci and a validated
causality-testing framework, and they predict a number of phenotypic
differences, which we confirm in all four cases tested. Our results suggest that
gene expression adaptation is widespread and that these adaptations can be
highly polygenic, involving cis-regulatory changes at numerous
functionally related genes. These coordinated adaptations may contribute to
divergence in a wide range of morphological, physiological, and behavioral
phenotypes
Conservation of core gene expression in vertebrate tissues
Abstract
Background
Vertebrates share the same general body plan and organs, possess related sets of genes, and rely on similar physiological mechanisms, yet show great diversity in morphology, habitat and behavior. Alteration of gene regulation is thought to be a major mechanism in phenotypic variation and evolution, but relatively little is known about the broad patterns of conservation in gene expression in non-mammalian vertebrates.
Results
We measured expression of all known and predicted genes across twenty tissues in chicken, frog and pufferfish. By combining the results with human and mouse data and considering only ten common tissues, we have found evidence of conserved expression for more than a third of unique orthologous genes. We find that, on average, transcription factor gene expression is neither more nor less conserved than that of other genes. Strikingly, conservation of expression correlates poorly with the amount of conserved nonexonic sequence, even using a sequence alignment technique that accounts for non-collinearity in conserved elements. Many genes show conserved human/fish expression despite having almost no nonexonic conserved primary sequence.
Conclusions
There are clearly strong evolutionary constraints on tissue-specific gene expression. A major challenge will be to understand the precise mechanisms by which many gene expression patterns remain similar despite extensive cis-regulatory restructuring
Threats to ICF reactor materials: computational simulations of radiation damage induced topological changes in fused silica
Abstract We have performed molecular dynamics simulations of radiation damage in fused silica. In this study, we discuss the role of successive cascade overlap on the saturation and self-healing of oxygen vacancy defects in the amorphous fused silica network. Furthermore, we present findings on the topological changes in fused silica due to repeated energetic recoil atoms. These topological network modifications consistent with experimental Raman spectroscopic observation on neutron and ion irradiated fused silica are indicators of permanent densification that has also been observed experimentally. Published by Elsevier Science B.V
A First-Passage Kinetic Monte Carlo Algorithm for Complex Diffusion-Reaction Systems
We develop an asynchronous event-driven First-Passage Kinetic Monte Carlo
(FPKMC) algorithm for continuous time and space systems involving multiple
diffusing and reacting species of spherical particles in two and three
dimensions. The FPKMC algorithm presented here is based on the method
introduced in [Phys. Rev. Lett., 97:230602, 2006] and is implemented in a
robust and flexible framework. Unlike standard KMC algorithms such as the
n-fold algorithm, FPKMC is most efficient at low densities where it replaces
the many small hops needed for reactants to find each other with large
first-passage hops sampled from exact time-dependent Green's functions, without
sacrificing accuracy. We describe in detail the key components of the
algorithm, including the event-loop and the sampling of first-passage
probability distributions, and demonstrate the accuracy of the new method. We
apply the FPKMC algorithm to the challenging problem of simulation of long-term
irradiation of metals, relevant to the performance and aging of nuclear
materials in current and future nuclear power plants. The problem of radiation
damage spans many decades of time-scales, from picosecond spikes caused by
primary cascades, to years of slow damage annealing and microstructure
evolution. Our implementation of the FPKMC algorithm has been able to simulate
the irradiation of a metal sample for durations that are orders of magnitude
longer than any previous simulations using the standard Object KMC or more
recent asynchronous algorithms.Comment: See also arXiv:0905.357
- …