
    Similarity-Detection and Localization

    The detection of similarities between long DNA and protein sequences is studied using concepts of statistical physics. It is shown that mutual similarities can be detected by sequence alignment methods only if their amount exceeds a threshold value. The onset of detection is a continuous phase transition which can be viewed as a localization-delocalization transition. The "fidelity" of the alignment is the order parameter of that transition; it leads to criteria for the selection of optimal alignment parameters. Comment: 4 pages including 4 figures (308kb post-script file).

    Dinucleotide controlled null models for comparative RNA gene prediction

    Background: Comparative prediction of RNA structures can be used to identify functional noncoding RNAs in genomic screens. It was shown recently by Babak et al. [BMC Bioinformatics. 8:33] that RNA gene prediction programs can be biased by the genomic dinucleotide content, in particular those programs using a thermodynamic folding model including stacking energies. As a consequence, there is need for dinucleotide-preserving control strategies to assess the significance of such predictions. While there have been randomization algorithms for single sequences for many years, the problem has remained challenging for multiple alignments and there is currently no algorithm available. Results: We present a program called SISSIz that simulates multiple alignments of a given average dinucleotide content. Meeting additional requirements of an accurate null model, the randomized alignments are on average of the same sequence diversity and preserve local conservation and gap patterns. We make use of a phylogenetic substitution model that includes overlapping dependencies and site-specific rates. Using fast heuristics and a distance based approach, a tree is estimated under this model which is used to guide the simulations. The new algorithm is tested on vertebrate genomic alignments and the effect on RNA structure predictions is studied. In addition, we directly combined the new null model with the RNAalifold consensus folding algorithm giving a new variant of a thermodynamic structure based RNA gene finding program that is not biased by the dinucleotide content. Conclusion: SISSIz implements an efficient algorithm to randomize multiple alignments preserving dinucleotide content. It can be used to get more accurate estimates of false positive rates of existing programs, to produce negative controls for the training of machine learning based programs, or as standalone RNA gene finding program. Other applications in comparative genomics that require randomization of multiple alignments can be considered. Availability: SISSIz is available as open source C code that can be compiled for every major platform and downloaded here: http://sourceforge.net/projects/sissiz.

    Nuclear and global X-ray properties of LINER galaxies: Chandra and BeppoSAX results for Sombrero and NGC 4736

    We report on the 0.1-100 keV BeppoSAX observations of two nearby LINER galaxies, Sombrero and NGC 4736. Chandra ACIS-S observations supplement this broad-beam spectral study with a high resolution look into the nuclear region, and show a dominating central point source in Sombrero and a complex X-ray binary dominated/starburst region in NGC 4736. A compact non-thermal radio source, present in the nucleus of both galaxies, coincides with the central source in Sombrero, while in NGC 4736 its X-ray counterpart is a much fainter point source, not the brightest of the central region. On the basis of these and other results, we conclude that the LINER activity is linked to the presence of a low luminosity AGN in Sombrero and to a recent starburst in NGC 4736, and that Chandra's spectroscopic capabilities coupled to high resolution imaging are essential to establish the origin of the nuclear activity. Comment: 13 pages, 12 figures (Fig. 4 and Fig. 5 are colour figures), to be published in Astronomy & Astrophysics.

    Evolutionary rates vary among rRNA structural elements

    Understanding patterns of rRNA evolution is critical for a number of fields, including structure prediction and phylogeny. The standard model of RNA evolution is that compensatory mutations in stems make up the bulk of the changes between homologous sequences, while unpaired regions are relatively homogeneous. We show that considerable heterogeneity exists in the relative rates of evolution of different secondary structure categories (stems, loops, bulges, etc.) within the rRNA, and that in eukaryotes, loops actually evolve much faster than stems. Both rates of evolution and abundance of different structural categories vary with distance from functionally important parts of the ribosome such as the tRNA path and the peptidyl transferase center. For example, fast-evolving residues are mainly found at the surface; stems are enriched at the subunit interface, and junctions near the peptidyl transferase center. However, different secondary structure categories evolve at different rates even when these effects are accounted for. The results demonstrate that relative rates and patterns of evolution are lineage specific, suggesting that phylogenetically and structurally specific models will improve evolutionary and structural predictions.

    Microsatellites for the genus Cucurbita and an SSR-based genetic linkage map of Cucurbita pepo L.

    Until recently, only a few microsatellites have been available for Cucurbita, thus their development is highly desirable. The Austrian oil-pumpkin variety Gleisdorfer Ölkürbis (C. pepo subsp. pepo) and the C. moschata cultivar Soler (Puerto Rico) were used for SSR development. SSR-enriched partial genomic libraries were established and 2,400 clones were sequenced. Of these, 1,058 (44%) contained an SSR at least four repeats long. Primers were designed for 532 SSRs; 500 primer pairs produced fragments of expected size. Of these, 405 (81%) amplified polymorphic fragments in a set of 12 genotypes: three C. moschata, one C. ecuadorensis, and eight C. pepo representing all eight cultivar groups. On average, C. pepo and C. moschata produced 3.3 alleles per primer pair, showing high inter-species transferability. There were 187 SSR markers detecting polymorphism between the USA oil-pumpkin variety “Lady Godiva” (O5) and the Italian crookneck variety “Bianco Friulano” (CN), which are the parents of our previous F2 mapping population. It has been used to construct the first published C. pepo map, containing mainly RAPD and AFLP markers. Now the updated map comprises 178 SSRs, 244 AFLPs, 230 RAPDs, five SCARs, and two morphological traits (h and B). It contains 20 linkage groups with a map density of 2.9 cM. The observed genome coverage (Co) is 86.8%.

    Homoplastic microinversions and the avian tree of life

    Background: Microinversions are cytologically undetectable inversions of DNA sequences that accumulate slowly in genomes. Like many other rare genomic changes (RGCs), microinversions are thought to be virtually homoplasy-free evolutionary characters, suggesting that they may be very useful for difficult phylogenetic problems such as the avian tree of life. However, few detailed surveys of these genomic rearrangements have been conducted, making it difficult to assess this hypothesis or understand the impact of microinversions upon genome evolution. Results: We surveyed non-coding sequence data from a recent avian phylogenetic study and found substantially more microinversions than expected based upon prior information about vertebrate inversion rates, although this is likely due to underestimation of these rates in previous studies. Most microinversions were lineage-specific or united well-accepted groups. However, some homoplastic microinversions were evident among the informative characters. Hemiplasy, which reflects differences between gene trees and the species tree, did not explain the observed homoplasy. Two specific loci were microinversion hotspots, with high numbers of inversions that included both the homoplastic as well as some overlapping microinversions. Neither stem-loop structures nor detectable sequence motifs were associated with microinversions in the hotspots. Conclusions: Microinversions can provide valuable phylogenetic information, although power analysis indicate

    Pervasive Cryptic Epistasis in Molecular Evolution

    The functional effects of most amino acid replacements accumulated during molecular evolution are unknown, because most are not observed naturally and the possible combinations are too numerous. We created 168 single mutations in wild-type Escherichia coli isopropylmalate dehydrogenase (IMDH) that match the differences found in wild-type Pseudomonas aeruginosa IMDH. 104 mutant enzymes performed similarly to E. coli wild-type IMDH, one was functionally enhanced, and 63 were functionally compromised. The transition from E. coli IMDH, or an ancestral form, to the functional wild-type P. aeruginosa IMDH requires extensive epistasis to ameliorate the combined effects of the deleterious mutations. This result stands in marked contrast with a basic assumption of molecular phylogenetics, that sites in sequences evolve independently of each other. Residues that affect function are scattered haphazardly throughout the IMDH structure. We screened for compensatory mutations at three sites, all of which lie near the active site and all of which are among the least active mutants. No compensatory mutations were found at two sites, indicating that a single site may engage in compound epistatic interactions. One complete and three partial compensatory mutations of the third site are remote and lie in a different domain. This demonstrates that epistatic interactions can occur between distant (>20Å) sites. Phylogenetic analysis shows that incompatible mutations were fixed in different lineages.

    Oxygen and nitrogen abundances in nearby galaxies. Correlations between oxygen abundance and macroscopic properties

    We performed a compilation of more than 1000 published spectra of HII regions in spiral galaxies. The oxygen and nitrogen abundances in each HII region were recomputed in a homogeneous way, using the P-method. The radial distributions of oxygen and nitrogen abundances were derived. The correlations between oxygen abundance and macroscopic properties are examined. There is a significant difference between the L-Z relationship obtained here and that based on the oxygen abundances determined through the R_23-calibrations. The oxygen abundance of NGC 5457 recently determined using direct measurements of Te (Kennicutt, Bresolin & Garnett 2003) agrees with the L-Z relationship derived here, but is in conflict with the L-Z relationship derived with the R_23-based oxygen abundances. The obtained L-Z relation for spirals is compared to that for irregulars. Our sample of galaxies shows evidence that the slope of the O/H-M_B relationship for spirals is slightly more shallow than that for irregulars. The effective oxygen yields were estimated for spiral and irregular galaxies. The effective oxygen yield increases with increasing luminosity from M_B=-11 to M_B=-18 (or with increasing rotation velocity from Vrot=10 km/s to Vrot=100 km/s) and then remains approximately constant. Irregular galaxies from our sample have effective oxygen yields lowered by a factor of 3 at maximum, i.e. irregular galaxies usually keep at least 1/3 of the oxygen they manufactured during their evolution. Comment: Accepted for publication in Astronomy and Astrophysics (Figures 2-5, Tables 2, 6 and Appendix will only be published in the electronic version of the Journal).