445 research outputs found

    Assigning confidence scores to homoeologs using fuzzy logic.

    Get PDF
    In polyploid genomes, homoeologs are a specific subtype of homologs, and can be thought of as orthologs between subgenomes. In Orthologous MAtrix, we infer homoeologs in three polyploid plant species: upland cotton (Gossypium hirsutum), rapeseed (Brassica napus), and bread wheat (Triticum aestivum). While we can typically recognize the features of a "good" homoeolog prediction (a consistent evolutionary distance, high synteny, and a one-to-one relationship), none of them is a hard-fast criterion. We devised a novel fuzzy logic-based method to assign confidence scores to each pair of predicted homoeologs. We inferred homoeolog pairs and used the new and improved method to assign confidence scores, which ranged from 0 to 100. Most confidence scores were between 70 and 100, but the distribution varied between genomes. The new confidence scores show an improvement over our previous method and were manually evaluated using a subset from various confidence ranges

    OMA 2011: orthology inference among 1000 complete genomes

    Get PDF
    OMA (Orthologous MAtrix) is a database that identifies orthologs among publicly available, complete genomes. Initiated in 2004, the project is at its 11th release. It now includes 1000 genomes, making it one of the largest resources of its kind. Here, we describe recent developments in terms of species covered; the algorithmic pipeline—in particular regarding the treatment of alternative splicing, and new features of the web (OMA Browser) and programming interface (SOAP API). In the second part, we review the various representations provided by OMA and their typical applications. The database is publicly accessible at http://omabrowser.org

    On the Enhanced Interstellar Scattering Toward B1849+005

    Full text link
    (Abridged) This paper reports new Very Large Array (VLA) and Very Long Baseline Array (VLBA) observations of the extragalactic source B1849+005 at frequencies between 0.33 and 15 GHz and the re-analysis of archival VLA observations at 0.33, 1.5, and 4.9 GHz. The structure of this source is complex but interstellar scattering dominates the structure of the central component at least to 15 GHz. An analysis of the phase structure functions of the interferometric visibilities shows the density fluctuations along this line of sight to be anisotropic (axial ratio = 1.3) with a frequency-independent position angle, and having an inner scale of roughly a few hundred kilometers. The anisotropies occur on length scales of order 10^{15} cm (D/5 kpc), which within the context of certain magnetohydrodynamic turbulence theories indicates the length scale on which the kinetic and magnetic energy densities are comparable. A conservative upper limit on the velocity of the scattering material is 1800 km/s. In the 0.33 GHz field of view, there are a number of other sources that might also be heavily scattered. Both B1849+005 and PSR B1849+00 are highly scattered, and they are separated by only 13'. If the lines of sight are affected by the same ``clump'' of scattering material, it must be at least 2.3 kpc distant. However, a detailed attempt to account for the scattering observables toward these sources does not produce a self-consistent set of parameters for such a clump. A clump of H\alpha emission, possibly associated with the H II region G33.418-0.004, lies between these two lines of sight, but it seems unable to account for all of the required excess scattering.Comment: 23 pages, LaTeX2e AASTeX, 13 figures in 14 PostScript files, accepted for publication in Ap

    Partial Homology Relations - Satisfiability in terms of Di-Cographs

    Full text link
    Directed cographs (di-cographs) play a crucial role in the reconstruction of evolutionary histories of genes based on homology relations which are binary relations between genes. A variety of methods based on pairwise sequence comparisons can be used to infer such homology relations (e.g.\ orthology, paralogy, xenology). They are \emph{satisfiable} if the relations can be explained by an event-labeled gene tree, i.e., they can simultaneously co-exist in an evolutionary history of the underlying genes. Every gene tree is equivalently interpreted as a so-called cotree that entirely encodes the structure of a di-cograph. Thus, satisfiable homology relations must necessarily form a di-cograph. The inferred homology relations might not cover each pair of genes and thus, provide only partial knowledge on the full set of homology relations. Moreover, for particular pairs of genes, it might be known with a high degree of certainty that they are not orthologs (resp.\ paralogs, xenologs) which yields forbidden pairs of genes. Motivated by this observation, we characterize (partial) satisfiable homology relations with or without forbidden gene pairs, provide a quadratic-time algorithm for their recognition and for the computation of a cotree that explains the given relations

    The Early Evolution of Massive Stars: Radio Recombination Line Spectra

    Full text link
    Velocity shifts and differential broadening of radio recombination lines are used to estimate the densities and velocities of the ionized gas in several hypercompact and ultracompact HII regions. These small HII regions are thought to be at their earliest evolutionary phase and associated with the youngest massive stars. The observations suggest that these HII regions are characterized by high densities, supersonic flows and steep density gradients, consistent with accretion and outflows that would be associated with the formation of massive stars.Comment: ApJ in pres

    Coordinated thermal and optical observations of Trans-Neptunian object (20000) Varuna from Sierra Nevada

    Get PDF
    We report on coordinated thermal and optical measurements of trans-Neptunian object (20000) Varuna obtained in January-February 2002, respectively from the IRAM 30-m and IAA 1.5 m telescopes. The optical data show a lightcurve with a period of 3.176+/-0.010 hr, a mean V magnitude of 20.37+/-0.08 and a 0.42+/-0.01 magnitude amplitude. They also tentatively indicate that the lightcurve is asymmetric and double-peaked. The thermal observations indicate a 1.12+/-0.41 mJy flux, averaged over the object's rotation. Combining the two datasets, we infer that Varuna has a mean 1060(+180/-220) km diameter and a mean 0.038(+0.022/-0.010) V geometric albedo, in general agreement with an earlier determination using the same technique.Comment: Accepted for publication in Astronomy & Astrophysics (7 pages, including 3 figures

    Subarcsecond Submillimeter Imaging of the Ultracompact HII Region G5.89-0.39

    Full text link
    We present the first subarcsecond submillimeter images of the enigmatic ultracompact HII region (UCHII) G5.89-0.39. Observed with the SMA, the 875 micron continuum emission exhibits a shell-like morphology similar to longer wavelengths. By using images with comparable angular resolution at five frequencies obtained from the VLA archive and CARMA, we have removed the free-free component from the 875 micron image. We find five sources of dust emission: two compact warm objects (SMA1 and SMA2) along the periphery of the shell, and three additional regions further out. There is no dust emission inside the shell, supporting the picture of a dust-free cavity surrounded by high density gas. At subarcsecond resolution, most of the molecular gas tracers encircle the UCHII region and appear to constrain its expansion. We also find G5.89-0.39 to be almost completely lacking in organic molecular line emission. The dust cores SMA1 and SMA2 exhibit compact spatial peaks in optically-thin gas tracers (e.g. 34SO2), while SMA1 also coincides with 11.9 micron emission. In CO(3-2), we find a high-velocity north/south bipolar outflow centered on SMA1, aligned with infrared H2 knots, and responsible for much of the maser activity. We conclude that SMA1 is an embedded intermediate mass protostar with an estimated luminosity of 3000 Lsun and a circumstellar mass of ~1 Msun. Finally, we have discovered an NH3 (3,3) maser 12 arcsec northwest of the UCHII region, coincident with a 44 GHz CH3OH maser, and possibly associated with the Br gamma outflow source identified by Puga et al. (2006).Comment: 41 pages, 11 figures, published in The Astrophysical Journal (2008) Volume 680, Issue 2, pp. 1271-1288. An error in the registration of the marker positions in Figure 11 has been corrected in this versio

    Uncertain groupings: probabilistic combination of grouping data

    Get PDF
    Probabilistic approaches for data integration have much potential. We view data integration as an iterative process where data understanding gradually increases as the data scientist continuously refines his view on how to deal with learned intricacies like data conflicts. This paper presents a probabilistic approach for integrating data on groupings. We focus on a bio-informatics use case concerning homology. A bio-informatician has a large number of homology data sources to choose from. To enable querying combined knowledge contained in these sources, they need to be integrated. We validate our approach by integrating three real-world biological databases on homology in three iterations

    Expanding the Orthologous Matrix (OMA) programmatic interfaces: REST API and the OmaDB packages for R and Python.

    Get PDF
    The Orthologous Matrix (OMA) is a well-established resource to identify orthologs among many genomes. Here, we present two recent additions to its programmatic interface, namely a REST API, and user-friendly R and Python packages called OmaDB. These should further facilitate the incorporation of OMA data into computational scripts and pipelines. The REST API can be freely accessed at https://omabrowser.org/api. The R OmaDB package is available as part of Bioconductor at http://bioconductor.org/packages/OmaDB/, and the omadb Python package is available from the Python Package Index (PyPI) at https://pypi.org/project/omadb/

    VLA H53alpha observations of the central region of the Super Star Cluster Galaxy NGC 5253

    Full text link
    We present observations in the H53alpha line and radio continuum at 43 GHz carried out with the VLA in the D array (2'' angular resolution) toward the starburst galaxy NGC 5253. VLA archival data have been reprocessed to produce a uniform set of 2, 1.3 and 0.7 cm high angular (0.''2 X 0.''1) radio continuum images. The RRL H53alpha, a previously reported measurement of the H92alpha RRL flux density and the reprocessed high angular resolution radio continuum flux densities have been modeled using a collection of HII regions. Based on the models, the ionized gas in the nuclear source has an electron density of ~6 X 10^4 cm^-3 and an volume filling factor of 0.05. A Lyman continuum photon production rate of 2 X 10^52 s^-1 is necessary to sustain the ionization in the nuclear region. The number of required O7 stars in the central 1.5 pc of the supernebula is ~ 2000. The H53alpha velocity gradient 10 km s^-1 arcsec^-1) implies a dynamical mass of ~3X10^5 Msun; this mass suggests the supernebula is confined by gravity.Comment: Accepted in Astrophysical Journal 7 figure
    corecore