1,538 research outputs found

    Selective Constraint on Noncoding Regions of Hominid Genomes

    Get PDF
    An important challenge for human evolutionary biology is to understand the genetic basis of human–chimpanzee differences. One influential idea holds that such differences depend, to a large extent, on adaptive changes in gene expression. An important step in assessing this hypothesis involves gaining a better understanding of selective constraint on noncoding regions of hominid genomes. In noncoding sequence, functional elements are frequently small and can be separated by large nonfunctional regions. For this reason, constraint in hominid genomes is likely to be patchy. Here we use conservation in more distantly related mammals and amniotes as a way of identifying small sequence windows that are likely to be functional. We find that putatively functional noncoding elements defined in this manner are subject to significant selective constraint in hominids

    SREBP Controls Oxygen-Dependent Mobilization of Retrotransposons in Fission Yeast

    Get PDF
    Retrotransposons are mobile genetic elements that proliferate through an RNA intermediate. Transposons do not encode transcription factors and thus rely on host factors for mRNA expression and survival. Despite information regarding conditions under which elements are upregulated, much remains to be learned about the regulatory mechanisms or factors controlling retrotransposon expression. Here, we report that low oxygen activates the fission yeast Tf2 family of retrotransposons. Sre1, the yeast ortholog of the mammalian membrane-bound transcription factor sterol regulatory element binding protein (SREBP), directly induces the expression and mobilization of Tf2 retrotransposons under low oxygen. Sre1 binds to DNA sequences in the Tf2 long terminal repeat that functions as an oxygen-dependent promoter. We find that Tf2 solo long terminal repeats throughout the genome direct oxygen-dependent expression of adjacent coding and noncoding sequences, providing a potential mechanism for the generation of oxygen-dependent gene expression

    Exploration of wheat and pathogen transcriptomes during tan spot infection

    Get PDF
    Objectives: The fungus Pyrenophora tritici-repentis is the causal agent of tan spot, a major disease of wheat (Triticum aestivum). Here, we used RNA sequencing to generate transcriptional datasets for both the host and pathogen during infection and during in vitro pathogen growth stages. Data description: To capture gene expression during wheat infection with the P. tritici-repentis isolate M4, RNA datasets were generated for wheat inoculated with P. tritici-repentis (infection) and a mock (control) at 3 and 4 days post-infection, when scorable leaf disease symptoms manifest. The P. tritici-repentis isolate M4 was also RNA sequenced to capture gene expression in vitro at two different growth stages: 7-day old vegetative mycelia and 9-day old sporulating mycelia, to coincide with a latent growth stage and early sporulation respectively. In total, 6 RNA datasets are available to aid in the validation of predicted genes of P. tritici-repentis and wheat. The datasets generated offer an insight into the transcriptomic profile of the host-pathogen interaction and can be used to investigate the expression of a subset of transcripts or targeted genes prior to designing cost-intensive RNA sequencing experiments, that would be best further explored with replication and a time series analysis

    The worm in the world and the world in the worm

    Get PDF
    Caenorhabditis elegans is a preeminent model organism, but the natural ecology of this nematode has been elusive. A four-year survey of French orchards published in BMC Biology reveals thriving populations of C. elegans (and Caenorhabditis briggsae) in rotting fruit and plant stems. Rather than being simply a 'soil nematode', C. elegans appears to be a 'plant-rot nematode'. These studies signal a growing interest in the integrated genomics and ecology of these tractable animals

    Estimating translational selection in Eukaryotic Genomes

    Get PDF
    Natural selection on codon usage is a pervasive force that acts on a large variety of prokaryotic and eukaryotic genomes. Despite this, obtaining reliable estimates of selection on codon usage has proved complicated, perhaps due to the fact that the selection coefficients involved are very small. In this work, a population genetics model is used to measure the strength of selected codon usage bias, S, in 10 eukaryotic genomes. It is shown that the strength of selection is closely linked to expression and that reliable estimates of selection coefficients can only be obtained for genes with very similar expression levels. We compare the strength of selected codon usage for orthologous genes across all 10 genomes classified according to expression categories. Fungi genomes present the largest S values (2.24–2.56), whereas multicellular invertebrate and plant genomes present more moderate values (0.61–1.91). The large mammalian genomes (human and mouse) show low S values (0.22–0.51) for the most highly expressed genes. This might not be evidence for selection in these organisms as the technique used here to estimate S does not properly account for nucleotide composition heterogeneity along such genomes. The relationship between estimated S values and empirical estimates of population size is presented here for the first time. It is shown, as theoretically expected, that population size has an important role in the operativity of translational selection

    Penalized likelihood for sparse contingency tables with an application to full-length cDNA libraries

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The joint analysis of several categorical variables is a common task in many areas of biology, and is becoming central to systems biology investigations whose goal is to identify potentially complex interaction among variables belonging to a network. Interactions of arbitrary complexity are traditionally modeled in statistics by log-linear models. It is challenging to extend these to the high dimensional and potentially sparse data arising in computational biology. An important example, which provides the motivation for this article, is the analysis of so-called full-length cDNA libraries of alternatively spliced genes, where we investigate relationships among the presence of various exons in transcript species.</p> <p>Results</p> <p>We develop methods to perform model selection and parameter estimation in log-linear models for the analysis of sparse contingency tables, to study the interaction of two or more factors. Maximum Likelihood estimation of log-linear model coefficients might not be appropriate because of the presence of zeros in the table's cells, and new methods are required. We propose a computationally efficient ℓ<sub>1</sub>-penalization approach extending the Lasso algorithm to this context, and compare it to other procedures in a simulation study. We then illustrate these algorithms on contingency tables arising from full-length cDNA libraries.</p> <p>Conclusion</p> <p>We propose regularization methods that can be used successfully to detect complex interaction patterns among categorical variables in a broad range of biological problems involving categorical variables.</p

    The G Protein–Coupled Receptor Subset of the Chicken Genome

    Get PDF
    G protein–coupled receptors (GPCRs) are one of the largest families of proteins, and here we scan the recently sequenced chicken genome for GPCRs. We use a homology-based approach, utilizing comparisons with all human GPCRs, to detect and verify chicken GPCRs from translated genomic alignments and Genscan predictions. We present 557 manually curated sequences for GPCRs from the chicken genome, of which 455 were previously not annotated. More than 60% of the chicken Genscan gene predictions with a human ortholog needed curation, which drastically changed the average percentage identity between the human–chicken orthologous pairs (from 56.3% to 72.9%). Of the non-olfactory chicken GPCRs, 79% had a one-to-one orthologous relationship to a human GPCR. The Frizzled, Secretin, and subgroups of the Rhodopsin families have high proportions of orthologous pairs, although the percentage of amino acid identity varies. Other groups show large differences, such as the Adhesion family and GPCRs that bind exogenous ligands. The chicken has only three bitter Taste 2 receptors, and it also lacks an ortholog to human TAS1R2 (one of three GPCRs in the human genome in the Taste 1 receptor family [TAS1R]), implying that the chicken's ability and mode of detecting both bitter and sweet taste may differ from the human's. The chicken genome contains at least 229 olfactory receptors, and the majority of these (218) originate from a chicken-specific expansion. To our knowledge, this dataset of chicken GPCRs is the largest curated dataset from a single gene family from a non-mammalian vertebrate. Both the updated human GPCR dataset, as well the chicken GPCR dataset, are available for download

    Fluctuating selection models and Mcdonald-Kreitman type analyses

    Get PDF
    It is likely that the strength of selection acting upon a mutation varies through time due to changes in the environment. However, most population genetic theory assumes that the strength of selection remains constant. Here we investigate the consequences of fluctuating selection pressures on the quantification of adaptive evolution using McDonald-Kreitman (MK) style approaches. In agreement with previous work, we show that fluctuating selection can generate evidence of adaptive evolution even when the expected strength of selection on a mutation is zero. However, we also find that the mutations, which contribute to both polymorphism and divergence tend, on average, to be positively selected during their lifetime, under fluctuating selection models. This is because mutations that fluctuate, by chance, to positive selected values, tend to reach higher frequencies in the population than those that fluctuate towards negative values. Hence the evidence of positive adaptive evolution detected under a fluctuating selection model by MK type approaches is genuine since fixed mutations tend to be advantageous on average during their lifetime. Never-the-less we show that methods tend to underestimate the rate of adaptive evolution when selection fluctuates

    Nemo: a computational tool for analyzing nematode locomotion

    Get PDF
    The nematode Caenorhabditis elegans responds to an impressive range of chemical, mechanical and thermal stimuli and is extensively used to investigate the molecular mechanisms that mediate chemosensation, mechanotransduction and thermosensation. The main behavioral output of these responses is manifested as alterations in animal locomotion. Monitoring and examination of such alterations requires tools to capture and quantify features of nematode movement. In this paper, we introduce Nemo (nematode movement), a computationally efficient and robust two-dimensional object tracking algorithm for automated detection and analysis of C. elegans locomotion. This algorithm enables precise measurement and feature extraction of nematode movement components. In addition, we develop a Graphical User Interface designed to facilitate processing and interpretation of movement data. While, in this study, we focus on the simple sinusoidal locomotion of C. elegans, our approach can be readily adapted to handle complicated locomotory behaviour patterns by including additional movement characteristics and parameters subject to quantification. Our software tool offers the capacity to extract, analyze and measure nematode locomotion features by processing simple video files. By allowing precise and quantitative assessment of behavioral traits, this tool will assist the genetic dissection and elucidation of the molecular mechanisms underlying specific behavioral responses.Comment: 12 pages, 2 figures. accepted by BMC Neuroscience 2007, 8:8
    corecore