    Repeated divergent selection on pigmentation genes in a rapid finch radiation

    Instances of recent and rapid speciation are suitable for associating phenotypes with their causal genotypes, especially if gene flow homogenizes areas of the genome that are not under divergent selection. We study a rapid radiation of nine sympatric bird species known as capuchino seedeaters, which are differentiated in sexually selected characters of male plumage and song. We sequenced the genomes of a phenotypically diverse set of species to search for differentiated genomic regions. Capuchinos show differences in a small proportion of their genomes, yet selection has acted independently on the same targets in different members of this radiation. Many divergent regions contain genes involved in the melanogenesis pathway, with the strongest signal originating from putative regulatory regions. Selection has acted on these same genomic regions in different lineages, likely shaping the evolution of cis-regulatory elements, which control how more conserved genes are expressed and thereby generate diversity in classically sexually selected traits.
    Affiliation: Campagna, Leonardo. Cornell University; United States. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
    Affiliation: Repenning, Márcio. Pontificia Universidade Católica do Rio Grande do Sul. Museu de Ciências e Tecnologia; Brazil
    Affiliation: Silveira, Luís Fábio. Universidade de Sao Paulo; Brazil
    Affiliation: Fontana, Carla Suertegaray. Pontificia Universidade Católica do Rio Grande do Sul; Brazil
    Affiliation: Tubaro, Pablo Luis. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Museo Argentino de Ciencias Naturales "Bernardino Rivadavia"; Argentina
    Affiliation: Lovette, Irby. Cornell University; United States

    Detecting Clusters of Mutations

    Positive selection for protein function can lead to multiple mutations within a small stretch of DNA, i.e., to a cluster of mutations. Recently, Wagner proposed a method to detect such mutation clusters. His method, however, did not take into account that residues with high solvent accessibility are inherently more variable than residues with low solvent accessibility. Here, we propose a new algorithm to detect clustered evolution. Our algorithm controls for different substitution probabilities at buried and exposed sites in the tertiary protein structure, and uses random permutations to calculate accurate P values for inferred clusters. We apply the algorithm to genomes of bacteria, fly, and mammals, and find several clusters of mutations in functionally important regions of proteins. Surprisingly, clustered evolution is a relatively rare phenomenon: only between 2% and 10% of the genes we analyze contain a statistically significant mutation cluster. We also find that not controlling for solvent accessibility leads to an excess of clusters in terminal and solvent-exposed regions of proteins. Our algorithm provides a novel method to identify functionally relevant divergence between groups of species. Moreover, it could also be useful for detecting artifacts in automatically assembled genomes.
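
    The permutation idea described in this abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the window-count statistic, the function names (`max_window_count`, `cluster_p_value`), the window size, and the simple two-class (buried versus exposed) shuffling are all assumptions chosen for illustration. Mutated sites are reshuffled separately within the exposed and buried classes, so that the higher variability of exposed residues does not by itself produce a "cluster".

```python
import random

def max_window_count(positions, window):
    """Largest number of mutated sites inside any stretch of `window` residues."""
    pos = sorted(positions)
    best, left = 0, 0
    for right in range(len(pos)):
        # Shrink the window from the left until it spans fewer than `window` residues.
        while pos[right] - pos[left] >= window:
            left += 1
        best = max(best, right - left + 1)
    return best

def cluster_p_value(mutated, exposed, length, window=10, n_perm=2000, seed=0):
    """Permutation P value for the densest window (hypothetical statistic).

    `mutated` and `exposed` are 0-based residue indices below `length`.
    The number of mutations in each class (exposed, buried) is held fixed.
    """
    rng = random.Random(seed)
    mutated, exposed = set(mutated), set(exposed)
    exposed_sites = [i for i in range(length) if i in exposed]
    buried_sites = [i for i in range(length) if i not in exposed]
    n_exposed = len(mutated & exposed)
    n_buried = len(mutated) - n_exposed
    observed = max_window_count(mutated, window)
    hits = 0
    for _ in range(n_perm):
        # Stratified shuffle: redraw mutated sites within each accessibility class.
        shuffled = rng.sample(exposed_sites, n_exposed) + rng.sample(buried_sites, n_buried)
        if max_window_count(shuffled, window) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)

# Toy protein: 200 residues, even positions treated as exposed (made-up data).
p = cluster_p_value([10, 12, 14, 16, 18, 150], range(0, 200, 2), length=200)
```

    In this toy input, five of the six mutations fall within nine residues, so the estimated P value is small. A real analysis would need accessibility values from a solved or modelled structure and a correction for testing many genes.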

    Mobilomics in Saccharomyces cerevisiae Strains

    Background: Mobile Genetic Elements (MGEs) are selfish DNA sequences integrated into genomes. Their detection is mainly based on consensus-like searches that scan the investigated genome against the sequence of an already identified MGE. Mobilomics aims at discovering all the MGEs in a genome and understanding their dynamic behavior; the data for this kind of investigation can be provided by comparative genomics of closely related organisms. The amount of data involved requires substantial computational effort, which should be reduced. Results: Our approach exploits the high similarity among homologous chromosomes of different strains of the same species, following a progressive comparative genomics philosophy. We introduce a software tool based on our new fast algorithm, called regender, which is able to identify the conserved regions between chromosomes. Our case study uses a unique, recently available dataset of 39 different strains of S. cerevisiae, which regender is able to compare in a few minutes. By exploring the non-conserved regions, where MGEs are mainly retrotransposons called Tys, and marking the candidate Tys based on their length, we are able to locate a priori and automatically all the already known Tys and map all the putative Tys in all the strains. The remaining putative mobile elements (PMEs) emerging from this intra-specific comparison are sharp markers of inter-specific evolution: indeed, many events of non-conservation among different yeast strains correspond to PMEs. A clustering based on the presence/absence of the candidate Tys in the strains suggests an evolutionary interconnection that is very similar to classic phylogenetic trees based on SNP analysis, even though it is computed without using phylogenetic information. Conclusions: The case study indicates that the proposed methodology brings two major advantages: (a) it does not require any template sequence for the wanted MGEs and (b) it can also be applied to infer MGEs in low-coverage genomes with unresolved bases, where traditional approaches are largely ineffective.

    Promoter-sharing by different genes in human genome – CPNE1 and RBM12 gene pair as an example

    Background: Regulation of gene expression plays an important role in cellular functions. Co-regulation of different genes may indicate a functional connection or even a physical interaction between gene products. Thus, analysis of genomic structures that may affect the regulation of gene expression could shed light on the functions of genes. Results: In a whole-genome analysis of alternative splicing events, we found that two distinct genes, copine I (CPNE1) and RNA binding motif protein 12 (RBM12), share their most 5' exons and therefore their promoter region in human. Further analysis identified many gene pairs in the human genome that share the same promoters and 5' exons but have totally different coding sequences. Analysis of genomic and expressed sequences, either cDNAs or expressed sequence tags (ESTs), for CPNE1 and RBM12 confirmed that this phenomenon has been conserved over the course of evolution. Co-expression of the two genes from the same promoter was confirmed by Reverse Transcription-Polymerase Chain Reaction (RT-PCR) in different tissues in both human and mouse. High degrees of sequence conservation among multiple species were also identified in the 5' UTR common to CPNE1 and RBM12. Conclusion: Promoter and 5' UTR sharing between CPNE1 and RBM12 is observed in human, mouse and zebrafish. Conservation of this genomic structure over the course of evolution indicates a potential functional interaction between the two genes. More than 20 other gene pairs in the human genome were found to have a similar genomic structure in a genome-wide analysis, and it may represent a unique pattern of genomic arrangement that affects the regulation of expression of the corresponding genes.

    Bayesian machine learning methods for predicting protein-peptide interactions and detecting mosaic structures in DNA sequences alignments

    Short well-defined domains known as peptide recognition modules (PRMs) regulate many important protein-protein interactions involved in the formation of macromolecular complexes and biochemical pathways. High-throughput experiments like yeast two-hybrid and phage display are expensive and intrinsically noisy; it would therefore be desirable to target informative interactions and pursue in silico approaches. We propose a probabilistic discriminative approach for predicting PRM-mediated protein-protein interactions from sequence data. The model suffered from over-fitting, so Laplacian regularisation was found to be important in achieving a reasonable generalisation performance. A hybrid approach yielded the best performance, where the binding site motifs were initialised with the predictions of a generative model. We also propose another discriminative model which can be applied to all sequences present in the organism at a significantly lower computational cost. This is due to its additional assumption that the underlying binding sites tend to be similar. It is difficult to distinguish between the binding site motifs of the PRM due to the small number of instances of each binding site motif. However, closely related species are expected to share similar binding sites, which would be expected to be highly conserved. We investigated rate variation along DNA sequence alignments, modelling confounding effects such as recombination. Traditional approaches to phylogenetic inference assume that a single phylogenetic tree can represent the relationships and divergences between the taxa. However, taxa sequences exhibit varying levels of conservation, e.g. due to regulatory elements and active binding sites, and certain bacteria and viruses undergo interspecific recombination. We propose a phylogenetic factorial hidden Markov model to infer recombination and rate variation. We examined the performance of our model and inference scheme on various synthetic alignments, and compared it to state-of-the-art breakpoint models. We investigated three DNA sequence alignments: one of maize actin genes, one bacterial (Neisseria), and one of HIV-1. Inference is carried out in the Bayesian framework, using Reversible Jump Markov Chain Monte Carlo.

    Anchoring linkage groups of the Rosa genetic map to physical chromosomes with tyramide-FISH and EST-SNP markers

    To anchor Rosa linkage groups to physical chromosomes, a combination of the Tyramide-FISH technology and the modern molecular marker system based on High Resolution Melting (HRM) is an efficient approach. Although Tyramide-FISH is a very promising technique for the visualization of short DNA probes, it is very challenging for plant species with small chromosomes such as Rosa. In this study, we successfully applied the Tyramide-FISH technique to Rosa and compared different detection systems. An indirect detection system exploiting biotinylated tyramides was shown to be the most suitable technique for reliable signal detection. Three gene fragments with a size of 1100-1700 bp (Phenylalanine Ammonia Lyase, Pyrroline-5-Carboxylate Synthase and Orcinol O-Methyl Transferase) have been physically mapped on chromosomes 7, 4 and 1, respectively, of Rosa wichurana. The signal frequency was between 25% and 40%. HRM markers of these three gene fragments were used to place the gene fragments on the existing genetic linkage map of Rosa wichurana. As a result, three linkage groups could be anchored to their physical chromosomes. The information was used to check for synteny between the Rosa chromosomes and Fragaria.

    Mechanistic behaviour and molecular interactions of heat shock protein 47 (HSP47)

    This project involves the study of heat shock protein 47 (HSP47), which is a molecular chaperone crucial for collagen biosynthesis. It exhibits a high degree of sequence homology with members of the serine protease inhibitor (serpin) superfamily, though HSP47 does not possess inhibitory activity. It is a single-substrate chaperone and binds only to collagen. 'Knock-out' of the hsp47 gene impairs the secretion of correctly folded collagen triple helix molecules, leading to embryonic lethality in mice. Thus the aim of this project was to elucidate the specific mechanism that governs the binding to and release from collagen at the molecular level, known as the 'pH-switch mechanism'. Emphasis is placed on histidine (His) residues, as the HSP47-collagen dissociation pH is similar to the pKa of the imidazole side chain of His residues. Site-directed mutagenesis was used to mutate surface His residues, based on a mouse HSP47 homology model. The effects of the mutations on the behaviour of HSP47 were then assessed by collagen binding assays and structural analyses with circular dichroism (CD). All mutants were found to have good solubility and to retain their ability to bind collagen like wild-type HSP47 in batch assays, but perturbed behaviour was seen in column experiments. Mutation of the His residue at position 191 (H191) causes a shift in the collagen dissociation pH, while mutation of H197 and/or H198 disrupts the specific HSP47-collagen interaction. H191, H197 and H198 are predicted to be located in the region near the C-terminus of strand 3 of β-sheet A (s3A) in the homology model, a region specifically known as the 'breach cluster' in serpin nomenclature. The extent of conformational rearrangement of this region was further investigated by means of intrinsic tryptophan fluorescence spectroscopy using a series of single tryptophan (Trp) mutants. Results from analyses performed on the mutants did not contradict the observations from the His mutational work, as Trp residues in the 'breach cluster' are likely to be located in the dynamic region of the HSP47 pH-triggered conformational change. In conclusion, this study establishes the importance of His residues in the 'breach cluster' to HSP47 pH-switch behaviour. Finally, a model for the HSP47 pH-switch mechanism was proposed from data obtained via mutagenesis experiments. The model is hoped to assist future research into HSP47 cellular behaviour and may also be of use in therapeutic applications involving the molecular chaperone.
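
    The link this abstract draws between the dissociation pH and the His side-chain pKa can be made concrete with the Henderson-Hasselbalch relation. The sketch below is illustrative only: the function name `fraction_protonated` is hypothetical, and the default pKa of 6.0 is an assumed textbook value for a free imidazole side chain. The pKa of a given His residue in folded HSP47 may differ and is not stated in the source.

```python
def fraction_protonated(ph, pka=6.0):
    """Fraction of an ionisable group in its protonated form at a given pH.

    Henderson-Hasselbalch: pH = pKa + log10([base]/[acid]), so the protonated
    (acid) fraction is 1 / (1 + 10**(pH - pKa)).
    pka=6.0 is an assumed value for a free histidine imidazole group.
    """
    return 1.0 / (1.0 + 10.0 ** (ph - pka))

# Protonated fraction across a range of pH values around the assumed pKa.
for ph in (7.0, 6.5, 6.0, 5.5):
    print(f"pH {ph}: {fraction_protonated(ph):.2f} protonated")
```

    Under these assumptions the protonated fraction rises from roughly one tenth at pH 7 to one half at pH 6, so a modest drop in pH changes the charge state of such a residue substantially.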