2,886 research outputs found

    The Capacity of Some P\'olya String Models

    Get PDF
    We study random string-duplication systems, which we call P\'olya string models. These are motivated by DNA storage in living organisms, and certain random mutation processes that affect their genome. Unlike previous works that study the combinatorial capacity of string-duplication systems, or various string statistics, this work provides exact capacity or bounds on it, for several probabilistic models. In particular, we study the capacity of noisy string-duplication systems, including the tandem-duplication, end-duplication, and interspersed-duplication systems. Interesting connections are drawn between some systems and the signature of random permutations, as well as to the beta distribution common in population genetics

    Evolution of k-mer Frequencies and Entropy in Duplication and Substitution Mutation Systems

    Get PDF
    Genomic evolution can be viewed as string-editing processes driven by mutations. An understanding of the statistical properties resulting from these mutation processes is of value in a variety of tasks related to biological sequence data, e.g., estimation of model parameters and compression. At the same time, due to the complexity of these processes, designing tractable stochastic models and analyzing them are challenging. In this paper, we study two kinds of systems, each representing a set of mutations. In the first system, tandem duplications and substitution mutations are allowed and in the other, interspersed duplications. We provide stochastic models and, via stochastic approximation, study the evolution of substring frequencies for these two systems separately. Specifically, we show that k-mer frequencies converge almost surely and determine the limit set. Furthermore, we present a method for finding upper bounds on entropy for such systems

    Transcriptome sequencing reveals genome-wide variation in molecular evolutionary rate among ferns

    Get PDF
    Orthogroup data file. Zipped folder containing fasta-formatted reads identified by ProteinOrtho, used for all downstream orthogroup determination and analysis, along with readme document and relevant project-specific scripts (also available online via Dryad: http://dx.doi.org/10.5061/dryad.rg22j ). (ZIP 12021 kb

    Population Genetics of Duplicated Alternatively Spliced Exons of the Dscam Gene in Daphnia and Drosophila

    Get PDF
    In insects and crustaceans, the Down syndrome cell adhesion molecule (Dscam) occurs in many different isoforms. These are produced by mutually exclusive alternative splicing of dozens of tandem duplicated exons coding for parts or whole immunoglobulin (Ig) domains of the Dscam protein. This diversity plays a role in the development of the nervous system and also in the immune system. Structural analysis of the protein suggested candidate epitopes where binding to pathogens could occur. These epitopes are coded by regions of the duplicated exons and are therefore diverse within individuals. Here we apply molecular population genetics and molecular evolution analyses using Daphnia magna and several Drosophila species to investigate the potential role of natural selection in the divergence between orthologs of these duplicated exons among species, as well as between paralogous exons within species. We found no evidence for a role of positive selection in the divergence of these paralogous exons. However, the power of this test was low, and the fact that no signs of gene conversion between paralogous exons were found suggests that paralog diversity may nonetheless be maintained by selection. The analysis of orthologous exons in Drosophila and in Daphnia revealed an excess of non-synonymous polymorphisms in the epitopes putatively involved in pathogen binding. This may be a sign of balancing selection. Indeed, in Dr. melanogaster the same derived non-synonymous alleles segregate in several populations around the world. Yet other hallmarks of balancing selection were not found. Hence, we cannot rule out that the excess of non-synonymous polymorphisms is caused by segregating slightly deleterious alleles, thus potentially indicating reduced selective constraints in the putative pathogen binding epitopes of Dscam

    Adaptive evolution after duplication of penaeidin antimicrobial peptides

    Get PDF
    Penaeidin antimicrobial peptides in penaeid shrimps are an important component of their innate immune system that provides immunity against infection caused by several gram-positive bacteria and filamentous fungal species. Despite the knowledge on the identification and characterization of these peptides in penaeid shrimps, little is known about the evolutionary pattern of these peptides and the underlying genetic mechanisms that maintain high sequence diversities in the penaeidin gene family. Based on the phylogenetic analyses and maximum likelihood-based codon substitution analyses, here we present the convincing evidence that multiple copies of penaeidins have evolved by gene duplication, and positive Darwinian selection (adaptive evolution) is the likely cause of accelerated rate of amino acid substitutions among these duplicated genes.fisheries of the country has led to a remarkable increase in fish landings, the problem of by-catch and targeted juvenile fishing is ever increasing. The present analysis shows that the contribution in the form of annual average profit by various craft–gear combinations is often not sufficient to compensate the overall loss generated by the same to the fishery through the harvest of juvenile fishes. Even though the fishermen gain some transient economic incentives from the juveniles landed, the estimated economic loss calculated was at about US$ 19,445 million year−1 from the mechanised as well as motorised sector. Both intergenerational and conventional discounting was applied to show the net present value (NPV) of future loss due to juvenile fishing. Some of the less capital intensive gears also substantially contribute towards the economic deficit caused by juvenile fishing. We suggest that, while considering the ecosystem impacts of accidental by-catch and intentional juvenile fishing, the economic impacts also should be taken into account prior to formulating any management measures. The study provides an insight to the cost of juvenile fishing in a multi-species multi-gear fishery, where a homogeneous management system is ineffective. The possible causes of increased growth overfishing in the country and subsequent economic loss to the industry are also discussed

    The capacity of some Pólya string models

    Get PDF
    We study random string-duplication systems, called Pólya string models, motivated by certain random mutation processes in the genome of living organisms. Unlike previous works that study the combinatorial capacity of string-duplication systems, or peripheral properties such as symbol frequency, this work provides exact capacity or bounds on it, for several probabilistic models. In particular, we give the exact capacity of the random tandem-duplication system, and the end-duplication system, and bound the capacity of the complement tandem-duplication system. Interesting connections are drawn between the former and the beta distribution common to population genetics, as well as between the latter system and signatures of random permutations

    Comparative mitochondrial genomics of snakes: extraordinary substitution rate dynamics and functionality of the duplicate control region

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The mitochondrial genomes of snakes are characterized by an overall evolutionary rate that appears to be one of the most accelerated among vertebrates. They also possess other unusual features, including short tRNAs and other genes, and a duplicated control region that has been stably maintained since it originated more than 70 million years ago. Here, we provide a detailed analysis of evolutionary dynamics in snake mitochondrial genomes to better understand the basis of these extreme characteristics, and to explore the relationship between mitochondrial genome molecular evolution, genome architecture, and molecular function. We sequenced complete mitochondrial genomes from Slowinski's corn snake (<it>Pantherophis slowinskii</it>) and two cottonmouths (<it>Agkistrodon piscivorus</it>) to complement previously existing mitochondrial genomes, and to provide an improved comparative view of how genome architecture affects molecular evolution at contrasting levels of divergence.</p> <p>Results</p> <p>We present a Bayesian genetic approach that suggests that the duplicated control region can function as an additional origin of heavy strand replication. The two control regions also appear to have different intra-specific versus inter-specific evolutionary dynamics that may be associated with complex modes of concerted evolution. We find that different genomic regions have experienced substantial accelerated evolution along early branches in snakes, with different genes having experienced dramatic accelerations along specific branches. Some of these accelerations appear to coincide with, or subsequent to, the shortening of various mitochondrial genes and the duplication of the control region and flanking tRNAs.</p> <p>Conclusion</p> <p>Fluctuations in the strength and pattern of selection during snake evolution have had widely varying gene-specific effects on substitution rates, and these rate accelerations may have been functionally related to unusual changes in genomic architecture. The among-lineage and among-gene variation in rate dynamics observed in snakes is the most extreme thus far observed in animal genomes, and provides an important study system for further evaluating the biochemical and physiological basis of evolutionary pressures in vertebrate mitochondria.</p

    Identification of New Alleles and the Determination of Alleles and Genotypes Frequencies at the CYP2D6 Gene in Emiratis

    Get PDF
    CYP2D6 belongs to the cytochrome P450 superfamily of enzymes and plays an important role in the metabolism of 20–25% of clinically used drugs including antidepressants. It displays inter-individual and inter-ethnic variability in activity ranging from complete absence to excessive activity which causes adverse drug reactions and toxicity or therapy failure even at normal drug doses. This variability is due to genetic polymorphisms which form poor, intermediate, extensive or ultrarapid metaboliser phenotypes. This study aimed to determine CYP2D6 alleles and their frequencies in the United Arab Emirates (UAE) local population. CYP2D6 alleles and genotypes were determined by direct DNA sequencing in 151 Emiratis with the majority being psychiatric patients on antidepressants. Several new alleles have been identified and in total we identified seventeen alleles and 49 genotypes. CYP2D6*1 (wild type) and CYP2D6*2 alleles (extensive metaboliser phenotype) were found with frequencies of 39.1% and 12.2%, respectively. CYP2D6*41 (intermediate metaboliser) occurred in 15.2%. Homozygous CYP2D6*4 allele (poor metaboliser) was found with a frequency of 2% while homozygous and heterozygous CYP2D6*4 occurred with a frequency of 9%. CYP2D6*2xn, caused by gene duplication (ultrarapid metaboliser) had a frequency of 4.3%. CYP2D6 gene duplication/multiduplication occurred in 16% but only 11.2% who carried more than 2 active functional alleles were considered ultrarapid metabolisers. CYP2D6 gene deletion in one copy occurred in 7.5% of the study group. In conclusion, CYP2D6 gene locus is heterogeneous in the UAE national population and no significant differences have been identified between the psychiatric patients and controls
    corecore