7 research outputs found
The Combinatorics of Tandem Duplication
Tandem duplication is an evolutionary process whereby a segment of DNA is replicated and proximally inserted. The different configurations that can arise from this process give rise to some interesting combinatorial questions. Firstly, we introduce an algebraic formalism to represent this process as a word producing automaton. The number of words arising from n tandem duplications can then be recursively derived. Secondly, each single word accounts for multiple evolutions. With the aid of a bi-coloured 2d- tree, a Hasse diagram corresponding to a partially ordered set is constructed, from which we can count the number of evolutions corresponding to a given word. Thirdly, we implement some subtree prune and graft operations on this structure to show that the total number of possible evolutions arising from n tandem duplications is . The space of structures arising from tandem duplication thus grows at a super-exponential rate with leading order term
Genome-wide microRNA screening in Nile tilapia reveals pervasive isomiRsâ transcription, sex-biased arm switching and increasing complexity of expression throughout development
MicroRNAs (miRNAs) are key regulators of gene expression in multicellular organisms. The elucidation of miRNA function and evolution depends on the identification and characterization of miRNA repertoire of strategic organisms, as the fast-evolving cichlid fishes. Using RNA-seq and comparative genomics we carried out an in-depth report of miRNAs in Nile tilapia (Oreochromis niloticus), an emergent model organism to investigate evo-devo mechanisms. Five hundred known miRNAs and almost one hundred putative novel vertebrate miRNAs have been identified, many of which seem to be teleost-specific, cichlid-specific or tilapia-specific. Abundant miRNA isoforms (isomiRs) were identified with modifications in both 5p and 3p miRNA transcripts. Changes in arm usage (arm switching) of nine miRNAs were detected in early development, adult stage and even between male and female samples. We found an increasing complexity of miRNA expression during ontogenetic development, revealing a remarkable synchronism between the rate of new miRNAs recruitment and morphological changes. Overall, our results enlarge vertebrate miRNA collection and reveal a notable differential ratio of miRNA arms and isoforms influenced by sex and developmental life stage, providing a better picture of the evolutionary and spatiotemporal dynamics of miRNAs
An improved microRNA annotation of the canine genome
The domestic dog, Canis familiaris, is a valuable model for studying human diseases. The publication of the latest Canine genome build and annotation, CanFam3.1 provides an opportunity to enhance our understanding of gene regulation across tissues in the dog model system. In this study, we used the latest dog genome assembly and small RNA sequencing data from 9 different dog tissues to predict novel miRNAs in the dog genome, as well as to annotate conserved miRNAs from the miRBase database that were missing from the current dog annotation. We used both miRCat and miRDeep2 algorithms to computationally predict miRNA loci. The resulting, putative hairpin sequences were analysed in order to discard false positives, based on predicted secondary structures and patterns of small RNA read alignments. Results were further divided into high and low confidence miRNAs, using the same criteria. We generated tissue specific expression profiles for the resulting set of 811 loci: 720 conserved miRNAs, (207 of which had not been previously annotated in the dog genome) and 91 novel miRNA loci. Comparative analyses revealed 8 putative homologues of some novel miRNA in ferret, and one in microbat. All miRNAs were also classified into the genic and intergenic categories, based on the Ensembl RefSeq gene annotation for CanFam3.1. This additionally allowed us to identify four previously undescribed MiRtrons among our total set of miRNAs. We additionally annotated piRNAs, using proTRAC on the same input data. We thus identified 263 putative clusters, most of which (211 clusters) were found to be expressed in testis. Our results represent an important improvement of the dog genome annotation, paving the way to further research on the evolution of gene regulation, as well as on the contribution of post-transcriptional regulation to pathological conditions
Genomic evidence for an African expansion of anatomically-modern humans by a Southern route.
There is general agreement among scientists about a recent (less
than 200,000 yrs ago) African origin of anatomically modern humans, whereas there is still uncertainty about whether, and to what extent, they admixed with archaic populations, which thus may have contributed to the modern populationsâ gene pools. Data on cranial morphology have been interpreted as suggesting that, before the main expansion from Africa through the Near East, anatomically modern humans may also have taken a
Southern route from the Horn of Africa through the Arabian peninsula to India, Melanesia and Australia, about 100,000 yrs ago. This view was recently supported by archaeological findings demonstrating human presence in Eastern Arabia >90,000 yrs ago. In this study we analyzed genetic variation at 111,197 nuclear SNPs in nine populations (Kurumba, Chenchu, Kamsali, Madiga, Mala, Irula, Dalit, Chinese, Japanese), chosen because
their genealogical relationships are expected to differ under the alternative models of expansion (single vs. multiple dispersals). We calculated correlations between genomic distances, and geographic distances estimated under the alternative assumptions of a single dispersal, or multiple dispersals,
and found a significantly stronger association for the multiple dispersal model. If confirmed, this result would cast doubts on the possibility that some non-African populations (i.e., those whose ancestors expanded through the Southern route) may have had any contacts with Neandertals