73 research outputs found

    Reptilian-transcriptome v1.0, a glimpse in the brain transcriptome of five divergent Sauropsida lineages and the phylogenetic position of turtles

    Background: Reptiles are largely under-represented in comparative genomics despite the fact that they are substantially more diverse in many respects than mammals. Given the high divergence of reptiles from classical model species, next-generation sequencing of their transcriptomes is an approach of choice for gene identification and annotation.
    Results: Here, we use 454 technology to sequence the brain transcriptome of four divergent reptilian and one reference avian species: the Nile crocodile, the corn snake, the bearded dragon, the red-eared turtle, and the chicken. Using an in-house pipeline for recursive similarity searches of >3,000,000 reads against multiple databases from 7 reference vertebrates, we compile a reptilian comparative transcriptomics dataset, with homology assignment for 20,000 to 31,000 transcripts per species and a cumulated non-redundant sequence length of 248.6 Mbases. Our approach identifies the majority (87%) of chicken brain transcripts and about 50% of de novo assembled reptilian transcripts. In addition to 57,502 microsatellite loci, we identify thousands of SNP and indel polymorphisms for population genetic and linkage analyses. We also build very large multiple alignments for Sauropsida and mammals (two million residues per species) and perform extensive phylogenetic analyses suggesting that turtles are not basal living reptiles but are rather associated with Archosaurians, hence, potentially answering a long-standing question in the phylogeny of Amniotes.
    Conclusions: The reptilian transcriptome (freely available at http://www.reptilian-transcriptomes.org) should prove a useful new resource as reptiles are becoming important new models for comparative genomics, ecology, and evolutionary developmental genetics.

    2× genomes - depth does matter

    The use of low coverage genomes in comparative evolutionary analyses skews estimates of gene gains and losses

    Meta-analysis of microarray data of rainbow trout fry gonad differentiation modulated by ethynylestradiol

    Sex differentiation in fish is a highly labile process easily reversed by the use of exogenous hormonal treatment and has led to environmental concerns since low doses of estrogenic molecules can adversely impact fish reproduction. The goal of this study was to identify pathways altered by treatment with ethynylestradiol (EE2) in developing fish and to find new target genes to be tested further for their possible role in male-to-female sex transdifferentiation. To this end, we have successfully adapted a previously developed bioinformatics workflow to a meta-analysis of two datasets studying sex reversal following exposure to EE2 in juvenile rainbow trout. The meta-analysis consisted of retrieving the intersection of the top gene lists generated for both datasets, performed at different levels of stringency. The intersecting gene lists, enriched in true positive differentially expressed genes (DEGs), were subjected to over-representation analysis (ORA) which allowed identifying several statistically significant enriched pathways altered by EE2 treatment and several new candidate pathways, such as progesterone-mediated oocyte maturation and PPAR signalling. Moreover, several relevant key genes potentially implicated in the early transdifferentiation process were selected. Altogether, the results show that EE2 has a great effect on gene expression in juvenile rainbow trout. The feminization process seems to result from the altered transcription of genes implicated in normal female gonad differentiation, resulting in expression similar to that observed in normal females (i.e. the repression of key testicular markers cyp17a1, cyp11b, tbx1), as well as from other genes (including transcription factors) that respond specifically to the EE2 treatment. The results also showed that the bioinformatics workflow can be applied to different types of microarray platforms and could be generalized to (eco)toxicogenomics studies for environmental risk assessment purposes
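
The meta-analysis step described above (intersecting the top gene lists of two datasets, then testing the intersection for pathway over-representation) can be sketched as follows. This is a minimal illustration and not the authors' workflow: the gene names, scores, pathway membership, and the helper names `top_genes` and `hypergeom_pvalue` are all hypothetical, and the one-sided hypergeometric test is assumed here as a common choice for ORA.

```python
from math import comb

def top_genes(scores, n):
    """Return the n highest-scoring genes (ties broken by gene name)."""
    ranked = sorted(scores, key=lambda g: (-scores[g], g))
    return set(ranked[:n])

def hypergeom_pvalue(k, K, n, N):
    """P(X >= k): chance of seeing at least k pathway genes when n genes
    are drawn from a universe of N genes, K of which are in the pathway."""
    upper = min(K, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / comb(N, n)

# Hypothetical differential-expression scores for two datasets.
dataset_a = {"g1": 5.0, "g2": 4.0, "g3": 3.0, "g4": 1.0, "g5": 0.5, "g6": 0.1}
dataset_b = {"g1": 4.5, "g3": 4.0, "g5": 3.5, "g2": 1.0, "g4": 0.2, "g6": 0.1}

# Intersection of the top-3 lists; a larger n means a less stringent cut-off.
shared = top_genes(dataset_a, 3) & top_genes(dataset_b, 3)

# Hypothetical pathway annotation over the same six-gene universe.
pathway = {"g1", "g3", "g6"}
p_value = hypergeom_pvalue(len(shared & pathway), len(pathway),
                           len(shared), len(dataset_a))
print(sorted(shared), p_value)
```

Repeating the intersection for several values of `n` mimics running the analysis at different levels of stringency.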

    Historical Constraints on Vertebrate Genome Evolution

    Recent analyses indicated that genes with larger effect of knockout or mutation and with larger probability to revert to single copy after whole genome duplication are expressed earlier in development. Here, we further investigate whether tissue specificity of gene expression is constrained by the age of origin of the corresponding genes. We use 38 metazoan genomes and a comparative genomic application system to integrate inference of gene duplication with expression data from 17,503 human genes into a strictly phylogenetic framework. We show that the number of anatomical systems in which genes are expressed decreases steadily with decreased age of the genes’ first appearance in the phylogeny: the oldest genes are expressed, on average, in twice as many anatomical systems than the genes gained recently in evolution. These results are robust to different sources of expression data, to different levels of the anatomical system hierarchy, and to the use of gene families rather than duplication events. Finally, we show that the rate of increase in gene tissue specificity correlates with the relative rate of increase in the maximum number of cell types in the corresponding taxa. Although subfunctionalization and increase in cell type number throughout evolution could constitute, respectively, the proximal and ultimate causes of this correlation, the two phenomena are intermingled. Our analyses identify a striking historical constraint in gene expression: the number of cell types in existence at the time of a gene appearance (through duplication or de novo origination) tends to determine its level of tissue specificity for tens or hundreds of millions of years

    gViz, a novel tool for the visualization of co-expression networks

    Background: The quantity of microarray data available on the Internet has grown dramatically over the past years and now represents millions of Euros worth of underused information. One way to use this data is through co-expression analysis. To avoid a certain amount of bias, such data must often be analyzed at the genome scale, for example by network representation. The identification of co-expression networks is an important means to unravel gene to gene interactions and the underlying functional relationship between them. However, it is very difficult to explore and analyze a network of such dimensions. Several programs (Cytoscape, yEd) have already been developed for network analysis; however, to our knowledge, there are no available GraphML compatible programs.
    Findings: We designed and developed gViz, a GraphML network visualization and exploration tool. gViz is built on clustering coefficient-based algorithms and is a novel tool to visualize and manipulate networks of co-expression interactions among a selection of probesets (each representing a single gene or transcript), based on a set of microarray co-expression data stored as an adjacency matrix.
    Conclusions: We present here gViz, a software tool designed to visualize and explore large GraphML networks, combining network theory, biological annotation data, microarray data analysis and advanced graphical features.
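
The abstract does not spell out gViz's algorithms, so the following is only a sketch of the standard local clustering coefficient computed from a co-expression adjacency matrix. The function name `local_clustering`, the threshold value, and the toy matrix are assumptions made for illustration.

```python
def local_clustering(adj, threshold=0.0):
    """Local clustering coefficient of every node in a symmetric weighted
    adjacency matrix. Two nodes are linked when their weight exceeds
    `threshold`; self-loops are ignored."""
    n = len(adj)
    neighbours = [
        {j for j in range(n) if j != i and adj[i][j] > threshold}
        for i in range(n)
    ]
    coeffs = []
    for i in range(n):
        nb = sorted(neighbours[i])
        k = len(nb)
        if k < 2:
            # Fewer than two neighbours: no triangle is possible.
            coeffs.append(0.0)
            continue
        # Count links that exist between pairs of neighbours of node i.
        links = sum(
            1
            for a in range(k)
            for b in range(a + 1, k)
            if nb[b] in neighbours[nb[a]]
        )
        coeffs.append(2.0 * links / (k * (k - 1)))
    return coeffs

# Hypothetical co-expression matrix for four probesets.
coexpression = [
    [1.0, 0.9, 0.8, 0.7],
    [0.9, 1.0, 0.85, 0.1],
    [0.8, 0.85, 1.0, 0.2],
    [0.7, 0.1, 0.2, 1.0],
]
coeffs = local_clustering(coexpression, threshold=0.5)
print(coeffs)
```

Nodes whose neighbours are densely interconnected score close to 1, which is one way tightly co-expressed groups can be singled out in a large network.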

    MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics

    Background: The development, in the last decade, of stochastic heuristics implemented in robust application softwares has made large phylogeny inference a key step in most comparative studies involving molecular sequences. Still, the choice of a phylogeny inference software is often dictated by a combination of parameters not related to the raw performance of the implemented algorithm(s) but rather by practical issues such as ergonomics and/or the availability of specific functionalities.
    Results: Here, we present MetaPIGA v2.0, a robust implementation of several stochastic heuristics for large phylogeny inference (under maximum likelihood), including a Simulated Annealing algorithm, a classical Genetic Algorithm, and the Metapopulation Genetic Algorithm (metaGA) together with complex substitution models, discrete Gamma rate heterogeneity, and the possibility to partition data. MetaPIGA v2.0 also implements the Likelihood Ratio Test, the Akaike Information Criterion, and the Bayesian Information Criterion for automated selection of substitution models that best fit the data. Heuristics and substitution models are highly customizable through manual batch files and command line processing. However, MetaPIGA v2.0 also offers an extensive graphical user interface for parameters setting, generating and running batch files, following run progress, and manipulating result trees. MetaPIGA v2.0 uses standard formats for data sets and trees, is platform independent, runs in 32 and 64-bits systems, and takes advantage of multiprocessor and multicore computers.
    Conclusions: The metaGA resolves the major problem inherent to classical Genetic Algorithms by maintaining high inter-population variation even under strong intra-population selection. Implementation of the metaGA together with additional stochastic heuristics into a single software will allow rigorous optimization of each heuristic as well as a meaningful comparison of performances among these algorithms. MetaPIGA v2.0 gives access both to high customization for the phylogeneticist, as well as to an ergonomic interface and functionalities assisting the non-specialist for sound inference of large phylogenetic trees using nucleotide sequences. MetaPIGA v2.0 and its extensive user-manual are freely available to academics at http://www.metapiga.org.
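
The information criteria named in the abstract can be illustrated with a short sketch. This is not MetaPIGA code: the log-likelihoods and alignment length are made-up numbers, and counting only the free substitution-model parameters (branch lengths left out because they are shared across models on a fixed tree) is a simplifying assumption.

```python
from math import log

def aic(log_likelihood, n_params):
    """Akaike Information Criterion: 2k - 2 ln L (lower is better)."""
    return 2 * n_params - 2 * log_likelihood

def bic(log_likelihood, n_params, n_sites):
    """Bayesian Information Criterion: k ln n - 2 ln L (lower is better)."""
    return n_params * log(n_sites) - 2 * log_likelihood

# Hypothetical (log-likelihood, free parameters) for three nucleotide models.
models = {
    "JC69": (-5200.0, 0),   # equal rates, equal base frequencies
    "HKY85": (-5100.0, 4),  # ts/tv ratio + 3 free base frequencies
    "GTR": (-5098.0, 8),    # 5 free exchange rates + 3 free base frequencies
}
n_sites = 1000  # hypothetical alignment length

best_aic = min(models, key=lambda m: aic(*models[m]))
best_bic = min(models, key=lambda m: bic(*models[m], n_sites))
print(best_aic, best_bic)
```

In this toy case the small likelihood gain of GTR over HKY85 does not offset its four extra parameters, so both criteria pick HKY85.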

    Développements en phylogénomique : comparaisons de génomes et estimation de grandes phylogénies [Developments in phylogenomics: genome comparisons and estimation of large phylogenies]

    This thesis presents the software developments in phylogenomics that led to the creation of two programs: MetaPIGA 2.0 and MANTiS. In the first part, we delve into phylogenetic inference, giving a brief overview of the state of the art and then detailing the implementation of MetaPIGA 2.0, a program that infers a phylogeny using metaheuristics. MetaPIGA 2.0 is intended as a complete phylogenetic inference program that (1) brings together the main methods and models currently available, (2) handles very large data sets, (3) is a modern, user-friendly tool, and (4) provides a robust software framework for developing and testing new metaheuristics once they have been adapted to the problem of phylogenetic inference. We also describe in detail the four metaheuristics adapted for MetaPIGA 2.0: Hill Climbing, Simulated Annealing, a genetic algorithm, and the metaGA (a metapopulation genetic algorithm). In the second part of the thesis, we put forward a phylogenetic approach to comparative genomics by developing MANTiS, a program for exploring the content of a selection of eukaryotic genomes along a phylogeny. Orthology and paralogy relationships are identified using phylogenetic trees, so that we can trace the history of duplication events during evolution and determine when a "character" was gained and lost. This mapping of genes onto the branches of the phylogeny also allows us to reconstruct the content of the presumed ancestral genomes. By associating various functional annotations with these data (such as the biological processes and molecular functions in which the genes are involved, or the tissues in which they are expressed), we can study the functional evolution of a genome over time. By focusing on one branch of the phylogeny and counting the genes associated with a given function, we can determine whether that function is more or less represented than in the present-day genome. Finally, as with MetaPIGA 2.0, we made every effort to make MANTiS user-friendly and to offer several ways of exploring the data graphically. We also integrated an interface for interacting dynamically with the full data set, letting the user ask, as simply as possible, a wide range of comparative genomics questions with phylogenetic criteria, which considerably increases the value of the software to the scientific community. (DOCSC03) -- FUNDP, 201