204 research outputs found
GreenPhylDB: A Gene Family Database for plant functional Genomics
With the increasing number of genomes being sequenced, a major objective is to transfer accurate annotation from characterised proteins to uncharacterised sequences. Consequently, comparative genomics has become a usual and efficient strategy in functional genomics. The release of various annotated genomes of plants, such as _O. sativa_ and _A. thaliana_, has allowed setting up comprehensive lists of gene families defined by automated methods. However, like for gene sequence, manual curation of gene families is an important requirement that has to be undertaken. GreenPhylDB comprises protein sequences of 12 plant species fully sequenced that were grouped into homeomorphic families using similarity-based methods. Clusters are finally processed by phylogenetic analysis to infer orthologs and paralogs that will be particularly helpful to study genome evolution. Previously, each cluster has to be curated (i.e. properly named and classified) using different sources of information. A web interface for plant gene families’ curation was developed for that purpose. This interface, accessible on GreenPhylDB ("http://greenphyl.cirad.fr":http://greenphyl.cirad.fr), centralizes external references (e.g. InterPro, KEGG, Swiss-Prot, PIRSF, Pubmed) related to all gene members of the clusters and shows statistics and automatic analysis. We believe that this synthetic view of data available for a gene cluster, combined with basic guidelines, is an efficient way to provide reliable method for gene family annotations
Function prediction in plant genomes from large scale phylogenomics analyses : P0932
With the increasing number of plant genomes being sequenced, a major challenge is to accurately transfer annotations from well characterized genomes to newly obtained sequences. GreenPhylDB is a database designed for comparative and functional genomics based on complete genome-derived gene sequences. The database currently includes gene families of protein sequences from 22 plant species, including socio-economically important crops like rice, sorghum, maize, cassava and banana. Genes from all these species are organized in clusters based on sequence similarity. The clusters are manually annotated (i.e. properly named and classified) and sequences included in each cluster are characterized by phylogenetic analysis in order to elucidate evolutionary relationships (e.g. orthologs, super-orthologs, in/out-paralogs) among genes. GreenPhyl provides a reliable and stable catalog of gene families useful for annotation on new genome sequences in plants. With its improved user interface, the new release of GreenPhyl keeps the previous gene clustering quality and introduces additional features such as specific search engines (quick search, deep search, InterPro domain combination and GO family browser). The GreenPhyl's pipeline relies on RapGreen, a new version of the RAP reconciliation tool (Dufayard et al, 2005) that allows us to root gene trees and infer orthology relationships between sequences of a family. GreenPhyl version 3 is available at http://www.greenphyl.org and is a collaborative resource of SouthGreen (southgreen.cirad.fr), a bioinformatics platform applied to the genetic and genomic resources analyses of the South and Mediterranean plants. (Résumé d'auteur
Towards a bioinformatics platform for the Musa research community : [Abstract W076]
Current experiments in genomics produce a large amount of data that needs to be organized into databases and broadly accessible. Like other species, the Musa genomics community would benefit from centralized and innovative ways to study its genome. Over the past years, genetic and genomic data (e.g. BAC, EST, Markers) have been generated and stored in databases. Several pipelines of analyses were implemented for gene, transposable element, and expression data analyses, and for comparative genomics such as ortholog predictions via a phylogenomic approach (GreenPhyl). Web tools have been developed or implemented to facilitate access to data, such as genetic makers (TropGeneDB), genetic maps (CMap), a physical Map (GBrowse), and Expressed sequence Tags (ESTtik) gene/TE predictions, and to allow online manual genome annotation (GnpAnnot). The number of tools may continue to grow, in particular with the near release of the Musa genome sequence, and the increase of Next-Generation Sequencing (NGS) facilities. The GMGC website (http://www.musagenomics.org) is a place where data can be shared, and where databases and tools can be listed in an homogeneous way to serve the Musa genomics community. It is intended to provide the researchers interested in Musa with a common set of resources in order to work more efficiently and effectively. (Texte intégral
A comprehensive approach to dark matter studies: exploration of simplified top-philic models
Studies of dark matter lie at the interface of collider physics, astrophysics
and cosmology. Constraining models featuring dark matter candidates entails the
capability to provide accurate predictions for large sets of observables and
compare them to a wide spectrum of data. We present a framework which, starting
from a model lagrangian, allows one to consistently and systematically make
predictions, as well as to confront those predictions with a multitude of
experimental results. As an application, we consider a class of simplified dark
matter models where a scalar mediator couples only to the top quark and a
fermionic dark sector (i.e. the simplified top-philic dark matter model). We
study in detail the complementarity of relic density, direct/indirect detection
and collider searches in constraining the multi-dimensional model parameter
space, and efficiently identify regions where individual approaches to dark
matter detection provide the most stringent bounds. In the context of collider
studies of dark matter, we point out the complementarity of LHC searches in
probing different regions of the model parameter space with final states
involving top quarks, photons, jets and/or missing energy. Our study of dark
matter production at the LHC goes beyond the tree-level approximation and we
show examples of how higher-order corrections to dark matter production
processes can affect the interpretation of the experimental results.Comment: 52 pages, 23 figure
GreenPhyl : Phylogenomic resources for comparative and functional genomics
With the increasing number of plant genomes being sequenced, a major objective is to transfer accurate annotation from characterized sequences to uncharacterized sequences. GreenPhyl (Conte et al., 2008, Nucleic Acid Research, 36: D991-8) is a tool for plant comparative genomics that predicts the function of genes based on their evolutionary relationship with genes of known function. The database (version 2) comprises protein sequences of 16 plant species fully sequenced including socio-economically important crops like rice, sorghum and maize that were grouped into gene families using similarity-based methods. GreenPhyl contains approximately 13,000 gene families being annotated, computational analyzes and external cross-references (InterPro, KEGG, Swiss-Prot, Pubmed) related to all gene members. Once manually annotated (i.e. properly named and classified), gene families are finally processed by phylogenetic analyses to distinguish orthologous and paralogous gene. Orthologous genes descend from the last common ancestor through speciation and most probably encode proteins with a similar function in different species. In addition, the website offers a range of user-friendly tools to query the data. These resources will be particularly helpful to molecular biologist for gene discovery and gene function inference. We believe that a better understanding of genome evolution will contribute to elucidate the genetic basis of important agronomic traits and therefore facilitate ongoing plant breeding efforts.(Texte intégral
Innovative Mars Global International Exploration (IMaGInE) mission
This paper presents the conceptual design of the IMaGInE (Innovative Mars Global International Exploration) Mission whose mission objectives are to deliver a crew of four astronauts to the surface of Deimos and a robotic exploration mission to Phobos for approx-imately 343 days during the years 2031 and 2032, perform surface excursions, technology demonstrations, and In Situ Resource Utilization (ISRU) of the Martian moons as well as site reconnaissance for future human exploration of Mars. This is the winning mission design of the 2016 Revolutionary Aerospace Systems Concepts-Academic Linkage (RASC-AL) competition, awarded with the "Best in Theme," "Best Overall," and "Pio-neering Exceptional Achievement Concept Honor (PEACH)" prizes. This competition was sponsored by NIA and NASA
Current Challenges in Fostering the European Innovation Ecosystem
The present report discusses innovation challenges under the following headings:
The 3% R&D target and industrial structure: is it still a relevant goal? if we cannot achieve the 3% target, does it make sense to keep it?
Technology diffusion: how can we combat its sluggishness and speed up adoption?
Access to finance: is the large amount of liquidity being funnelled to "zombie" companies instead of highly innovative ones?
Universities and skills: are higher education institutions adequately playing their role in driving innovation?
The governance of the R&I system: how to remove administrative barriers and increase flexibility?
Can SSH research contribute more to shaping R&I policies?JRC.B.7-Knowledge for Finance, Innovation and Growt
GreenPhylDB v5: A comparative pangenomic database for plant genomes
Comparative genomics is the analysis of genomic relationships among different species and serves as a significant base for evolutionary and functional genomic studies. GreenPhylDB (https://www.greenphyl.org) is a database designed to facilitate the exploration of gene families and homologous relationships among plant genomes, including staple crops critically important for global food security. GreenPhylDB is available since 2007, after the release of the Arabidopsis thaliana and Oryza sativa genomes and has undergone multiple releases. With the number of plant genomes currently available, it becomes challenging to select a single reference for comparative genomics studies but there is still a lack of databases taking advantage several genomes by species for orthology detection. GreenPhylDBv5 introduces the concept of comparative pangenomics by harnessing multiple genome sequences by species. We created 19 pangenes and processed them with other species still relying on one genome. In total, 46 plant species were considered to build gene families and predict their homologous relationships through phylogenetic-based analyses. In addition, since the previous publication, we rejuvenated the website and included a new set of original tools including protein-domain combination, tree topologies searches and a section for users to store their own results in order to support community curation efforts
A look at trails through the pangenome visualization jungle
High-throughput sequencing technologies enabled the production of multiple reference genome sequences for a single species. Comparisons of such sequences showed that there are structural variations between individuals from the same species such as Copy Number Variations (CNV) and Presence Absence Variations (PAV) that can have a significant impact on phenotypic variation in plants and could be suitable for breeding improved crop varieties. Thus, a single reference genome is insufficient to capture all variations.
Pangenomics is an integrative approach which aims to the assessment of such genomic variations and more within a group of closely related individuals. Its definition can focus on the whole repertoire of genes within a group or can include blocks of genomic sequences shared between species. We introduce here a new visualization tool, based on a linear representation: the PANgenome Analyzer with CHromosomal Exploration (PANACHE). It is a web-based application which enables its users to explore a pangenomic reference divided in multiple panchromosomes
Toward community standards in the quest for orthologs
The identification of orthologs—genes pairs descended from a common ancestor through speciation, rather than duplication—has emerged as an essential component of many bioinformatics applications, ranging from the annotation of new genomes to experimental target prioritization. Yet, the development and application of orthology inference methods is hampered by the lack of consensus on source proteomes, file formats and benchmarks. The second ‘Quest for Orthologs' meeting brought together stakeholders from various communities to address these challenges. We report on achievements and outcomes of this meeting, focusing on topics of particular relevance to the research community at large. The Quest for Orthologs consortium is an open community that welcomes contributions from all researchers interested in orthology research and applications. Contact: [email protected]
- …
