164 research outputs found

    GreenPhylDB: phylogenomic resources for comparative and functional genomics in plants

    Get PDF
    Poster presented at 9th PlantGEM 2011. Istanbul (Turkey), 4-7 May 201

    Structural evolution drives diversification of the large LRR-RLK gene family

    Get PDF
    Cells are continuously exposed to chemical signals that they must discriminate between and respond to appropriately. In embryophytes, the leucine‐rich repeat receptor‐like kinases (LRR‐RLKs) are signal receptors critical in development and defense. LRR‐RLKs have diversified to hundreds of genes in many plant genomes. Although intensively studied, a well‐resolved LRR‐RLK gene tree has remained elusive. To resolve the LRR‐RLK gene tree, we developed an improved gene discovery method based on iterative hidden Markov model searching and phylogenetic inference. We used this method to infer complete gene trees for each of the LRR‐RLK subclades and reconstructed the deepest nodes of the full gene family. We discovered that the LRR‐RLK gene family is even larger than previously thought, and that protein domain gains and losses are prevalent. These structural modifications, some of which likely predate embryophyte diversification, led to misclassification of some LRR‐RLK variants as members of other gene families. Our work corrects this misclassification. Our results reveal ongoing structural evolution generating novel LRR‐RLK genes. These new genes are raw material for the diversification of signaling in development and defense. Our methods also enable phylogenetic reconstruction in any large gene family

    Survey of Branch Support Methods Demonstrates Accuracy, Power, and Robustness of Fast Likelihood-based Approximation Schemes

    Get PDF
    Phylogenetic inference and evaluating support for inferred relationships is at the core of many studies testing evolutionary hypotheses. Despite the popularity of nonparametric bootstrap frequencies and Bayesian posterior probabilities, the interpretation of these measures of tree branch support remains a source of discussion. Furthermore, both methods are computationally expensive and become prohibitive for large data sets. Recent fast approximate likelihood-based measures of branch supports (approximate likelihood ratio test [aLRT] and Shimodaira-Hasegawa [SH]-aLRT) provide a compelling alternative to these slower conventional methods, offering not only speed advantages but also excellent levels of accuracy and power. Here we propose an additional method: a Bayesian-like transformation of aLRT (aBayes). Considering both probabilistic and frequentist frameworks, we compare the performance of the three fast likelihood-based methods with the standard bootstrap (SBS), the Bayesian approach, and the recently introduced rapid bootstrap. Our simulations and real data analyses show that with moderate model violations, all tests are sufficiently accurate, but aLRT and aBayes offer the highest statistical power and are very fast. With severe model violations aLRT, aBayes and Bayesian posteriors can produce elevated false-positive rates. With data sets for which such violation can be detected, we recommend using SH-aLRT, the nonparametric version of aLRT based on a procedure similar to the Shimodaira-Hasegawa tree selection. In general, the SBS seems to be excessively conservative and is much slower than our approximate likelihood-based method

    Application du systÚme GenFam à la réponse au stress des plantes : intégration de l'identification d'éléments cis spécifiques

    Get PDF
    UMR AGAP - Ă©quipe ID - IntĂ©gration des donnĂ©esGenFam est un systĂšme intĂ©gratif d'analyse de familles de gĂšnes. Ce systĂšme permet (i) de crĂ©er des familles de gĂšnes de gĂ©nomes complets, (ii) d’exĂ©cuter une analyse phylogĂ©nĂ©tique de cette famille Ă  travers le gestionnaire de workflows Galaxy afin de dĂ©finir les relations d'homologie, (iii) d'Ă©tudier des Ă©vĂ©nements Ă©volutifs Ă  partir de blocs de syntĂ©nie prĂ©calculĂ©es avec le workflow SynMap de la plateforme de gĂ©nomique comparative (CoGe) et (iv) d’intĂ©grer ces rĂ©sultats dans l'interface de visualisation synthĂ©tique. La premiĂšre application de GenFam est d’identifier des gĂšnes candidats pour la tolĂ©rance aux stress environnementaux. Il nĂ©cessite de mettre en Ă©vidence la prĂ©sence de sĂ©quences rĂ©gulatrices cis spĂ©cifiques de la rĂ©ponse aux stress (de type ABRE, DRE). Dans ce contexte, nous avons besoin d’intĂ©grer de nouveaux outils afin de dĂ©couvrir et chercher des sites de fixation de facteurs de transcription (Transcription Factor Binding Sites, TFBS) dans les sĂ©quences promotrices des gĂšnes membre de la famille Ă©tudiĂ©e. Ce workflow Galaxy va, d'une part, sĂ©lectionner les rĂ©gions flanquantes en 5' ou en 3' des gĂšnes d'intĂ©rĂȘts selon le choix de l'utilisateur. D'autre part, les rĂ©gions flanquantes sont analysĂ©es afin de dĂ©couvrir et rechercher les motifs de sĂ©quences rĂ©gulatrices cis spĂ©cifiques de la rĂ©ponse aux stress avec des mĂ©thodes complĂ©mentaires comme MEME, STIF, PHYME. Ces rĂ©sultats ainsi que l’annotation fonctionnelle des gĂšnes Ă©tiquetĂ©s comme Ă©tant impliquĂ©s dans la rĂ©ponse au stress seront intĂ©grĂ©s dans l’interface de visualisation. Ce travail doit permettre une rĂ©flexion sur la notion d'orthologie fonctionnelle et effectuer une recherche translationnelle depuis les espĂšces modĂšles jusqu'aux espĂšces d'intĂ©rĂȘt agronomique (i.e identifier des gĂšnes candidats pour la rĂ©ponse au stress du cafĂ©ier Ă  partir d'informations fonctionnelles connues chez Arabidopsis)
    • 

    corecore