24 research outputs found

    An Automated Method for Rapid Identification of Putative Gene Family Members in Plants

    Get PDF
    BACKGROUND: Gene duplication events have played a significant role in genome evolution, particularly in plants. Exhaustive searches for all members of a known gene family as well as the identification of new gene families has become increasingly important. Subfunctionalization via changes in regulatory sequences following duplication (adaptive selection) appears to be a common mechanism of evolution in plants and can be accompanied by purifying selection on the coding region. Such negative selection can be detected by a bias toward synonymous over nonsynonymous substitutions. However, the process of identifying this bias requires many steps usually employing several different software programs. We have simplified the process and significantly shortened the time required by condensing many steps into a few scripts or programs to rapidly identify putative gene family members beginning with a single query sequence. RESULTS: In this report we 1) describe the software tools (SimESTs, PCAT, and SCAT) developed to automate the gene family identification, 2) demonstrate the validity of the method by correctly identifying 3 of 4 PAL gene family members from Arabidopsis using EST data alone, 3) identify 2 to 6 CAD gene family members from Glycine max (previously unidentified), and 4) identify 2 members of a putative Glycine max gene family previously unidentified in any plant species. CONCLUSION: Gene families in plants, particularly that subset where purifying selection has occurred in the coding region, can be identified quickly and easily by integrating our software tools and commonly available contig assembly and ORF identification programs

    Validation of an NSP-based (negative selection pattern) gene family identification strategy

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene family identification from ESTs can be a valuable resource for analysis of genome evolution but presents unique challenges in organisms for which the entire genome is not yet sequenced. We have developed a novel gene family identification method based on negative selection patterns (NSP) between family members to screen EST-generated contigs. This strategy was tested on five known gene families in Arabidopsis to see if individual paralogs could be identified with accuracy from EST data alone when compared to the actual gene sequences in this fully sequenced genome.</p> <p>Results</p> <p>The NSP method uniquely identified family members in all the gene families tested. Two members of the FtsH gene family, three members each of the PAL, RF1, and ribosomal L6 gene families, and four members of the CAD gene family were correctly identified. Additionally all ESTs from the representative contigs when checked against MapViewer data successfully identify the gene locus predicted.</p> <p>Conclusion</p> <p>We demonstrate the effectiveness of the NSP strategy in identifying specific gene family members in Arabidopsis using only EST data and we describe how this strategy can be used to identify many gene families in agronomically important crop species where they are as yet undiscovered.</p

    In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae

    Get PDF
    The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes

    Evolution of Stress-Regulated Gene Expression in Duplicate Genes of Arabidopsis thaliana

    Get PDF
    Due to the selection pressure imposed by highly variable environmental conditions, stress sensing and regulatory response mechanisms in plants are expected to evolve rapidly. One potential source of innovation in plant stress response mechanisms is gene duplication. In this study, we examined the evolution of stress-regulated gene expression among duplicated genes in the model plant Arabidopsis thaliana. Key to this analysis was reconstructing the putative ancestral stress regulation pattern. By comparing the expression patterns of duplicated genes with the patterns of their ancestors, duplicated genes likely lost and gained stress responses at a rapid rate initially, but the rate is close to zero when the synonymous substitution rate (a proxy for time) is >∼0.8. When considering duplicated gene pairs, we found that partitioning of putative ancestral stress responses occurred more frequently compared to cases of parallel retention and loss. Furthermore, the pattern of stress response partitioning was extremely asymmetric. An analysis of putative cis-acting DNA regulatory elements in the promoters of the duplicated stress-regulated genes indicated that the asymmetric partitioning of ancestral stress responses are likely due, at least in part, to differential loss of DNA regulatory elements; the duplicated genes losing most of their stress responses were those that had lost more of the putative cis-acting elements. Finally, duplicate genes that lost most or all of the ancestral responses are more likely to have gained responses to other stresses. Therefore, the retention of duplicates that inherit few or no functions seems to be coupled to neofunctionalization. Taken together, our findings provide new insight into the patterns of evolutionary changes in gene stress responses after duplication and lay the foundation for testing the adaptive significance of stress regulatory changes under highly variable biotic and abiotic environments

    Cognitive effects of adjunctive perampanel for partial-onset seizures: A randomized trial

    Get PDF
    Objective Assess cognitive effects of adjunctive perampanel in adolescents. Methods In this double-blind study (ClinicalTrials.gov identifier: NCT01161524), patients aged 12 to <18 years with partial-onset seizures despite receiving 1–3 antiepileptic drugs were randomized (2:1) to perampanel or placebo. Perampanel was increased weekly in 2-mg increments to 8–12 mg/day (6-week titration; 13-week maintenance). Changes in neuropsychological outcomes were assessed at end of maintenance: Cognitive Drug Research (CDR) System Global Cognition Score (primary end point), five CDR System domain T-scores (secondary end points), letter fluency, category fluency, and Lafayette Grooved Pegboard Test (LGPT). Results One hundred thirty-three patients were randomized. In the full analysis set, there were no differences of perampanel (n = 79) vs. placebo (n = 44) in CDR System Global Cognition Score (least squares mean change, βˆ’0.6 vs. 1.6; p = 0.145), Quality of Working Memory (1.1 vs. 2.0; p = 0.579), or Power of Attention (βˆ’6.9 vs. βˆ’2.7; p = 0.219). There were small differences with perampanel vs. placebo in other CDR System domains: improvements in Quality of Episodic Memory (3.0 vs. βˆ’1.2; p = 0.012), and worsening in Continuity of Attention (βˆ’3.3 vs. 1.6; p = 0.013) and Speed of Memory (0.3 vs. 7.0; p = 0.032). Letter fluency, category fluency, and LGPT were not significantly different between groups. The most frequent adverse events with perampanel were dizziness (30.6%) and somnolence (15.3%). Significance Perampanel did not differ from placebo in the global cognitive score, two of five subdomains, and four other cognitive measures. Perampanel was worse on two and better on one subdomain
    corecore