24 research outputs found
An Automated Method for Rapid Identification of Putative Gene Family Members in Plants
BACKGROUND: Gene duplication events have played a significant role in genome evolution, particularly in plants. Exhaustive searches for all members of a known gene family as well as the identification of new gene families has become increasingly important. Subfunctionalization via changes in regulatory sequences following duplication (adaptive selection) appears to be a common mechanism of evolution in plants and can be accompanied by purifying selection on the coding region. Such negative selection can be detected by a bias toward synonymous over nonsynonymous substitutions. However, the process of identifying this bias requires many steps usually employing several different software programs. We have simplified the process and significantly shortened the time required by condensing many steps into a few scripts or programs to rapidly identify putative gene family members beginning with a single query sequence. RESULTS: In this report we 1) describe the software tools (SimESTs, PCAT, and SCAT) developed to automate the gene family identification, 2) demonstrate the validity of the method by correctly identifying 3 of 4 PAL gene family members from Arabidopsis using EST data alone, 3) identify 2 to 6 CAD gene family members from Glycine max (previously unidentified), and 4) identify 2 members of a putative Glycine max gene family previously unidentified in any plant species. CONCLUSION: Gene families in plants, particularly that subset where purifying selection has occurred in the coding region, can be identified quickly and easily by integrating our software tools and commonly available contig assembly and ORF identification programs
Validation of an NSP-based (negative selection pattern) gene family identification strategy
<p>Abstract</p> <p>Background</p> <p>Gene family identification from ESTs can be a valuable resource for analysis of genome evolution but presents unique challenges in organisms for which the entire genome is not yet sequenced. We have developed a novel gene family identification method based on negative selection patterns (NSP) between family members to screen EST-generated contigs. This strategy was tested on five known gene families in Arabidopsis to see if individual paralogs could be identified with accuracy from EST data alone when compared to the actual gene sequences in this fully sequenced genome.</p> <p>Results</p> <p>The NSP method uniquely identified family members in all the gene families tested. Two members of the FtsH gene family, three members each of the PAL, RF1, and ribosomal L6 gene families, and four members of the CAD gene family were correctly identified. Additionally all ESTs from the representative contigs when checked against MapViewer data successfully identify the gene locus predicted.</p> <p>Conclusion</p> <p>We demonstrate the effectiveness of the NSP strategy in identifying specific gene family members in Arabidopsis using only EST data and we describe how this strategy can be used to identify many gene families in agronomically important crop species where they are as yet undiscovered.</p
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae
The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes
Evolution of Stress-Regulated Gene Expression in Duplicate Genes of Arabidopsis thaliana
Due to the selection pressure imposed by highly variable environmental conditions, stress sensing and regulatory response mechanisms in plants are expected to evolve rapidly. One potential source of innovation in plant stress response mechanisms is gene duplication. In this study, we examined the evolution of stress-regulated gene expression among duplicated genes in the model plant Arabidopsis thaliana. Key to this analysis was reconstructing the putative ancestral stress regulation pattern. By comparing the expression patterns of duplicated genes with the patterns of their ancestors, duplicated genes likely lost and gained stress responses at a rapid rate initially, but the rate is close to zero when the synonymous substitution rate (a proxy for time) is >βΌ0.8. When considering duplicated gene pairs, we found that partitioning of putative ancestral stress responses occurred more frequently compared to cases of parallel retention and loss. Furthermore, the pattern of stress response partitioning was extremely asymmetric. An analysis of putative cis-acting DNA regulatory elements in the promoters of the duplicated stress-regulated genes indicated that the asymmetric partitioning of ancestral stress responses are likely due, at least in part, to differential loss of DNA regulatory elements; the duplicated genes losing most of their stress responses were those that had lost more of the putative cis-acting elements. Finally, duplicate genes that lost most or all of the ancestral responses are more likely to have gained responses to other stresses. Therefore, the retention of duplicates that inherit few or no functions seems to be coupled to neofunctionalization. Taken together, our findings provide new insight into the patterns of evolutionary changes in gene stress responses after duplication and lay the foundation for testing the adaptive significance of stress regulatory changes under highly variable biotic and abiotic environments
Cognitive effects of adjunctive perampanel for partial-onset seizures: A randomized trial
Objective
Assess cognitive effects of adjunctive perampanel in adolescents.
Methods
In this double-blind study (ClinicalTrials.gov identifier: NCT01161524), patients aged 12 to <18 years with partial-onset seizures despite receiving 1β3 antiepileptic drugs were randomized (2:1) to perampanel or placebo. Perampanel was increased weekly in 2-mg increments to 8β12 mg/day (6-week titration; 13-week maintenance). Changes in neuropsychological outcomes were assessed at end of maintenance: Cognitive Drug Research (CDR) System Global Cognition Score (primary end point), five CDR System domain T-scores (secondary end points), letter fluency, category fluency, and Lafayette Grooved Pegboard Test (LGPT).
Results
One hundred thirty-three patients were randomized. In the full analysis set, there were no differences of perampanel (n = 79) vs. placebo (n = 44) in CDR System Global Cognition Score (least squares mean change, β0.6 vs. 1.6; p = 0.145), Quality of Working Memory (1.1 vs. 2.0; p = 0.579), or Power of Attention (β6.9 vs. β2.7; p = 0.219). There were small differences with perampanel vs. placebo in other CDR System domains: improvements in Quality of Episodic Memory (3.0 vs. β1.2; p = 0.012), and worsening in Continuity of Attention (β3.3 vs. 1.6; p = 0.013) and Speed of Memory (0.3 vs. 7.0; p = 0.032). Letter fluency, category fluency, and LGPT were not significantly different between groups. The most frequent adverse events with perampanel were dizziness (30.6%) and somnolence (15.3%).
Significance
Perampanel did not differ from placebo in the global cognitive score, two of five subdomains, and four other cognitive measures. Perampanel was worse on two and better on one subdomain