54 research outputs found

    Phylogenetic Comparison of F-Box (FBX) Gene Superfamily within the Plant Kingdom Reveals Divergent Evolutionary Histories Indicative of Genomic Drift

    Get PDF
    The emergence of multigene families has been hypothesized as a major contributor to the evolution of complex traits and speciation. To help understand how such multigene families arose and diverged during plant evolution, we examined the phylogenetic relationships of F-Box (FBX) genes, one of the largest and most polymorphic superfamilies known in the plant kingdom. FBX proteins comprise the target recognition subunit of SCF-type ubiquitin-protein ligases, where they individually recruit specific substrates for ubiquitylation. Through the extensive analysis of 10,811 FBX loci from 18 plant species, ranging from the alga Chlamydomonas reinhardtii to numerous monocots and eudicots, we discovered strikingly diverse evolutionary histories. The number of FBX loci varies widely and appears independent of the growth habit and life cycle of land plants, with a little as 198 predicted for Carica papaya to as many as 1350 predicted for Arabidopsis lyrata. This number differs substantially even among closely related species, with evidence for extensive gains/losses. Despite this extraordinary inter-species variation, one subset of FBX genes was conserved among most species examined. Together with evidence of strong purifying selection and expression, the ligases synthesized from these conserved loci likely direct essential ubiquitylation events. Another subset was much more lineage specific, showed more relaxed purifying selection, and was enriched in loci with little or no evidence of expression, suggesting that they either control more limited, species-specific processes or arose from genomic drift and thus may provide reservoirs for evolutionary innovation. Numerous FBX loci were also predicted to be pseudogenes with their numbers tightly correlated with the total number of FBX genes in each species. Taken together, it appears that the FBX superfamily has independently undergone substantial birth/death in many plant lineages, with its size and rapid evolution potentially reflecting a central role for ubiquitylation in driving plant fitness

    Maize haplotype with a helitron-amplified cytidine deaminase gene copy

    Get PDF
    BACKGROUND: Genetic maps are based on recombination of orthologous gene sequences between different strains of the same species. Therefore, it was unexpected to find extensive non-collinearity of genes between different inbred strains of maize. Interestingly, disruption of gene collinearity can be caused among others by a rolling circle-type copy and paste mechanism facilitated by Helitrons. However, understanding the role of this type of gene amplification has been hampered by the lack of finding intact gene sequences within Helitrons. RESULTS: By aligning two haplotypes of the z1C1 locus of maize we found a Helitron that contains two genes, one encoding a putative cytidine deaminase and one a hypothetical protein with part of a 40S ribosomal protein. The cytidine deaminase gene, called ZmCDA3, has been copied from the ZmCDA1 gene on maize chromosome 7 about 4.5 million years ago (mya) after maize was formed by whole-genome duplication from two progenitors. Inbred lines contain gene copies of both progenitors, the ZmCDA1 and ZmCDA2 genes. Both genes diverged when the progenitors of maize split and are derived from the same progenitor as the rice OsCDA1 gene. The ZmCDA1 and ZmCDA2 genes are both transcribed in leaf and seed tissue, but transcripts of the paralogous ZmCDA3 gene have not been found yet. Based on their protein structure the maize CDA genes encode a nucleoside deaminase that is found in bacterial systems and is distinct from the mammalian RNA and/or DNA modifying enzymes. CONCLUSION: The conservation of a paralogous gene sequence encoding a cytidine deaminase gene over 4.5 million years suggests that Helitrons could add functional gene sequences to new chromosomal positions and thereby create new haplotypes. However, the function of such paralogous gene copies cannot be essential because they are not present in all maize strains. However, it is interesting to note that maize hybrids can outperform their inbred parents. Therefore, certain haplotypes may function only in combination with other haplotypes or under specialized environmental conditions

    Sequencing, Mapping, and Analysis of 27,455 Maize Full-Length cDNAs

    Get PDF
    Full-length cDNA (FLcDNA) sequencing establishes the precise primary structure of individual gene transcripts. From two libraries representing 27 B73 tissues and abiotic stress treatments, 27,455 high-quality FLcDNAs were sequenced. The average transcript length was 1.44 kb including 218 bases and 321 bases of 5β€² and 3β€² UTR, respectively, with 8.6% of the FLcDNAs encoding predicted proteins of fewer than 100 amino acids. Approximately 94% of the FLcDNAs were stringently mapped to the maize genome. Although nearly two-thirds of this genome is composed of transposable elements (TEs), only 5.6% of the FLcDNAs contained TE sequences in coding or UTR regions. Approximately 7.2% of the FLcDNAs are putative transcription factors, suggesting that rare transcripts are well-enriched in our FLcDNA set. Protein similarity searching identified 1,737 maize transcripts not present in rice, sorghum, Arabidopsis, or poplar annotated genes. A strict FLcDNA assembly generated 24,467 non-redundant sequences, of which 88% have non-maize protein matches. The FLcDNAs were also assembled with 41,759 FLcDNAs in GenBank from other projects, where semi-strict parameters were used to identify 13,368 potentially unique non-redundant sequences from this project. The libraries, ESTs, and FLcDNA sequences produced from this project are publicly available. The annotated EST and FLcDNA assemblies are available through the maize FLcDNA web resource (www.maizecdna.org)

    A Genome-Wide Characterization of MicroRNA Genes in Maize

    Get PDF
    MicroRNAs (miRNAs) are small, non-coding RNAs that play essential roles in plant growth, development, and stress response. We conducted a genome-wide survey of maize miRNA genes, characterizing their structure, expression, and evolution. Computational approaches based on homology and secondary structure modeling identified 150 high-confidence genes within 26 miRNA families. For 25 families, expression was verified by deep-sequencing of small RNA libraries that were prepared from an assortment of maize tissues. PCR–RACE amplification of 68 miRNA transcript precursors, representing 18 families conserved across several plant species, showed that splice variation and the use of alternative transcriptional start and stop sites is common within this class of genes. Comparison of sequence variation data from diverse maize inbred lines versus teosinte accessions suggest that the mature miRNAs are under strong purifying selection while the flanking sequences evolve equivalently to other genes. Since maize is derived from an ancient tetraploid, the effect of whole-genome duplication on miRNA evolution was examined. We found that, like protein-coding genes, duplicated miRNA genes underwent extensive gene-loss, with ∼35% of ancestral sites retained as duplicate homoeologous miRNA genes. This number is higher than that observed with protein-coding genes. A search for putative miRNA targets indicated bias towards genes in regulatory and metabolic pathways. As maize is one of the principal models for plant growth and development, this study will serve as a foundation for future research into the functional roles of miRNA genes

    Duplication of a well-conserved homeodomain-leucine zipper transcription factor gene in barley generates a copy with more specific functions

    Get PDF
    Three spikelets are formed at each rachis node of the cultivated barley (Hordeum vulgare ssp. vulgare) spike. In two-rowed barley, the central one is fertile and the two lateral ones are sterile, whereas in the six-rowed type, all three are fertile. This characteristic is determined by the allelic constitution at the six-rowed spike 1 (vrs1) locus on the long arm of chromosome 2H, with the recessive allele (vrs1) being responsible for the six-rowed phenotype. The Vrs1 (HvHox1) gene encodes a homeodomain-leucine zipper (HD-Zip) transcription factor. Here, we show that the Vrs1 gene evolved in the Poaceae via a duplication, with a second copy of the gene, HvHox2, present on the short arm of chromosome 2H. Micro-collinearity and polypeptide sequences were both well conserved between HvHox2 and its Poaceae orthologs, but Vrs1 is unique to the barley tribe. The Vrs1 gene product lacks a motif which is conserved among the HvHox2 orthologs. A phylogenetic analysis demonstrated that Vrs1 and HvHox2 must have diverged after the separation of Brachypodium distachyon from the Pooideae and suggests that Vrs1 arose following the duplication of HvHox2, and acquired its new function during the evolution of the barley tribe. HvHox2 was expressed in all organs examined but Vrs1 was predominantly expressed in immature inflorescence

    Systematic Analysis of Sequences and Expression Patterns of Drought-Responsive Members of the HD-Zip Gene Family in Maize

    Get PDF
    Background: Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). Methods and Findings: In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related ciselements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution

    Maize RNA PolIV affects the expression of genes with nearby TE insertions and has a genome-wide repressive impact on transcription

    Get PDF
    Abstract Background RNA-directed DNA methylation (RdDM) is a plant-specific epigenetic process that relies on the RNA polymerase IV (Pol IV) for the production of 24 nucleotide small interfering RNAs (siRNA) that guide the cytosine methylation and silencing of genes and transposons. Zea mays RPD1/RMR6 gene encodes the largest subunit of Pol IV and is required for normal plant development, paramutation, transcriptional repression of certain transposable elements (TEs) and transcriptional regulation of specific alleles. Results In this study we applied a total RNA-Seq approach to compare the B73 and rpd1/rmr6 leaf transcriptomes. Although previous studies indicated that loss of siRNAs production in RdDM mutants provokes a strong loss of CHH DNA methylation but not massive gene or TEs transcriptional activation in both Arabidopsis and maize, our total RNA-Seq analysis of rpd1/rmr6 transcriptome reveals that loss of Pol IV activity causes a global increase in the transcribed fraction of the maize genome. Our results point to the genes with nearby TE insertions as being the most strongly affected by Pol IV-mediated gene silencing. TEs modulation of nearby gene expression is linked to alternative methylation profiles on gene flanking regions, and these profiles are strictly dependent on specific characteristics of the TE member inserted. Although Pol IV is essential for the biogenesis of siRNAs, the genes with associated siRNA loci are less affected by the pol IV mutation. Conclusions This deep and integrated analysis of gene expression, TEs distribution, smallRNA targeting and DNA methylation levels, reveals that loss of Pol IV activity globally affects genome regulation, pointing at TEs as modulator of nearby gene expression and indicating the existence of multiple level epigenetic silencing mechanisms. Our results also suggest a predominant role of the Pol IV-mediated RdDM pathway in genome dominance regulation, and subgenome stability and evolution in maize

    A Single Molecule Scaffold for the Maize Genome

    Get PDF
    About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome-wide, high-resolution optical map of the maize inbred line B73 genome containing >91,000 restriction sites (averaging 1 site/∼23 kb) accrued from mapping genomic DNA molecules. Our optical map comprises 66 contigs, averaging 31.88 Mb in size and spanning 91.5% (2,103.93 Mb/∼2,300 Mb) of the maize genome. A new algorithm was created that considered both optical map and unfinished BAC sequence data for placing 60/66 (2,032.42 Mb) optical map contigs onto the maize iMap. The alignment of optical maps against numerous data sources yielded comprehensive results that proved revealing and productive. For example, gaps were uncovered and characterized within the iMap, the FPC (fingerprinted contigs) map, and the chromosome-wide pseudomolecules. Such alignments also suggested amended placements of FPC contigs on the maize genetic map and proactively guided the assembly of chromosome-wide pseudomolecules, especially within complex genomic regions. Lastly, we think that the full integration of B73 optical maps with the maize iMap would greatly facilitate maize sequence finishing efforts that would make it a valuable reference for comparative studies among cereals, or other maize inbred lines and cultivars
    • …
    corecore