Search CORE

205 research outputs found

The fate of Arabidopsis thaliana homeologous CNSs and their motifs in the Paleohexaploid Brassica rapa.

Author: Freeling Michael
Pires J
Subramaniam Sabarinath
Wang Xiaowu
Publication venue: eScholarship, University of California
Publication date: 01/01/2013
Field of study

Following polyploidy, duplicate genes are often deleted, and if they are not, then duplicate regulatory regions are sometimes lost. By what mechanism is this loss and what is the chance that such a loss removes function? To explore these questions, we followed individual Arabidopsis thaliana-A. thaliana conserved noncoding sequences (CNSs) into the Brassica ancestor, through a paleohexaploidy and into Brassica rapa. Thus, a single Brassicaceae CNS has six potential orthologous positions in B. rapa; a single Arabidopsis CNS has three potential homeologous positions. We reasoned that a CNS, if present on a singlet Brassica gene, would be unlikely to lose function compared with a more redundant CNS, and this is the case. Redundant CNSs go nondetectable often. Using this logic, each mechanism of CNS loss was assigned a metric of functionality. By definition, proved deletions do not function as sequence. Our results indicated that CNSs that go nondetectable by base substitution or large insertion are almost certainly still functional (redundancy does not matter much to their detectability frequency), whereas those lost by inferred deletion or indels are approximately 75% likely to be nonfunctional. Overall, an average nondetectable, once-redundant CNS more than 30 bp in length has a 72% chance of being nonfunctional, and that makes sense because 97% of them sort to a molecular mechanism with deletion in its description, but base substitutions do cause loss. Similarly, proved-functional G-boxes go undetectable by deletion 82% of the time. Fractionation mutagenesis is a procedure that uses polyploidy as a mutagenic agent to genetically alter RNA expression profiles, and then to construct testable hypotheses as to the function of the lost regulatory site. We show fractionation mutagenesis to be a deletion machine in the Brassica lineage

PubMed Central

eScholarship - University of California

Response to Birchler: Heterosis is partly a sub-problem of quantitative genetics, but its solution may depend on understanding the mysterious genetics of quantity

Author: Freeling Michael
Publication venue: Maydica
Publication date: 29/11/2012
Field of study

CREA Journals (Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria)

Genes Identified by Visible Mutant Phenotypes Show Increased Bias toward One of Two Subgenomes of Maize

Author: Freeling Michael
Schnable James C.
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Not all genes are created equal. Despite being supported by sequence conservation and expression data, knockout homozygotes of many genes show no visible effects, at least under laboratory conditions. We have identified a set of maize (Zea mays L.) genes which have been the subject of a disproportionate share of publications recorded at MaizeGDB. We manually anchored these “classical” maize genes to gene models in the B73 reference genome, and identified syntenic orthologs in other grass genomes. In addition to proofing the most recent version 2 maize gene models, we show that a subset of these genes, those that were identified by morphological phenotype prior to cloning, are retained at syntenic locations throughout the grasses at much higher levels than the average expressed maize gene, and are preferentially found on the maize1 subgenome even with a duplicate copy is still retained on the opposite subgenome. Maize1 is the subgenome that experienced less gene loss following the whole genome duplication in maize lineage 5–12 million years ago and genes located on this subgenome tend to be expressed at higher levels in modern maize. Links to the web based software that supported our syntenic analyses in the grasses should empower further research and support teaching involving the history of maize genetic research. Our findings exemplify the concept of “grasses as a single genetic system,” where what is learned in one grass may be applied to another

CiteSeerX

Public Library of Science (PLOS)

Crossref

DigitalCommons@University of Nebraska

Directory of Open Access Journals

PubMed Central

Origin and evolution of the octoploid strawberry genome.

Author: Acharya Charlotte B
Alger Elizabeth I
Baruch Kobi
Ben-Zvi Gil
Bird Kevin A
Brodt Avital
Childs Kevin L
Cole Glenn S
Colle Marivi
Edger Patrick P
Freeling Michael
Hardigan Michael A
Jiang Ning
Knapp Steven J
Lyons Eric
McKain Michael R
Mower Jeffrey P
Nelson Andrew DL
Ou Shujun
Poorten Thomas J
Pumplin Nathan
Puzey Joshua R
Shiue Lily
Smith Ronald D
Swale Thomas
Teresi Scott J
VanBuren Robert
Wai Ching Man
Yocca Alan E
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Cultivated strawberry emerged from the hybridization of two wild octoploid species, both descendants from the merger of four diploid progenitor species into a single nucleus more than 1 million years ago. Here we report a near-complete chromosome-scale assembly for cultivated octoploid strawberry (Fragaria × ananassa) and uncovered the origin and evolutionary processes that shaped this complex allopolyploid. We identified the extant relatives of each diploid progenitor species and provide support for the North American origin of octoploid strawberry. We examined the dynamics among the four subgenomes in octoploid strawberry and uncovered the presence of a single dominant subgenome with significantly greater gene content, gene expression abundance, and biased exchanges between homoeologous chromosomes, as compared with the other subgenomes. Pathway analysis showed that certain metabolomic and disease-resistance traits are largely controlled by the dominant subgenome. These findings and the reference genome should serve as a powerful platform for future evolutionary studies and enable molecular breeding in strawberry

Crossref

eScholarship - University of California

UA Institutional Repository (Univ. of Alabama)

Dose–Sensitivity, Conserved Non-Coding Sequences, and Duplicate Gene Retention through Multiple Tetraploidies in the Grasses

Author: Freeling Michael
Pedersen Brent S.
Schnable James C.
Subramaniam Sabarinath
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2011
Field of study

Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein–protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein–protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose–sensitive protein–DNA interactions between the regulatory regions of CNS-rich genes – nicknamed bigfoot genes – and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy

Crossref

DigitalCommons@University of Nebraska

Directory of Open Access Journals

PubMed Central

Frontiers - Publisher Connector

Escape from Preferential Retention Following Repeated Whole Genome Duplications in Plants

Author: Freeling Michael
Pires J. Chris
Schnable James C.
Wang Xiaowu
Publication venue: Frontiers Research Foundation
Publication date: 01/01/2012
Field of study

The well supported gene dosage hypothesis predicts that genes encoding proteins engaged in dose–sensitive interactions cannot be reduced back to single copies once all interacting partners are simultaneously duplicated in a whole genome duplication. The genomes of extant flowering plants are the result of many sequential rounds of whole genome duplication, yet the fraction of genomes devoted to encoding complex molecular machines does not increase as fast as expected through multiple rounds of whole genome duplications. Using parallel interspecies genomic comparisons in the grasses and crucifers, we demonstrate that genes retained as duplicates following a whole genome duplication have only a 50% chance of being retained as duplicates in a second whole genome duplication. Genes which fractionated to a single copy following a second whole genome duplication tend to be the member of a gene pair with less complex promoters, lower levels of expression, and to be under lower levels of purifying selection. We suggest the copy with lower levels of expression and less purifying selection contributes less to effective gene-product dosage and therefore is under less dosage constraint in future whole genome duplications, providing an explanation for why flowering plant genomes are not overrun with subunits of large dose–sensitive protein complexes

Crossref

DigitalCommons@University of Nebraska

Directory of Open Access Journals

PubMed Central

Frontiers - Publisher Connector

Automated Conserved Non-Coding Sequence (CNS) Discovery Reveals Differences in Gene Content and Promoter Evolution among Grasses

Author: Freeling Michael
Pedersen Brent S.
Schnable James C.
Turco Gina Marie
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2013
Field of study

Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by \u3e12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize

Crossref

DigitalCommons@University of Nebraska

Directory of Open Access Journals

Frontiers - Publisher Connector

PubMed Central

Recommended from our members

Fractionation and subfunctionalization following genome duplications : mechanisms that drive gene content and their consequences

Author: Fowler John E.
Freeling Michael
Scanlon Michael J.
Publication venue: 'Elsevier BV'
Publication date
Field of study

A gene's duplication relaxes selection. Loss of duplicate, low-function DNA (fractionation) sometimes follows, mostly by deletion in plants, but mostly via the pseudogene pathway in fish and other clades with smaller population sizes. Subfunctionalization — the founding term of the Xfunctionalization lexicon — while not the general cause of differences in duplicate gene retention, becomes primary as the number of a gene's cis — regulatory sites increases. Balanced gene drive explains retention for the average gene. Both maintenance-of-balance and subfunctionalization drive gene content nonrandomly, and currently fall outside of our accepted Theory of Evolution. The ‘typical’ mutation encountered by a gene duplicate is not a neutral loss-of-function; dominant mutations (Muller's lexicon; these are not neutral) abound, and confound X functionalization terms like ‘neofunctionalization’. Confusion of words may cause confusion of thought. As with many plants, fish tetraploidies provide a higher throughput surrogate-genetic method to infer function from human and other vertebrate ENCODE-like regulatory sites. ‘Not only have studies on polyploid fractionation led to reconsiderations of fundamental evolutionary theory, but fractionation in polyploids permits higher-throughput comparative genomic experiments using ENCODE-like data yielding the logical precision expected of genetic analyses.’Keywords: The Theory of Evolution, fractionation, Gene Balance Hypothesis, genome dominance, Whole genome duplication, subfunctionalization, dominant mutatio

ScholarsArchive@OSU

Genome-Wide Analysis of Syntenic Gene Deletion in the Grasses

Author: Alexandrov
Barker
Baucom
Bikard
Blanc
Buggs
Chang
De Bodt
Dehal
Eric Lyons
Flagel
Freeling
Freeling
Freeling
Gaeta
Goff
Grass Phylogeny Working Group
Harris
James C. Schnable
Lai
Langham
Larkin
Lynch
Lyons
Mallet
Massa
Michael Freeling
Mizuta
Osborn
Ouyang
Paterson
Paterson
Prasad
Ramsey
Rieseberg
Salse
Sankoff
Scannell
Schnable
Schnable
Schnable
Schnable
Seoighe
Soderlund
Soltis
Soltis
Springer
Swanson-Wagner
Swigoňová
Sémon
Tang
Tang
The International Brachypodium Initiative
Thomas
Tuskan
van de Peer
Wang
Woodhouse
Xiong
Yang
Yu
Publication venue: Oxford University Press
Publication date: 23/01/2012
Field of study

The grasses, Poaceae, are one of the largest and most successful angiosperm families. Like many radiations of flowering plants, the divergence of the major grass lineages was preceded by a whole-genome duplication (WGD), although these events are not rare for flowering plants. By combining identification of syntenic gene blocks with measures of gene pair divergence and different frequencies of ancient gene loss, we have separated the two subgenomes present in modern grasses. Reciprocal loss of duplicated genes or genomic regions has been hypothesized to reproductively isolate populations and, thus, speciation. However, in contrast to previous studies in yeast and teleost fishes, we found very little evidence of reciprocal loss of homeologous genes between the grasses, suggesting that post-WGD gene loss may not be the cause of the grass radiation. The sets of homeologous and orthologous genes and predicted locations of deleted genes identified in this study, as well as links to the CoGe comparative genomics web platform for analyzing pan-grass syntenic regions, are provided along with this paper as a resource for the grass genetics community

Crossref

DigitalCommons@University of Nebraska

PubMed Central

qTeller: a tool for comparative multi-genomic gene expression analysis

Author: Andorf Carson M.
Freeling Michael
Portwood John L., II
Schnable James
Schott David
Sen Shatabdi
Walley Justin W.
Woodhouse Margaret R.
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2022
Field of study

Motivation: Over the last decade, RNA-Seq whole-genome sequencing has become a widely used method for measuring and understanding transcriptome-level changes in gene expression. Since RNA-Seq is relatively inexpensive, it can be used on multiple genomes to evaluate gene expression across many different conditions, tissues and cell types. Although many tools exist to map and compare RNA-Seq at the genomics level, few web-based tools are dedicated to making data generated for individual genomic analysis accessible and reusable at a gene-level scale for comparative analysis between genes, across different genomes and meta-analyses. Results: To address this challenge, we revamped the comparative gene expression tool qTeller to take advantage of the growing number of public RNA-Seq datasets. qTeller allows users to evaluate gene expression data in a defined genomic interval and also perform two-gene comparisons across multiple user-chosen tissues. Though previously unpublished, qTeller has been cited extensively in the scientific literature, demonstrating its importance to researchers. Our new version of qTeller now supports multiple genomes for intergenomic comparisons, and includes capabilities for both mRNA and protein abundance datasets. Other new features include support for additional data formats, modernized interface and back-end database and an optimized framework for adoption by other organisms’ databases. Availability and implementation: The source code for qTeller is open-source and available through GitHub (https:// github.com/Maize-Genetics-and-Genomics-Database/qTeller). A maize instance of qTeller is available at the Maize Genetics and Genomics database (MaizeGDB) (https://qteller.maizegdb.org/), where we have mapped over 200 unique datasets from GenBank across 27 maize genomes

DigitalCommons@University of Nebraska