72 research outputs found
Multiple FLC haplotypes defined by independent cis-regulatory variation underpin life history diversity in Arabidopsis thaliana
Relating molecular variation to phenotypic diversity is a central goal in evolutionary biology. In Arabidopsis thaliana, FLOWERING LOCUS C (FLC) is a major determinant of variation in vernalization—the acceleration of flowering by prolonged cold. Here, through analysis of 1307 A. thaliana accessions, we identify five predominant FLC haplotypes defined by noncoding sequence variation. Genetic and transgenic experiments show that they are functionally distinct, varying in FLC expression level and rate of epigenetic silencing. Allelic heterogeneity at this single locus accounts for a large proportion of natural variation in vernalization that contributes to adaptation of A. thaliana
Co-Variation between Seed Dormancy, Growth Rate and Flowering Time Changes with Latitude in Arabidopsis thaliana
Life-history traits controlling the duration and timing of developmental phases in the life cycle jointly determine fitness. Therefore, life-history traits studied in isolation provide an incomplete view on the relevance of life-cycle variation for adaptation. In this study, we examine genetic variation in traits covering the major life history events of the annual species Arabidopsis thaliana: seed dormancy, vegetative growth rate and flowering time. In a sample of 112 genotypes collected throughout the European range of the species, both seed dormancy and flowering time follow a latitudinal gradient independent of the major population structure gradient. This finding confirms previous studies reporting the adaptive evolution of these two traits. Here, however, we further analyze patterns of co-variation among traits. We observe that co-variation between primary dormancy, vegetative growth rate and flowering time also follows a latitudinal cline. At higher latitudes, vegetative growth rate is positively correlated with primary dormancy and negatively with flowering time. In the South, this trend disappears. Patterns of trait co-variation change, presumably because major environmental gradients shift with latitude. This pattern appears unrelated to population structure, suggesting that changes in the coordinated evolution of major life history traits is adaptive. Our data suggest that A. thaliana provides a good model for the evolution of trade-offs and their genetic basis.<br
The Impact of Arabidopsis on Human Health: Diversifying Our Portfolio
Studies of the model plant Arabidopsis thaliana may seem to have little impact on advances in medical research, yet a survey of the scientific literature shows that this is a misconception. Many discoveries with direct relevance to human health and disease have been elaborated using Arabidopsis, and several processes important to human biology are more easily studied in this versatile model plant
The scale of population structure in Arabidopsis thaliana
The population structure of an organism reflects its evolutionary history and influences its evolutionary trajectory. It constrains the combination of genetic diversity and reveals patterns of past gene flow. Understanding it is a prerequisite for detecting genomic regions under selection, predicting the effect of population disturbances, or modeling gene flow. This paper examines the detailed global population structure of Arabidopsis thaliana. Using a set of 5,707 plants collected from around the globe and genotyped at 149 SNPs, we show that while A. thaliana as a species self-fertilizes 97% of the time, there is considerable variation among local groups. This level of outcrossing greatly limits observed heterozygosity but is sufficient to generate considerable local haplotypic diversity. We also find that in its native Eurasian range A. thaliana exhibits continuous isolation by distance at every geographic scale without natural breaks corresponding to classical notions of populations. By contrast, in North America, where it exists as an exotic species, A. thaliana exhibits little or no population structure at a continental scale but local isolation by distance that extends hundreds of km. This suggests a pattern for the development of isolation by distance that can establish itself shortly after an organism fills a new habitat range. It also raises questions about the general applicability of many standard population genetics models. Any model based on discrete clusters of interchangeable individuals will be an uneasy fit to organisms like A. thaliana which exhibit continuous isolation by distance on many scales
DNA methylation variation in Arabidopsis has a genetic basis and shows evidence of local adaptation
Epigenome modulation in response to the environment potentially provides a
mechanism for organisms to adapt, both within and between generations. However,
neither the extent to which this occurs, nor the molecular mechanisms involved
are known. Here we investigate DNA methylation variation in Swedish Arabidopsis
thaliana accessions grown at two different temperatures. Environmental effects
on DNA methylation were limited to transposons, where CHH methylation was found
to increase with temperature. Genome-wide association mapping revealed that the
extensive CHH methylation variation was strongly associated with genetic
variants in both cis and trans, including a major trans-association close to
the DNA methyltransferase CMT2. Unlike CHH methylation, CpG gene body
methylation (GBM) on the coding region of genes was not affected by growth
temperature, but was instead strongly correlated with the latitude of origin.
Accessions from colder regions had higher levels of GBM for a significant
fraction of the genome, and this was correlated with elevated transcription
levels for the genes affected. Genome-wide association mapping revealed that
this effect was largely due to trans-acting loci, a significant fraction of
which showed evidence of local adaptation. These findings constitute the first
direct link between DNA methylation and adaptation to the environment, and
provide a basis for further dissecting how environmentally driven and
genetically determined epigenetic variation interact and influence organismal
fitness.Comment: 38 pages 4 figure
PoolHap: Inferring Haplotype Frequencies from Pooled Samples by Next Generation Sequencing
With the advance of next-generation sequencing (NGS) technologies, increasingly ambitious applications are becoming feasible. A particularly powerful one is the sequencing of polymorphic, pooled samples. The pool can be naturally occurring, as in the case of multiple pathogen strains in a blood sample, multiple types of cells in a cancerous tissue sample, or multiple isoforms of mRNA in a cell. In these cases, it's difficult or impossible to partition the subtypes experimentally before sequencing, and those subtype frequencies must hence be inferred. In addition, investigators may occasionally want to artificially pool the sample of a large number of individuals for reasons of cost-efficiency, e. g., when carrying out genetic mapping using bulked segregant analysis. Here we describe PoolHap, a computational tool for inferring haplotype frequencies from pooled samples when haplotypes are known. The key insight into why PoolHap works is that the large number of SNPs that come with genome-wide coverage can compensate for the uneven coverage across the genome. The performance of PoolHap is illustrated and discussed using simulated and real data. We show that PoolHap is able to accurately estimate the proportions of haplotypes with less than 2% error for 34-strain mixtures with 2X total coverage Arabidopsis thaliana whole genome polymorphism data. This method should facilitate greater biological insight into heterogeneous samples that are difficult or impossible to isolate experimentally. Software and users manual are freely available at http://arabidopsis.gmi.oeaw.ac.at/quan/poolhap/
Strong signature of natural selection within an FHIT intron implicated in prostate cancer risk
Previously, a candidate gene linkage approach on brother pairs affected with prostate cancer identified a locus of prostate cancer susceptibility at D3S1234 within the fragile histidine triad gene (FHIT), a tumor suppressor that induces apoptosis. Subsequent association tests on 16 SNPs spanning approximately 381 kb surrounding D3S1234 in Americans of European descent revealed significant evidence of association for a single SNP within intron 5 of FHIT. In the current study, resequencing and genotyping within a 28.5 kb region surrounding this SNP further delineated the association with prostate cancer risk to a 15 kb region. Multiple SNPs in sequences under evolutionary constraint within intron 5 of FHIT defined several related haplotypes with an increased risk of prostate cancer in European-Americans. Strong associations were detected for a risk haplotype defined by SNPs 138543, 142413, and 152494 in all cases (Pearson's χ2 = 12.34, df 1, P = 0.00045) and for the homozygous risk haplotype defined by SNPs 144716, 142413, and 148444 in cases that shared 2 alleles identical by descent with their affected brothers (Pearson's χ2 = 11.50, df 1, P = 0.00070). In addition to highly conserved sequences encompassing SNPs 148444 and 152413, population studies revealed strong signatures of natural selection for a 1 kb window covering the SNP 144716 in two human populations, the European American (π = 0.0072, Tajima's D= 3.31, 14 SNPs) and the Japanese (π = 0.0049, Fay & Wu's H = 8.05, 14 SNPs), as well as in chimpanzees (Fay & Wu's H = 8.62, 12 SNPs). These results strongly support the involvement of the FHIT intronic region in an increased risk of prostate cancer. © 2008 Ding et al
Major-Effect Alleles at Relatively Few Loci Underlie Distinct Vernalization and Flowering Variation in Arabidopsis Accessions
We have explored the genetic basis of variation in vernalization requirement and
response in Arabidopsis accessions, selected on the basis of their phenotypic
distinctiveness. Phenotyping of F2 populations in different environments, plus
fine mapping, indicated possible causative genes. Our data support the
identification of FRI and FLC as candidates
for the major-effect QTL underlying variation in vernalization response, and
identify a weak FLC allele, caused by a Mutator-like
transposon, contributing to flowering time variation in two N. American
accessions. They also reveal a number of additional QTL that contribute to
flowering time variation after saturating vernalization. One of these was the
result of expression variation at the FT locus. Overall, our
data suggest that distinct phenotypic variation in the vernalization and
flowering response of Arabidopsis accessions is accounted for by variation that
has arisen independently at relatively few major-effect loci
High-Resolution Analysis of Parent-of-Origin Allelic Expression in the Arabidopsis Endosperm
Genomic imprinting is an epigenetic phenomenon leading to parent-of-origin specific differential expression of maternally and paternally inherited alleles. In plants, genomic imprinting has mainly been observed in the endosperm, an ephemeral triploid tissue derived after fertilization of the diploid central cell with a haploid sperm cell. In an effort to identify novel imprinted genes in Arabidopsis thaliana, we generated deep sequencing RNA profiles of F1 hybrid seeds derived after reciprocal crosses of Arabidopsis Col-0 and Bur-0 accessions. Using polymorphic sites to quantify allele-specific expression levels, we could identify more than 60 genes with potential parent-of-origin specific expression. By analyzing the distribution of DNA methylation and epigenetic marks established by Polycomb group (PcG) proteins using publicly available datasets, we suggest that for maternally expressed genes (MEGs) repression of the paternally inherited alleles largely depends on DNA methylation or PcG-mediated repression, whereas repression of the maternal alleles of paternally expressed genes (PEGs) predominantly depends on PcG proteins. While maternal alleles of MEGs are also targeted by PcG proteins, such targeting does not cause complete repression. Candidate MEGs and PEGs are enriched for cis-proximal transposons, suggesting that transposons might be a driving force for the evolution of imprinted genes in Arabidopsis. In addition, we find that MEGs and PEGs are significantly faster evolving when compared to other genes in the genome. In contrast to the predominant location of mammalian imprinted genes in clusters, cluster formation was only detected for few MEGs and PEGs, suggesting that clustering is not a major requirement for imprinted gene regulation in Arabidopsis
- …