541,225 research outputs found
Gene expression in large pedigrees: analytic approaches.
BackgroundWe currently have the ability to quantify transcript abundance of messenger RNA (mRNA), genome-wide, using microarray technologies. Analyzing genotype, phenotype and expression data from 20 pedigrees, the members of our Genetic Analysis Workshop (GAW) 19 gene expression group published 9 papers, tackling some timely and important problems and questions. To study the complexity and interrelationships of genetics and gene expression, we used established statistical tools, developed newer statistical tools, and developed and applied extensions to these tools.MethodsTo study gene expression correlations in the pedigree members (without incorporating genotype or trait data into the analysis), 2 papers used principal components analysis, weighted gene coexpression network analysis, meta-analyses, gene enrichment analyses, and linear mixed models. To explore the relationship between genetics and gene expression, 2 papers studied expression quantitative trait locus allelic heterogeneity through conditional association analyses, and epistasis through interaction analyses. A third paper assessed the feasibility of applying allele-specific binding to filter potential regulatory single-nucleotide polymorphisms (SNPs). Analytic approaches included linear mixed models based on measured genotypes in pedigrees, permutation tests, and covariance kernels. To incorporate both genotype and phenotype data with gene expression, 4 groups employed linear mixed models, nonparametric weighted U statistics, structural equation modeling, Bayesian unified frameworks, and multiple regression.Results and discussionRegarding the analysis of pedigree data, we found that gene expression is familial, indicating that at least 1 factor for pedigree membership or multiple factors for the degree of relationship should be included in analyses, and we developed a method to adjust for familiality prior to conducting weighted co-expression gene network analysis. For SNP association and conditional analyses, we found FaST-LMM (Factored Spectrally Transformed Linear Mixed Model) and SOLAR-MGA (Sequential Oligogenic Linkage Analysis Routines -Major Gene Analysis) have similar type 1 and type 2 errors and can be used almost interchangeably. To improve the power and precision of association tests, prior knowledge of DNase-I hypersensitivity sites or other relevant biological annotations can be incorporated into the analyses. On a biological level, eQTL (expression quantitative trait loci) are genetically complex, exhibiting both allelic heterogeneity and epistasis. Including both genotype and phenotype data together with measurements of gene expression was found to be generally advantageous in terms of generating improved levels of significance and in providing more interpretable biological models.ConclusionsPedigrees can be used to conduct analyses of and enhance gene expression studies
Natural variation in abiotic stress responsive gene expression and local adaptation to climate in Arabidopsis thaliana.
Gene expression varies widely in natural populations, yet the proximate and ultimate causes of this variation are poorly known. Understanding how variation in gene expression affects abiotic stress tolerance, fitness, and adaptation is central to the field of evolutionary genetics. We tested the hypothesis that genes with natural genetic variation in their expression responses to abiotic stress are likely to be involved in local adaptation to climate in Arabidopsis thaliana. Specifically, we compared genes with consistent expression responses to environmental stress (expression stress responsive, "eSR") to genes with genetically variable responses to abiotic stress (expression genotype-by-environment interaction, "eGEI"). We found that on average genes that exhibited eGEI in response to drought or cold had greater polymorphism in promoter regions and stronger associations with climate than those of eSR genes or genomic controls. We also found that transcription factor binding sites known to respond to environmental stressors, especially abscisic acid responsive elements, showed significantly higher polymorphism in drought eGEI genes in comparison to eSR genes. By contrast, eSR genes tended to exhibit relatively greater pairwise haplotype sharing, lower promoter diversity, and fewer nonsynonymous polymorphisms, suggesting purifying selection or selective sweeps. Our results indicate that cis-regulatory evolution and genetic variation in stress responsive gene expression may be important mechanisms of local adaptation to climatic selective gradients
The molecular basis of T cell acute lymphoblastic leukemia
T cell acute lymphoblastic leukemias (T-ALLs) arise from the malignant transformation of hematopoietic progenitors primed toward T cell development, as result of a multistep oncogenic process involving constitutive activation of NOTCH signaling and genetic alterations in transcription factors, signaling oncogenes, and tumor suppressors. Notably, these genetic alterations define distinct molecular groups of T-ALL with specific gene expression signatures and clinicobiological features. This review summarizes recent advances in our understanding of the molecular genetics of T-ALL
The Functional Consequences of Variation in Transcription Factor Binding
One goal of human genetics is to understand how the information for precise
and dynamic gene expression programs is encoded in the genome. The interactions
of transcription factors (TFs) with DNA regulatory elements clearly play an
important role in determining gene expression outputs, yet the regulatory logic
underlying functional transcription factor binding is poorly understood. Many
studies have focused on characterizing the genomic locations of TF binding, yet
it is unclear to what extent TF binding at any specific locus has functional
consequences with respect to gene expression output. To evaluate the context of
functional TF binding we knocked down 59 TFs and chromatin modifiers in one
HapMap lymphoblastoid cell line. We then identified genes whose expression was
affected by the knockdowns. We intersected the gene expression data with
transcription factor binding data (based on ChIP-seq and DNase-seq) within 10
kb of the transcription start sites of expressed genes. This combination of
data allowed us to infer functional TF binding. On average, 14.7% of genes
bound by a factor were differentially expressed following the knockdown of that
factor, suggesting that most interactions between TF and chromatin do not
result in measurable changes in gene expression levels of putative target
genes. We found that functional TF binding is enriched in regulatory elements
that harbor a large number of TF binding sites, at sites with predicted higher
binding affinity, and at sites that are enriched in genomic regions annotated
as active enhancers.Comment: 30 pages, 6 figures (7 supplemental figures and 6 supplemental tables
available upon request to [email protected]). Submitted to PLoS
Genetic
Revised Annotations, Sex-Biased Expression, and Lineage-Specific Genes in the Drosophila melanogaster group
Here, we provide revised gene models for D. ananassae, D. yakuba, and D.
simulans, which include UTRs and empirically verified intron-exon boundaries,
as well as ortholog groups identified using a fuzzy reciprocal-best-hit blast
comparison. Using these revised annotations, we perform differential expression
testing using the cufflinks suite to provide a broad overview of differential
expression between reproductive tissues and the carcass. We identify thousands
of genes that are differentially expressed across tissues in D. yakuba and D.
simulans, with roughly 60% agreement in expression patterns of orthologs in D.
yakuba and D. simulans. We identify several cases of putative polycistronic
transcripts, pointing to a combination of transcriptional read-through in the
genome as well as putative gene fusion and fission events across taxa. We
furthermore identify hundreds of lineage specific genes in each species with no
blast hits among transcripts of any other Drosophila species, which are
candidates for neofunctionalized proteins and a potential source of genetic
novelty.Comment: Revised manuscript, also available online preprint at G3: Genes,
Genomes, Genetics. Gene models, ortholog calls, and tissue specific
expression results are available at http://github.com/ThorntonLab/GFF or the
UCSC browser on the Thornton Lab public track hub at http://genome.ucsc.ed
Gene expression drives the evolution of dominance.
Dominance is a fundamental concept in molecular genetics and has implications for understanding patterns of genetic variation, evolution, and complex traits. However, despite its importance, the degree of dominance in natural populations is poorly quantified. Here, we leverage multiple mating systems in natural populations of Arabidopsis to co-estimate the distribution of fitness effects and dominance coefficients of new amino acid changing mutations. We find that more deleterious mutations are more likely to be recessive than less deleterious mutations. Further, this pattern holds across gene categories, but varies with the connectivity and expression patterns of genes. Our work argues that dominance arises as a consequence of the functional importance of genes and their optimal expression levels
Virus-induced gene complementation reveals a transcription factor network in modulation of tomato fruit ripening
Plant virus technology, in particular virus-induced gene silencing, is a widely used reverse- and forward-genetics tool in plant functional genomics. However the potential of virus technology to express genes to induce phenotypes or to complement mutants in order to understand the function of plant genes is not well documented. Here we exploit Potato virus X as a tool for virus-induced gene complementation (VIGC). Using VIGC in tomato, we demonstrated that ectopic viral expression of LeMADS-RIN, which encodes a MADS-box transcription factor (TF), resulted in functional complementation of the non-ripening rin mutant phenotype and caused fruits to ripen. Comparative gene expression analysis indicated that LeMADS-RIN up-regulated expression of the SBP-box (SQUAMOSA promoter binding protein-like) gene LeSPL-CNR, but down-regulated the expression of LeHB-1, an HD-Zip homeobox TF gene. Our data support the hypothesis that a transcriptional network may exist among key TFs in the modulation of fruit ripening in tomato
Modularity Facilitates Flexible Tuning of Plastic and Evolutionary Gene Expression Responses during Early Divergence
Gene expression changes have been recognized as important drivers of adaptation to changing environmental conditions. Little is known about the relative roles of plastic and evolutionary responses in complex gene expression networks during the early stages of divergence. Large gene expression data sets coupled with in silico methods for identifying coexpressed modules now enable systems genetics approaches also in nonmodel species for better understanding of gene expression responses during early divergence. Here, we combined gene coexpression analyses with population genetics to separate plastic and population (evolutionary) effects in expression networks using small salmonid populations as a model system. We show that plastic and population effects were highly variable among the six identified modules and that the plastic effects explained larger proportion of the total eigengene expression than population effects. A more detailed analysis of the population effects using a QST - FST comparison across 16,622 annotated transcripts revealed that gene expression followed neutral expectations within modules and at the global level. Furthermore, two modules showed enrichment for genes coding for early developmental traits that have been previously identified as important phenotypic traits in thermal responses in the same model system indicating that coexpression analysis can capture expression patterns underlying ecologically important traits. We suggest that module-specific responses may facilitate the flexible tuning of expression levels to local thermal conditions. Overall, our study indicates that plasticity and neutral evolution are the main drivers of gene expression variance in the early stages of thermal adaptation in this system.Peer reviewe
- …
