270 research outputs found
The Functional Consequences of Variation in Transcription Factor Binding
One goal of human genetics is to understand how the information for precise
and dynamic gene expression programs is encoded in the genome. The interactions
of transcription factors (TFs) with DNA regulatory elements clearly play an
important role in determining gene expression outputs, yet the regulatory logic
underlying functional transcription factor binding is poorly understood. Many
studies have focused on characterizing the genomic locations of TF binding, yet
it is unclear to what extent TF binding at any specific locus has functional
consequences with respect to gene expression output. To evaluate the context of
functional TF binding we knocked down 59 TFs and chromatin modifiers in one
HapMap lymphoblastoid cell line. We then identified genes whose expression was
affected by the knockdowns. We intersected the gene expression data with
transcription factor binding data (based on ChIP-seq and DNase-seq) within 10
kb of the transcription start sites of expressed genes. This combination of
data allowed us to infer functional TF binding. On average, 14.7% of genes
bound by a factor were differentially expressed following the knockdown of that
factor, suggesting that most interactions between TF and chromatin do not
result in measurable changes in gene expression levels of putative target
genes. We found that functional TF binding is enriched in regulatory elements
that harbor a large number of TF binding sites, at sites with predicted higher
binding affinity, and at sites that are enriched in genomic regions annotated
as active enhancers.Comment: 30 pages, 6 figures (7 supplemental figures and 6 supplemental tables
available upon request to [email protected]). Submitted to PLoS
Genetic
Evolutionary insights into primate skeletal gene regulation using a comparative cell culture model
The evolution of complex skeletal traits in primates was likely influenced by both genetic and environmental factors. Because skeletal tissues are notoriously challenging to study using functional genomic approaches, they remain poorly characterized even in humans, let alone across multiple species. The challenges involved in obtaining functional genomic data from the skeleton, combined with the difficulty of obtaining such tissues from nonhuman apes, motivated us to consider an alternative in vitro system with which to comparatively study gene regulation in skeletal cell types. Specifically, we differentiated six human (Homo sapiens) and six chimpanzee (Pan troglodytes) induced pluripotent stem cell lines (iPSCs) into mesenchymal stem cells (MSCs) and subsequently into osteogenic cells (bone cells). We validated differentiation using standard methods and collected single-cell RNA sequencing data from over 100,000 cells across multiple samples and replicates at each stage of differentiation. While most genes that we examined display conserved patterns of expression across species, hundreds of genes are differentially expressed (DE) between humans and chimpanzees within and across stages of osteogenic differentiation. Some of these interspecific DE genes show functional enrichments relevant in skeletal tissue trait development. Moreover, topic modeling indicates that interspecific gene programs become more pronounced as cells mature. Overall, we propose that this in vitro model can be used to identify interspecific regulatory differences that may have contributed to skeletal trait differences between species
Phase-Separation Transition in Liquid Mixtures Near Charged Objects
We study the thermodynamic behavior of nonpolar liquid mixtures in the
vicinity of curved charged objects, such as electrodes or charged colloids. For
small enough charge on the object, or equivalently, small potential, the
dielectrophoretic force leads to enrichment of the more polar liquid close the
colloid. However, there is a critical value of charge (or potential), above
which a phase-separation transition occurs, and the interface between high- and
low-dielectric constant components becomes sharp. Analytical and numerical
composition profile are given, and the equilibrium front location as a function
of charge or voltage is found. We further employ a simple Cahn-Hilliard type
equation to study the dynamics of phase-separation in spatially nonuniform
electric fields. We find an exponential relaxation of the composition front
location, with a characteristic time depending on the charge, mixture
composition and ambient temperature.Comment: final version, includes small changes and typo correction
Comparative metabolomics in primates reveals the effects of diet and gene regulatory variation on metabolic divergence.
Human diets differ from those of non-human primates. Among few obvious differences, humans consume more meat than most non-human primates and regularly cook their food. It is hypothesized that a dietary shift during human evolution has been accompanied by molecular adaptations in metabolic pathways. Consistent with this notion, comparative studies of gene expression levels in primates have found that the regulation of genes with metabolic functions tend to evolve rapidly in the human lineage. The metabolic consequences of these regulatory differences, however, remained unknown. To address this gap, we performed a comparative study using a combination of gene expression and metabolomic profiling in livers from humans, chimpanzees, and rhesus macaques. We show that dietary differences between species have a strong effect on metabolic concentrations. In addition, we found that differences in metabolic concentration across species are correlated with inter-species differences in the expression of the corresponding enzymes, which control the same metabolic reaction. We identified a number of metabolic compounds with lineage-specific profiles, including examples of human-species metabolic differences that may be directly related to dietary differences
The Genetic and Mechanistic Basis for Variation in Gene Regulation
It is now well established that noncoding regulatory variants play a central role in the genetics of common diseases and in evolution. However, until recently, we have known little about the mechanisms by which most regulatory variants act. For instance, what types of functional elements in DNA, RNA, or proteins are most often affected by regulatory variants? Which stages of gene regulation are typically altered? How can we predict which variants are most likely to impact regulation in a given cell type? Recent studies, in many cases using quantitative trait loci (QTL)-mapping approaches in cell lines or tissue samples, have provided us with considerable insight into the properties of genetic loci that have regulatory roles. Such studies have uncovered novel biochemical regulatory interactions and led to the identification of previously unrecognized regulatory mechanisms. We have learned that genetic variation is often directly associated with variation in regulatory activities (namely, we can map regulatory QTLs, not just expression QTLs [eQTLs]), and we have taken the first steps towards understanding the causal order of regulatory events (for example, the role of pioneer transcription factors). Yet, in most cases, we still do not know how to interpret overlapping combinations of regulatory interactions, and we are still far from being able to predict how variation in regulatory mechanisms is propagated through a chain of interactions to eventually result in changes in gene expression profiles.National Institutes of Health (U.S.) (grant NIH HG006123)National Institutes of Health (U.S.) (NIH GM007197)National Institutes of Health (U.S.) (grant NIH MH084703)Howard Hughes Medical InstituteJane Coffin Childs Memorial Fund for Medical Research (postdoctoral fellowship
Recommended from our members
Taxonomic Classification of Bacterial 16S rRNA Genes Using Short Sequencing Reads: Evaluation of Effective Study Designs
Massively parallel high throughput sequencing technologies allow us to interrogate the microbial composition of biological samples at unprecedented resolution. The typical approach is to perform high-throughout sequencing of 16S rRNA genes, which are then taxonomically classified based on similarity to known sequences in existing databases. Current technologies cause a predicament though, because although they enable deep coverage of samples, they are limited in the length of sequence they can produce. As a result, high-throughout studies of microbial communities often do not sequence the entire 16S rRNA gene. The challenge is to obtain reliable representation of bacterial communities through taxonomic classification of short 16S rRNA gene sequences. In this study we explored properties of different study designs and developed specific recommendations for effective use of short-read sequencing technologies for the purpose of interrogating bacterial communities, with a focus on classification using naïve Bayesian classifiers. To assess precision and coverage of each design, we used a collection of ∼8,500 manually curated 16S rRNA gene sequences from cultured bacteria and a set of over one million bacterial 16S rRNA gene sequences retrieved from environmental samples, respectively. We also tested different configurations of taxonomic classification approaches using short read sequencing data, and provide recommendations for optimal choice of the relevant parameters. We conclude that with a judicious selection of the sequenced region and the corresponding choice of a suitable training set for taxonomic classification, it is possible to explore bacterial communities at great depth using current technologies, with only a minimal loss of taxonomic resolution.</p
Recommended from our members
The Effect of Freeze-Thaw Cycles on Gene Expression Levels in Lymphoblastoid Cell Lines
Epstein-Barr virus (EBV) transformed lymphoblastoid cell lines (LCLs) are a widely used renewable resource for functional genomic studies in humans. The ability to accumulate multidimensional data pertaining to the same individual cell lines, from complete genomic sequences to detailed gene regulatory profiles, further enhances the utility of LCLs as a model system. However, the extent to which LCLs are a faithful model system is relatively unknown. We have previously shown that gene expression profiles of newly established LCLs maintain a strong individual component. Here, we extend our study to investigate the effect of freeze-thaw cycles on gene expression patterns in mature LCLs, especially in the context of inter-individual variation in gene expression. We report a profound difference in the gene expression profiles of newly established and mature LCLs. Once newly established LCLs undergo a freeze-thaw cycle, the individual specific gene expression signatures become much less pronounced as the gene expression levels in LCLs from different individuals converge to a more uniform profile, which reflects a mature transformed B cell phenotype. We found that previously identified eQTLs are enriched among the relatively few genes whose regulations in mature LCLs maintain marked individual signatures. We thus conclude that while insight drawn from gene regulatory studies in mature LCLs may generally not be affected by the artificial nature of the LCL model system, many aspects of primary B cell biology cannot be observed and studied in mature LCL cultures.</p
RNA-seq: impact of RNA degradation on transcript quantification
Background
The use of low quality RNA samples in whole-genome gene expression profiling remains controversial. It is unclear if transcript degradation in low quality RNA samples occurs uniformly, in which case the effects of degradation can be corrected via data normalization, or whether different transcripts are degraded at different rates, potentially biasing measurements of expression levels. This concern has rendered the use of low quality RNA samples in whole-genome expression profiling problematic. Yet, low quality samples (for example, samples collected in the course of fieldwork) are at times the sole means of addressing specific questions.
Results
We sought to quantify the impact of variation in RNA quality on estimates of gene expression levels based on RNA-seq data. To do so, we collected expression data from tissue samples that were allowed to decay for varying amounts of time prior to RNA extraction. The RNA samples we collected spanned the entire range of RNA Integrity Number (RIN) values (a metric commonly used to assess RNA quality). We observed widespread effects of RNA quality on measurements of gene expression levels, as well as a slight but significant loss of library complexity in more degraded samples.
Conclusions
While standard normalizations failed to account for the effects of degradation, we found that by explicitly controlling for the effects of RIN using a linear model framework we can correct for the majority of these effects. We conclude that in instances in which RIN and the effect of interest are not associated, this approach can help recover biologically meaningful signals in data from degraded RNA samples.American Heart Association (Predoctoral Fellowship
- …