75 research outputs found
The specificity and evolution of gene regulatory elements
Thesis (Ph. D.)--Massachusetts Institute of Technology, Computational and Systems Biology Program, 2010.This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Cataloged from student-submitted PDF version of thesis.Includes bibliographical references.The regulation of gene expression underlies the morphological, physiological, and functional differences between human cell types, developmental stages, and healthy and disease states. Gene regulation in eukaryotes is controlled by a complex milieu including transcription factors, microRNAs (miRNAs), cis-regulatory DNA and RNA. It is the quantitative and combinatorial interactions of these regulatory elements that defines gene expression, but these interactions are incompletely understood. In this thesis, I present two new methods for determining the quantitative specificity of gene regulatory factors. First, I present a comparative genomics approach that utilizes signatures of natural selection to detect the conserved biological relevance of miRNAs and their targets. Using this method, I quantify the abundance of different conserved miRNA target types, including different seed matches and 30-compensatory targets. I show that over 60% of mammalian mRNAs are conserved targets of miRNAs and that a surprising amount of conserved miRNA targeting is mediated by seed matches with relatively low efficacy. Extending this method from mammals to other organisms, I find that miRNA targeting rules are mostly conserved, although I show evidence for new types of miRNA targets in nematodes. Taking advantage of variations in 30 UTR lengths between species, I describe general properties of miRNA targeting that are affected by 30 UTR length. Finally, I introduce a new, high-throughput assay for the quantification of transcription factor in vitro binding affinity to millions of sequences. I apply this method to GCN4, a yeast transcription factor, and reconstruct all known properties of its binding preferences. Additionally, I discover some new subtleties in its specificity and estimate dissociation constants for hundreds of thousands of sequences. I verify the utility of the binding affinities by comparing to in vivo binding data and to the regulatory response following GCN4 induction.by Robin Carl Friedman.Ph.D
Formation, regulation and evolution of Caenorhabditis elegans 3'UTRs
Post-transcriptional gene regulation frequently occurs through elements in mRNA 3Ⲡuntranslated regions (UTRs)1, 2. Although crucial roles for 3â˛UTR-mediated gene regulation have been found in Caenorhabditis elegans3, 4, 5, most C. elegans genes have lacked annotated 3â˛UTRs6, 7. Here we describe a high-throughput method for reliable identification of polyadenylated RNA termini, and we apply this method, called poly(A)-position profiling by sequencing (3P-Seq), to determine C. elegans 3â˛UTRs. Compared to standard methods also recently applied to C. elegans UTRs8, 3P-Seq identified 8,580 additional UTRs while excluding thousands of shorter UTR isoforms that do not seem to be authentic. Analysis of this expanded and corrected data set suggested that the high A/U content of C. elegans 3â˛UTRs facilitated genome compaction, because the elements specifying cleavage and polyadenylation, which are A/U rich, can more readily emerge in A/U-rich regions. Indeed, 30% of the protein-coding genes have mRNAs with alternative, partially overlapping end regions that generate another 10,480 cleavage and polyadenylation sites that had gone largely unnoticed and represent potential evolutionary intermediates of progressive UTR shortening. Moreover, a third of the convergently transcribed genes use palindromic arrangements of bidirectional elements to specify UTRs with convergent overlap, which also contributes to genome compaction by eliminating regions between genes. Although nematode 3â˛UTRs have median length only one-sixth that of mammalian 3â˛UTRs, they have twice the density of conserved microRNA sites, in part because additional types of seed-complementary sites are preferentially conserved. These findings reveal the influence of cleavage and polyadenylation on the evolution of genome architecture and provide resources for studying post-transcriptional gene regulation.National Institutes of Health (U.S.) (Grant number GM067031)National Science Foundation (U.S.). Predoctural FellowshipUnited States. Dept. of Energy. Computational Science Graduate Fellowship (Krell Institute
Recommended from our members
The human body at cellular resolution: the NIH Human Biomolecular Atlas Program
Abstract: Transformative technologies are enabling the construction of three-dimensional maps of tissues with unprecedented spatial and molecular resolution. Over the next seven years, the NIH Common Fund Human Biomolecular Atlas Program (HuBMAP) intends to develop a widely accessible framework for comprehensively mapping the human body at single-cell resolution by supporting technology development, data acquisition, and detailed spatial mapping. HuBMAP will integrate its efforts with other funding agencies, programs, consortia, and the biomedical research community at large towards the shared vision of a comprehensive, accessible three-dimensional molecular and cellular atlas of the human body, in health and under various disease conditions
An original phylogenetic approach identified mitochondrial haplogroup T1a1 as inversely associated with breast cancer risk in BRCA2 mutation carriers
Introduction: Individuals carrying pathogenic mutations in the BRCA1 and BRCA2 genes have a high lifetime risk of breast cancer. BRCA1 and BRCA2 are involved in DNA double-strand break repair, DNA alterations that can be caused by exposure to reactive oxygen species, a main source of which are mitochondria. Mitochondrial genome variations affect electron transport chain efficiency and reactive oxygen species production. Individuals with different mitochondrial haplogroups differ in their metabolism and sensitivity to oxidative stress. Variability in mitochondrial genetic background can alter reactive oxygen species production, leading to cancer risk. In the present study, we tested the hypothesis that mitochondrial haplogroups modify breast cancer risk in BRCA1/2 mutation carriers. Methods: We genotyped 22,214 (11,421 affected, 10,793 unaffected) mutation carriers belonging to the Consortium of Investigators of Modifiers of BRCA1/2 for 129 mitochondrial polymorphisms using the iCOGS array. Haplogroup inference and association detection were performed using a phylogenetic approach. ALTree was applied to explore the reference mitochondrial evolutionary tree and detect subclades enriched in affected or unaffected individuals. Results: We discovered that subclade T1a1 was depleted in affected BRCA2 mutation carriers compared with the rest of clade T (hazard ratio (HR) = 0.55; 95% confidence interval (CI), 0.34 to 0.88; P = 0.01). Compared with the most frequent haplogroup in the general population (that is, H and T clades), the T1a1 haplogroup has a HR of 0.62 (95% CI, 0.40 to 0.95; P = 0.03). We also identified three potential susceptibility loci, including G13708A/rs28359178, which has demonstrated an inverse association with familial breast cancer risk. Conclusions: This study illustrates how original approaches such as the phylogeny-based method we used can empower classical molecular epidemiological studies aimed at identifying association or risk modification effects.Peer reviewe
Robust estimation of bacterial cell count from optical density
Optical density (OD) is widely used to estimate the density of cells in liquid culture, but cannot be compared between instruments without a standardized calibration protocol and is challenging to relate to actual cell count. We address this with an interlaboratory study comparing three simple, low-cost, and highly accessible OD calibration protocols across 244 laboratories, applied to eight strains of constitutive GFP-expressing E. coli. Based on our results, we recommend calibrating OD to estimated cell count using serial dilution of silica microspheres, which produces highly precise calibration (95.5% of residuals <1.2-fold), is easily assessed for quality control, also assesses instrument effective linear range, and can be combined with fluorescence calibration to obtain units of Molecules of Equivalent Fluorescein (MEFL) per cell, allowing direct comparison and data fusion with flow cytometry measurements: in our study, fluorescence per cell measurements showed only a 1.07-fold mean difference between plate reader and flow cytometry data
Omecamtiv mecarbil in chronic heart failure with reduced ejection fraction, GALACTICâHF: baseline characteristics and comparison with contemporary clinical trials
Aims:
The safety and efficacy of the novel selective cardiac myosin activator, omecamtiv mecarbil, in patients with heart failure with reduced ejection fraction (HFrEF) is tested in the Global Approach to Lowering Adverse Cardiac outcomes Through Improving Contractility in Heart Failure (GALACTICâHF) trial. Here we describe the baseline characteristics of participants in GALACTICâHF and how these compare with other contemporary trials.
Methods and Results:
Adults with established HFrEF, New York Heart Association functional class (NYHA)ââĽâII, EF â¤35%, elevated natriuretic peptides and either current hospitalization for HF or history of hospitalization/ emergency department visit for HF within a year were randomized to either placebo or omecamtiv mecarbil (pharmacokineticâguided dosing: 25, 37.5 or 50âmg bid). 8256 patients [male (79%), nonâwhite (22%), mean age 65âyears] were enrolled with a mean EF 27%, ischemic etiology in 54%, NYHA II 53% and III/IV 47%, and median NTâproBNP 1971âpg/mL. HF therapies at baseline were among the most effectively employed in contemporary HF trials. GALACTICâHF randomized patients representative of recent HF registries and trials with substantial numbers of patients also having characteristics understudied in previous trials including more from North America (n = 1386), enrolled as inpatients (n = 2084), systolic blood pressureâ<â100âmmHg (n = 1127), estimated glomerular filtration rate <â30âmL/min/1.73 m2 (n = 528), and treated with sacubitrilâvalsartan at baseline (n = 1594).
Conclusions:
GALACTICâHF enrolled a wellâtreated, highârisk population from both inpatient and outpatient settings, which will provide a definitive evaluation of the efficacy and safety of this novel therapy, as well as informing its potential future implementation
Most mammalian mRNAs are conserved targets of microRNAs
MicroRNAs (miRNAs) are small endogenous RNAs that pair to sites in mRNAs to direct post-transcriptional repression. Many sites that match the miRNA seed (nucleotides 2â7), particularly those in 3Ⲡuntranslated regions (3â˛UTRs), are preferentially conserved. Here, we overhauled our tool for finding preferential conservation of sequence motifs and applied it to the analysis of human 3â˛UTRs, increasing by nearly threefold the detected number of preferentially conserved miRNA target sites. The new tool more efficiently incorporates new genomes and more completely controls for background conservation by accounting for mutational biases, dinucleotide conservation rates, and the conservation rates of individual UTRs. The improved background model enabled preferential conservation of a new site type, the âoffset 6mer,â to be detected. In total, >45,000 miRNA target sites within human 3â˛UTRs are conserved above background levels, and >60% of human protein-coding genes have been under selective pressure to maintain pairing to miRNAs. Mammalian-specific miRNAs have far fewer conserved targets than do the more broadly conserved miRNAs, even when considering only more recently emerged targets. Although pairing to the 3Ⲡend of miRNAs can compensate for seed mismatches, this class of sites constitutes less than 2% of all preferentially conserved sites detected. The new tool enables statistically powerful analysis of individual miRNA target sites, with the probability of preferentially conserved targeting (PCT) correlating with experimental measurements of repression. Our expanded set of target predictions (including conserved 3â˛-compensatory sites), are available at the TargetScan website, which displays the PCT for each site and each predicted target.National Institutes of Health (U.S.)Krell InstituteUnited States. Dept. of Energy (Computational Sciences Graduate Fellowship
- âŚ