27 research outputs found

    JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles.

    Get PDF
    JASPAR (http://jaspar.genereg.net) is an open-access database storing curated, non-redundant transcription factor (TF) binding profiles representing transcription factor binding preferences as position frequency matrices for multiple species in six taxonomic groups. For this 2016 release, we expanded the JASPAR CORE collection with 494 new TF binding profiles (315 in vertebrates, 11 in nematodes, 3 in insects, 1 in fungi and 164 in plants) and updated 59 profiles (58 in vertebrates and 1 in fungi). The introduced profiles represent an 83% expansion and 10% update when compared to the previous release. We updated the structural annotation of the TF DNA binding domains (DBDs) following a published hierarchical structural classification. In addition, we introduced 130 transcription factor flexible models trained on ChIP-seq data for vertebrates, which capture dinucleotide dependencies within TF binding sites. This new JASPAR release is accompanied by a new web tool to infer JASPAR TF binding profiles recognized by a given TF protein sequence. Moreover, we provide the users with a Ruby module complementing the JASPAR API to ease programmatic access and use of the JASPAR collection of profiles. Finally, we provide the JASPAR2016 R/Bioconductor data package with the data of this release

    PAX6 MiniPromoters drive restricted expression from rAAV in the adult mouse retina.

    Get PDF
    Current gene therapies predominantly use small, strong, and readily available ubiquitous promoters. However, as the field matures, the availability of small, cell-specific promoters would be greatly beneficial. Here we design seven small promoters from the human paired box 6 (PAX6) gene and test them in the adult mouse retina using recombinant adeno-associated virus. We chose the retina due to previous successes in gene therapy for blindness, and the PAX6 gene since it is: well studied; known to be driven by discrete regulatory regions; expressed in therapeutically interesting retinal cell types; and mutated in the vision-loss disorder aniridia, which is in need of improved therapy. At the PAX6 locus, 31 regulatory regions were bioinformatically predicted, and nine regulatory regions were constructed into seven MiniPromoters. Driving Emerald GFP, these MiniPromoters were packaged into recombinant adeno-associated virus, and injected intravitreally into postnatal day 14 mice. Four MiniPromoters drove consistent retinal expression in the adult mouse, driving expression in combinations of cell-types that endogenously express Pax6: ganglion, amacrine, horizontal, and Müller glia. Two PAX6-MiniPromoters drive expression in three of the four cell types that express PAX6 in the adult mouse retina. Combined, they capture all four cell types, making them potential tools for research, and PAX6-gene therapy for aniridia

    The Space Weather Atmosphere Models and Indices (SWAMI) project: Overview and first results

    Get PDF
    This is the final version. Available on open access from EDP Open via the DOI in this recordSpace weather driven atmospheric density variations affect low Earth orbit (LEO) satellites during all phases of their operational lifetime. Rocket launches, re-entry events and space debris are also similarly affected. A better understanding of space weather processes and their impact on atmospheric density is thus critical for satellite operations as well as for safety issues. The Horizon 2020 project Space Weather Atmosphere Model and Indices (SWAMI) project, which started in January 2018, aims to enhance this understanding by: Developing improved neutral atmosphere and thermosphere models, and combining these models to produce a new whole atmosphere model. Developing new geomagnetic activity indices with higher time cadence to enable better representation of thermospheric variability in the models, and improving the forecast of these indices. The project stands out by providing an integrated approach to the satellite neutral environment, in which the main space weather drivers are addressed together with model improvement. The outcomes of SWAMI will provide a pathway to improved space weather services as the project will not only address the science issues, but also the transition of models into operational services. The project aims to develop a unique new whole atmosphere model, by extending and blending the Unified Model (UM), which is the Met Office weather and climate model, and the Drag Temperature Model (DTM), which is a semi-empirical model which covers the 120–1500 km altitude range. A user-focused operational tool for satellite applications shall be developed based on this. In addition, improved geomagnetic indices shall be developed and shall be used in the UM and DTM for enhanced nowcast and forecast capability. In this paper, we report on progress with SWAMI to date. The UM has been extended from its original upper boundary of 85 km to run stably and accurately with a 135 km lid. Developments to the UM radiation scheme to enable accurate performance in the mesosphere and lower thermosphere are described. These include addition of non-local thermodynamic equilibrium effects and extension to include the far ultraviolet and extreme ultraviolet. DTM has been re-developed using a more accurate neutral density observation database than has been used in the past. In addition, we describe an algorithm to develop a new version of DTM driven by geomagnetic indices with a 60 minute cadence (denoted Hp60) rather than 3-hourly Kp indices (and corresponding ap indices). The development of the Hp60 index, and the Hp30 and Hp90 indices, which are similar to Hp60 but with 30 minute and 90 minute cadences, respectively, is described, as is the development and testing of neural network and other machine learning methods applied to the forecast of geomagnetic indices.European Union Horizon 202

    Erratum: JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework

    Get PDF
    JASPAR (http://jaspar.genereg.net) is an open-access database of curated, non-redundant transcription factor (TF)-binding profiles stored as position frequency matrices (PFMs) and TF flexible models (TFFMs) for TFs across multiple species in six taxonomic groups. In the 2018 release of JASPAR, the CORE collection has been expanded with 322 new PFMs (60 for vertebrates and 262 for plants) and 33 PFMs were updated (24 for vertebrates, 8 for plants and 1 for insects). These new profiles represent a 30% expansion compared to the 2016 release. In addition, we have introduced 316 TFFMs (95 for vertebrates, 218 for plants and 3 for insects). This release incorporates clusters of similar PFMs in each taxon and each TF class per taxon. The JASPAR 2018 CORE vertebrate collection of PFMs was used to predict TF-binding sites in the human genome. The predictions are made available to the scientific community through a UCSC Genome Browser track data hub. Finally, this update comes with a new web framework with an interactive and responsive user-interface, along with new features. All the underlying data can be retrieved programmatically using a RESTful API and through the JASPAR 2018 R/Bioconductor package

    Validation of Skeletal Muscle cis-Regulatory Module Predictions Reveals Nucleotide Composition Bias in Functional Enhancers

    Get PDF
    We performed a genome-wide scan for muscle-specific cis-regulatory modules (CRMs) using three computational prediction programs. Based on the predictions, 339 candidate CRMs were tested in cell culture with NIH3T3 fibroblasts and C2C12 myoblasts for capacity to direct selective reporter gene expression to differentiated C2C12 myotubes. A subset of 19 CRMs validated as functional in the assay. The rate of predictive success reveals striking limitations of computational regulatory sequence analysis methods for CRM discovery. Motif-based methods performed no better than predictions based only on sequence conservation. Analysis of the properties of the functional sequences relative to inactive sequences identifies nucleotide sequence composition can be an important characteristic to incorporate in future methods for improved predictive specificity. Muscle-related TFBSs predicted within the functional sequences display greater sequence conservation than non-TFBS flanking regions. Comparison with recent MyoD and histone modification ChIP-Seq data supports the validity of the functional regions

    Stroke genetics informs drug discovery and risk prediction across ancestries

    Get PDF
    Previous genome-wide association studies (GWASs) of stroke - the second leading cause of death worldwide - were conducted predominantly in populations of European ancestry(1,2). Here, in cross-ancestry GWAS meta-analyses of 110,182 patients who have had a stroke (five ancestries, 33% non-European) and 1,503,898 control individuals, we identify association signals for stroke and its subtypes at 89 (61 new) independent loci: 60 in primary inverse-variance-weighted analyses and 29 in secondary meta-regression and multitrait analyses. On the basis of internal cross-ancestry validation and an independent follow-up in 89,084 additional cases of stroke (30% non-European) and 1,013,843 control individuals, 87% of the primary stroke risk loci and 60% of the secondary stroke risk loci were replicated (P < 0.05). Effect sizes were highly correlated across ancestries. Cross-ancestry fine-mapping, in silico mutagenesis analysis(3), and transcriptome-wide and proteome-wide association analyses revealed putative causal genes (such as SH3PXD2A and FURIN) and variants (such as at GRK5 and NOS3). Using a three-pronged approach(4), we provide genetic evidence for putative drug effects, highlighting F11, KLKB1, PROC, GP1BA, LAMC2 and VCAM1 as possible targets, with drugs already under investigation for stroke for F11 and PROC. A polygenic score integrating cross-ancestry and ancestry-specific stroke GWASs with vascular-risk factor GWASs (integrative polygenic scores) strongly predicted ischaemic stroke in populations of European, East Asian and African ancestry(5). Stroke genetic risk scores were predictive of ischaemic stroke independent of clinical risk factors in 52,600 clinical-trial participants with cardiometabolic disease. Our results provide insights to inform biology, reveal potential drug targets and derive genetic risk prediction tools across ancestries.</p

    Stroke genetics informs drug discovery and risk prediction across ancestries

    Get PDF
    Previous genome-wide association studies (GWASs) of stroke — the second leading cause of death worldwide — were conducted predominantly in populations of European ancestry1,2. Here, in cross-ancestry GWAS meta-analyses of 110,182 patients who have had a stroke (five ancestries, 33% non-European) and 1,503,898 control individuals, we identify association signals for stroke and its subtypes at 89 (61 new) independent loci: 60 in primary inverse-variance-weighted analyses and 29 in secondary meta-regression and multitrait analyses. On the basis of internal cross-ancestry validation and an independent follow-up in 89,084 additional cases of stroke (30% non-European) and 1,013,843 control individuals, 87% of the primary stroke risk loci and 60% of the secondary stroke risk loci were replicated (P < 0.05). Effect sizes were highly correlated across ancestries. Cross-ancestry fine-mapping, in silico mutagenesis analysis3, and transcriptome-wide and proteome-wide association analyses revealed putative causal genes (such as SH3PXD2A and FURIN) and variants (such as at GRK5 and NOS3). Using a three-pronged approach4, we provide genetic evidence for putative drug effects, highlighting F11, KLKB1, PROC, GP1BA, LAMC2 and VCAM1 as possible targets, with drugs already under investigation for stroke for F11 and PROC. A polygenic score integrating cross-ancestry and ancestry-specific stroke GWASs with vascular-risk factor GWASs (integrative polygenic scores) strongly predicted ischaemic stroke in populations of European, East Asian and African ancestry5. Stroke genetic risk scores were predictive of ischaemic stroke independent of clinical risk factors in 52,600 clinical-trial participants with cardiometabolic disease. Our results provide insights to inform biology, reveal potential drug targets and derive genetic risk prediction tools across ancestries

    JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework.

    No full text
    JASPAR (http://jaspar.genereg.net) is an open-access database of curated, non-redundant transcription factor (TF)-binding profiles stored as position frequency matrices (PFMs) and TF flexible models (TFFMs) for TFs across multiple species in six taxonomic groups. In the 2018 release of JASPAR, the CORE collection has been expanded with 322 new PFMs (60 for vertebrates and 262 for plants) and 33 PFMs were updated (24 for vertebrates, 8 for plants and 1 for insects). These new profiles represent a 30% expansion compared to the 2016 release. In addition, we have introduced 316 TFFMs (95 for vertebrates, 218 for plants and 3 for insects). This release incorporates clusters of similar PFMs in each taxon and each TF class per taxon. The JASPAR 2018 CORE vertebrate collection of PFMs was used to predict TF-binding sites in the human genome. The predictions are made available to the scientific community through a UCSC Genome Browser track data hub. Finally, this update comes with a new web framework with an interactive and responsive user-interface, along with new features. All the underlying data can be retrieved programmatically using a RESTful API and through the JASPAR 2018 R/Bioconductor package
    corecore