52 research outputs found
A Machine Learning Approach for Identifying Novel Cell Type–Specific Transcriptional Regulators of Myogenesis
Transcriptional enhancers integrate the contributions of multiple classes of transcription factors (TFs) to orchestrate the myriad spatio-temporal gene expression programs that occur during development. A molecular understanding of enhancers with similar activities requires the identification of both their unique and their shared sequence features. To address this problem, we combined phylogenetic profiling with a DNA–based enhancer sequence classifier that analyzes the TF binding sites (TFBSs) governing the transcription of a co-expressed gene set. We first assembled a small number of enhancers that are active in Drosophila melanogaster muscle founder cells (FCs) and other mesodermal cell types. Using phylogenetic profiling, we increased the number of enhancers by incorporating orthologous but divergent sequences from other Drosophila species. Functional assays revealed that the diverged enhancer orthologs were active in largely similar patterns as their D. melanogaster counterparts, although there was extensive evolutionary shuffling of known TFBSs. We then built and trained a classifier using this enhancer set and identified additional related enhancers based on the presence or absence of known and putative TFBSs. Predicted FC enhancers were over-represented in proximity to known FC genes; and many of the TFBSs learned by the classifier were found to be critical for enhancer activity, including POU homeodomain, Myb, Ets, Forkhead, and T-box motifs. Empirical testing also revealed that the T-box TF encoded by org-1 is a previously uncharacterized regulator of muscle cell identity. Finally, we found extensive diversity in the composition of TFBSs within known FC enhancers, suggesting that motif combinatorics plays an essential role in the cellular specificity exhibited by such enhancers. 
In summary, machine learning combined with evolutionary sequence analysis is useful for recognizing novel TFBSs and for facilitating the identification of cognate TFs that coordinate cell type–specific developmental gene expression patterns.
Mapping inequalities in exclusive breastfeeding in low- and middle-income countries, 2000–2018
Exclusive breastfeeding (EBF), giving infants only breast milk for the first 6 months of life, is a component of optimal breastfeeding practices effective in preventing child morbidity and mortality. EBF practices are known to vary by population, and comparable subnational estimates of prevalence and progress across low- and middle-income countries (LMICs) are required for planning policy and interventions. Here we present a geospatial analysis of EBF prevalence estimates from 2000 to 2018 across 94 LMICs mapped to policy-relevant administrative units (for example, districts), quantify subnational inequalities and their changes over time, and estimate probabilities of meeting the World Health Organization's Global Nutrition Target (WHO GNT) of ≥70% EBF prevalence by 2030. While six LMICs are projected to meet the WHO GNT of ≥70% EBF prevalence at a national scale, only three are predicted to meet the target in all their district-level units by 2030. This work was primarily supported by grant no. OPP1132415 from the Bill & Melinda Gates Foundation. Co-authors employed by the Bill & Melinda Gates Foundation (E.G.P. and R.R.3) provided feedback on initial maps and drafts of this manuscript. L.G.A. has received support from Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, Brasil (CAPES), Código de Financiamento 001 and Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) (grant nos. 404710/2018-2 and 310797/2019-5). O.O.Adetokunboh acknowledges the National Research Foundation, Department of Science and Innovation and South African Centre for Epidemiological Modelling and Analysis. M.Ausloos, A.Pana and C.H. are partially supported by a grant from the Romanian National Authority for Scientific Research and Innovation, CNDS-UEFISCDI, project no. PN-III-P4-ID-PCCF-2016-0084. P.C.B. would like to acknowledge the support of F. Alam and A. Hussain. T.W.B. 
was supported by the Alexander von Humboldt Foundation through the Alexander von Humboldt Professor award, funded by the German Federal Ministry of Education and Research. K.Deribe is supported by the Wellcome Trust (grant no. 201900/Z/16/Z) as part of his international intermediate fellowship. C.H. and A.Pana are partially supported by a grant of the Romanian National Authority for Scientific Research and Innovation, CNDS-UEFISCDI, project no. PN-III-P2-2.1-SOL-2020-2-0351. B.Hwang is partially supported by China Medical University (CMU109-MF-63), Taichung, Taiwan. M.Khan acknowledges Jatiya Kabi Kazi Nazrul Islam University for their support. A.M.K. acknowledges the other collaborators and the corresponding author. Y.K. was supported by the Research Management Centre, Xiamen University Malaysia (grant no. XMUMRF/2020-C6/ITM/0004). K.Krishan is supported by a DST PURSE grant and UGC Centre of Advanced Study (CAS II) awarded to the Department of Anthropology, Panjab University, Chandigarh, India. M.Kumar would like to acknowledge FIC/NIH K43 TW010716-03. I.L. is a member of the Sistema Nacional de Investigación (SNI), which is supported by the Secretaría Nacional de Ciencia, Tecnología e Innovación (SENACYT), Panamá. M.L. was supported by China Medical University, Taiwan (CMU109-N-22 and CMU109-MF-118). W.M. is currently a programme analyst in Population and Development at the United Nations Population Fund (UNFPA) Country Office in Peru, which does not necessarily endorse this study. D.E.N. acknowledges Cochrane South Africa, South African Medical Research Council. G.C.P. is supported by an NHMRC research fellowship. P.Rathi acknowledges support from Kasturba Medical College, Mangalore, Manipal Academy of Higher Education, Manipal, India. Ramu Rawat acknowledges the support of the GBD Secretariat for supporting the reviewing and collaboration of this paper. B.R. 
acknowledges support from Manipal College of Health Professions, Manipal Academy of Higher Education, Manipal. A.Ribeiro was supported by National Funds through FCT, under the programme of ‘Stimulus of Scientific Employment—Individual Support’ within the contract no. info:eu-repo/grantAgreement/FCT/CEEC IND 2018/CEECIND/02386/2018/CP1538/CT0001/PT. S.Sajadi acknowledges colleagues at Global Burden of Diseases and Local Burden of Disease. A.M.S. acknowledges the support from the Egyptian Fulbright Mission Program. F.S. was supported by the Shenzhen Science and Technology Program (grant no. KQTD20190929172835662). A.Sheikh is supported by Health Data Research UK. B.K.S. acknowledges Kasturba Medical College, Mangalore, Manipal Academy of Higher Education, Manipal for all the academic support. B.U. acknowledges support from Manipal Academy of Higher Education, Manipal. C.S.W. is supported by the South African Medical Research Council. Y.Z. was supported by Science and Technology Research Project of Hubei Provincial Department of Education (grant no. Q20201104) and Outstanding Young and Middle-aged Technology Innovation Team Project of Hubei Provincial Department of Education (grant no. T2020003). The funders of the study had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication. All maps presented in this study are generated by the authors and no permissions are required to publish them
ABCC5, a Gene That Influences the Anterior Chamber Depth, Is Associated with Primary Angle Closure Glaucoma
Anterior chamber depth (ACD) is a key anatomical risk factor for primary angle closure glaucoma (PACG). We conducted a genome-wide association study (GWAS) on ACD to discover novel genes for PACG on a total of 5,308 population-based individuals of Asian descent. Genome-wide significant association was observed at a sequence variant within ABCC5 (rs1401999; per-allele effect size = -0.045 mm, P = 8.17×10⁻⁹). This locus was associated with an increase in risk of PACG in a separate case-control study of 4,276 PACG cases and 18,801 controls (per-allele OR = 1.13 [95% CI: 1.06-1.22], P = 0.00046). The association was strengthened when a sub-group of controls with open angles were included in the analysis (per-allele OR = 1.30, P = 7.45×10⁻⁹; 3,458 cases vs. 3,831 controls). Our findings suggest that the increase in PACG risk could in part be mediated by genetic sequence variants influencing anterior chamber dimensions.
Regulatory Architecture of the Neuronal Cacng2/Tarpγ2 Gene Promoter: Multiple Repressive Domains, a Polymorphic Regulatory Short Tandem Repeat, and Bidirectional Organization with Co-regulated lncRNAs
CACNG2 (TARPγ2, Stargazin) is a multi-functional regulator of excitatory neurotransmission and has been implicated in the pathological processes of several brain diseases. Cacng2 function is dependent upon expression level, but currently, little is known about the molecular mechanisms that control expression of this gene. To address this deficit and investigate disease-related gene variants, we have cloned and characterized the rat Cacng2 promoter and have defined three major features: (i) multiple repressive domains that include an array of RE-1 silencing transcription factor (REST) elements, and a calcium regulatory element-binding factor (CaRF) element, (ii) a (poly-GA) short tandem repeat (STR), and (iii) bidirectional organization with expressed lncRNAs. Functional activity of the promoter was demonstrated in transfected neuronal cell lines (HT22 and PC12), but although selective removal of REST and CaRF domains was shown to enhance promoter-driven transcription, the enhanced Cacng2 promoter constructs were still about fivefold weaker than a comparable rat Synapsin-1 promoter sequence. Direct evidence of REST activity at the Cacng2 promoter was obtained through co-transfection with an established dominant-negative REST (DNR) construct. Investigation of the GA-repeat STR revealed polymorphism across both animal strains and species, and size variation was also observed in absence epilepsy disease model cohorts (Genetic Absence Epilepsy Rats, Strasbourg [GAERS] and non-epileptic control [NEC] rats). These data provide evidence of a genotype (STR)-phenotype correlation that may be unique with respect to proximal gene regulatory sequence in the demonstrated absence of other promoter, or 3′ UTR variants in GAERS rats. However, although transcriptional regulatory activity of the STR was demonstrated in further transfection studies, we did not find a GAERS vs. 
NEC difference, indicating that this specific STR length variation may only be relevant in the context of other (Cacna1h and Kcnk9) gene variants in this disease model. Additional studies revealed further (bidirectional) complexity at the Cacng2 promoter, and we identified novel, co-regulated, antisense rat lncRNAs that are paired with Cacng2 mRNA. These studies have provided novel insights into the organization of a synaptic protein gene promoter, describing multiple repressive and modulatory domains that can mediate diverse regulatory inputs.
- …