58 research outputs found

    Analysis of genomic data to derive biological conclusions on (1) transcriptional regulation in the human genome and (2) antibody resistance in hepatitis C virus

    Full text link
    High­-throughput sequencing has become pervasive in all facets of genomic analysis. I developed computational methods to analyze high­-throughput sequencing data and derive biological conclusions in two research areas -- transcriptional regulation in mammals and evolution of virus under immune pressure. To investigate transcriptional regulation, I integrated data from multiple experiments performed by the ENCODE consortium. First, my analysis revealed that Transcription Factors (TFs) prefer to bind GC-­rich, histone­-depleted regions. By comparing in vivo and in vitro nucleosome dynamics, I observed that while histones have an innate preference for binding GC-­rich DNA, TF binding overrides this preference and produces a negative correlation between GC content and histone enrichment. In the next project, I found that the binding events of multiple TFs co-­occur at genomic regions enriched in activating histone marks that are typically associated with gene enhancers and promoters, suggesting that these regions may be enhancers or have TSS-­distal transcription. Lastly, I used supervised machine ­learning techniques to train histone enrichment signals and sequence features to predict transcriptional enhancers to be validated in mouse-­transgenic assays. In a post­-clinical trial exploratory analysis of Hepatitis C Virus (HCV), I traced the evolutionary path of the envelope proteins E1 and E2 in HCV-infected liver transplant patients, in response to a novel antibody. I developed a systematic amino acid­-level analysis pipeline that quantifies differences in amino acid frequencies in each position between two time points. Upon applying this method across all positions in the E1/E2 region and comparing pre-­liver­-transplant and post­-viral­-rebound time points, mutations in two positions emerged as being key to antibody evasion. Both these mutations--N415K/D and N417S--were in the epitope targeted by the antibody, but surprisingly, did not co­-occur. In post­-rebound viral genomes that contain the N417S mutation but retain the wild-­type variant at 415, N-­linked glycosylation of 415 is another possible escape mechanism. Using the same analysis pipeline, I also identified additional candidate escape mutations outside the epitope, which could be potential therapeutic targets

    Attention and Pooling based Sigmoid Colon Segmentation in 3D CT images

    Full text link
    Segmentation of the sigmoid colon is a crucial aspect of treating diverticulitis. It enables accurate identification and localisation of inflammation, which in turn helps healthcare professionals make informed decisions about the most appropriate treatment options. This research presents a novel deep learning architecture for segmenting the sigmoid colon from Computed Tomography (CT) images using a modified 3D U-Net architecture. Several variations of the 3D U-Net model with modified hyper-parameters were examined in this study. Pyramid pooling (PyP) and channel-spatial Squeeze and Excitation (csSE) were also used to improve the model performance. The networks were trained using manually annotated sigmoid colon. A five-fold cross-validation procedure was used on a test dataset to evaluate the network's performance. As indicated by the maximum Dice similarity coefficient (DSC) of 56.92+/-1.42%, the application of PyP and csSE techniques improves segmentation precision. We explored ensemble methods including averaging, weighted averaging, majority voting, and max ensemble. The results show that average and majority voting approaches with a threshold value of 0.5 and consistent weight distribution among the top three models produced comparable and optimal results with DSC of 88.11+/-3.52%. The results indicate that the application of a modified 3D U-Net architecture is effective for segmenting the sigmoid colon in Computed Tomography (CT) images. In addition, the study highlights the potential benefits of integrating ensemble methods to improve segmentation precision.Comment: 8 Pages, 6 figures, Accepted at IEEE DICTA 202

    Factorbook.org: a Wiki-based database for transcription factor-binding data generated by the ENCODE consortium

    Get PDF
    The Encyclopedia of DNA Elements (ENCODE) consortium aims to identify all functional elements in the human genome including transcripts, transcriptional regulatory regions, along with their chromatin states and DNA methylation patterns. The ENCODE project generates data utilizing a variety of techniques that can enrich for regulatory regions, such as chromatin immunoprecipitation (ChIP), micrococcal nuclease (MNase) digestion and DNase I digestion, followed by deeply sequencing the resulting DNA. As part of the ENCODE project, we have developed a Web-accessible repository accessible at http://factorbook.org. In Wiki format, factorbook is a transcription factor (TF)-centric repository of all ENCODE ChIP-seq datasets on TF-binding regions, as well as the rich analysis results of these data. In the first release, factorbook contains 457 ChIP-seq datasets on 119 TFs in a number of human cell lines, the average profiles of histone modifications and nucleosome positioning around the TF-binding regions, sequence motifs enriched in the regions and the distance and orientation preferences between motif sites

    Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors

    Get PDF
    Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by the ENCODE Consortium. We identified highly enriched sequence motifs in most data sets, revealing new motifs and validating known ones. The motif sites (TF binding sites) are highly conserved evolutionarily and show distinct footprints upon DNase I digestion. We frequently detected secondary motifs in addition to the canonical motifs of the TFs, indicating tethered binding and cobinding between multiple TFs. We observed significant position and orientation preferences between many cobinding TFs. Genes specifically expressed in a cell line are often associated with a greater occurrence of nearby TF binding in that cell line. We observed cell-line-specific secondary motifs that mediate the binding of the histone deacetylase HDAC2 and the enhancer-binding protein EP300. TF binding sites are located in GC-rich, nucleosome-depleted, and DNase I sensitive regions, flanked by well-positioned nucleosomes, and many of these features show cell type specificity. The GC-richness may be beneficial for regulating TF binding because, when unoccupied by a TF, these regions are occupied by nucleosomes in vivo. We present the results of our analysis in a TF-centric web repository Factorbook (http://factorbook.org) and will continually update this repository as more ENCODE data are generated

    Therapeutic targeting of LCK tyrosine kinase and mTOR signaling in T-cell acute lymphoblastic leukemia

    Get PDF
    Relapse and refractory T-cell acute lymphoblastic leukemia (T-ALL) has a poor prognosis, and new combination therapies are sorely needed. Here, we used an ex vivo high-throughput screening platform to identify drug combinations that kill zebrafish T-ALL and then validated top drug combinations for preclinical efficacy in human disease. This work uncovered potent drug synergies between AKT/mTORC1 (mammalian target of rapamycin complex 1) inhibitors and the general tyrosine kinase inhibitor dasatinib. Importantly, these same drug combinations effectively killed a subset of relapse and dexamethasone-resistant zebrafish T-ALL. Clinical trials are currently underway using the combination of mTORC1 inhibitor temsirolimus and dasatinib in other pediatric cancer indications, leading us to prioritize this therapy for preclinical testing. This combination effectively curbed T-ALL growth in human cell lines and primary human T-ALL and was well tolerated and effective in suppressing leukemia growth in patient-derived xenografts (PDX) grown in mice. Mechanistically, dasatinib inhibited phosphorylation and activation of the lymphocyte-specific protein tyrosine kinase (LCK) to blunt the T-cell receptor (TCR) signaling pathway, and when complexed with mTORC1 inhibition, induced potent T-ALL cell killing through reducing MCL-1 protein expression. In total, our work uncovered unexpected roles for the LCK kinase and its regulation of downstream TCR signaling in suppressing apoptosis and driving continued leukemia growth. Analysis of a wide array of primary human T-ALLs and PDXs grown in mice suggest that combination of temsirolimus and dasatinib treatment will be efficacious for a large fraction of human T-ALLs.Peer reviewe

    Mapping development and health effects of cooking with solid fuels in low-income and middle-income countries, 2000-18 : a geospatial modelling study

    Get PDF
    Background More than 3 billion people do not have access to clean energy and primarily use solid fuels to cook. Use of solid fuels generates household air pollution, which was associated with more than 2 million deaths in 2019. Although local patterns in cooking vary systematically, subnational trends in use of solid fuels have yet to be comprehensively analysed. We estimated the prevalence of solid-fuel use with high spatial resolution to explore subnational inequalities, assess local progress, and assess the effects on health in low-income and middle-income countries (LMICs) without universal access to clean fuels.Methods We did a geospatial modelling study to map the prevalence of solid-fuel use for cooking at a 5 km x 5 km resolution in 98 LMICs based on 2.1 million household observations of the primary cooking fuel used from 663 population-based household surveys over the years 2000 to 2018. We use observed temporal patterns to forecast household air pollution in 2030 and to assess the probability of attaining the Sustainable Development Goal (SDG) target indicator for clean cooking. We aligned our estimates of household air pollution to geospatial estimates of ambient air pollution to establish the risk transition occurring in LMICs. Finally, we quantified the effect of residual primary solid-fuel use for cooking on child health by doing a counterfactual risk assessment to estimate the proportion of deaths from lower respiratory tract infections in children younger than 5 years that could be associated with household air pollution.Findings Although primary reliance on solid-fuel use for cooking has declined globally, it remains widespread. 593 million people live in districts where the prevalence of solid-fuel use for cooking exceeds 95%. 66% of people in LMICs live in districts that are not on track to meet the SDG target for universal access to clean energy by 2030. Household air pollution continues to be a major contributor to particulate exposure in LMICs, and rising ambient air pollution is undermining potential gains from reductions in the prevalence of solid-fuel use for cooking in many countries. We estimated that, in 2018, 205000 (95% uncertainty interval 147000-257000) children younger than 5 years died from lower respiratory tract infections that could be attributed to household air pollution.Interpretation Efforts to accelerate the adoption of clean cooking fuels need to be substantially increased and recalibrated to account for subnational inequalities, because there are substantial opportunities to improve air quality and avert child mortality associated with household air pollution. Copyright (C) 2022 The Author(s). Published by Elsevier Ltd.Peer reviewe

    Adolescent transport and unintentional injuries: a systematic analysis using the Global Burden of Disease Study 2019

    Get PDF
    Background: Globally, transport and unintentional injuries persist as leading preventable causes of mortality and morbidity for adolescents. We sought to report comprehensive trends in injury-related mortality and morbidity for adolescents aged 10–24 years during the past three decades. Methods: Using the Global Burden of Disease, Injuries, and Risk Factors 2019 Study, we analysed mortality and disability-adjusted life-years (DALYs) attributed to transport and unintentional injuries for adolescents in 204 countries. Burden is reported in absolute numbers and age-standardised rates per 100 000 population by sex, age group (10–14, 15–19, and 20–24 years), and sociodemographic index (SDI) with 95% uncertainty intervals (UIs). We report percentage changes in deaths and DALYs between 1990 and 2019. Findings: In 2019, 369 061 deaths (of which 214 337 [58%] were transport related) and 31·1 million DALYs (of which 16·2 million [52%] were transport related) among adolescents aged 10–24 years were caused by transport and unintentional injuries combined. If compared with other causes, transport and unintentional injuries combined accounted for 25% of deaths and 14% of DALYs in 2019, and showed little improvement from 1990 when such injuries accounted for 26% of adolescent deaths and 17% of adolescent DALYs. Throughout adolescence, transport and unintentional injury fatality rates increased by age group. The unintentional injury burden was higher among males than females for all injury types, except for injuries related to fire, heat, and hot substances, or to adverse effects of medical treatment. From 1990 to 2019, global mortality rates declined by 34·4% (from 17·5 to 11·5 per 100 000) for transport injuries, and by 47·7% (from 15·9 to 8·3 per 100 000) for unintentional injuries. However, in low-SDI nations the absolute number of deaths increased (by 80·5% to 42 774 for transport injuries and by 39·4% to 31 961 for unintentional injuries). In the high-SDI quintile in 2010–19, the rate per 100 000 of transport injury DALYs was reduced by 16·7%, from 838 in 2010 to 699 in 2019. This was a substantially slower pace of reduction compared with the 48·5% reduction between 1990 and 2010, from 1626 per 100 000 in 1990 to 838 per 100 000 in 2010. Between 2010 and 2019, the rate of unintentional injury DALYs per 100 000 also remained largely unchanged in high-SDI countries (555 in 2010 vs 554 in 2019; 0·2% reduction). The number and rate of adolescent deaths and DALYs owing to environmental heat and cold exposure increased for the high-SDI quintile during 2010–19. Interpretation: As other causes of mortality are addressed, inadequate progress in reducing transport and unintentional injury mortality as a proportion of adolescent deaths becomes apparent. The relative shift in the burden of injury from high-SDI countries to low and low–middle-SDI countries necessitates focused action, including global donor, government, and industry investment in injury prevention. The persisting burden of DALYs related to transport and unintentional injuries indicates a need to prioritise innovative measures for the primary prevention of adolescent injury. Funding: Bill & Melinda Gates Foundation

    Measuring universal health coverage based on an index of effective coverage of health services in 204 countries and territories, 1990–2019 : A systematic analysis for the Global Burden of Disease Study 2019

    Get PDF
    Background Achieving universal health coverage (UHC) involves all people receiving the health services they need, of high quality, without experiencing financial hardship. Making progress towards UHC is a policy priority for both countries and global institutions, as highlighted by the agenda of the UN Sustainable Development Goals (SDGs) and WHO's Thirteenth General Programme of Work (GPW13). Measuring effective coverage at the health-system level is important for understanding whether health services are aligned with countries' health profiles and are of sufficient quality to produce health gains for populations of all ages. Methods Based on the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2019, we assessed UHC effective coverage for 204 countries and territories from 1990 to 2019. Drawing from a measurement framework developed through WHO's GPW13 consultation, we mapped 23 effective coverage indicators to a matrix representing health service types (eg, promotion, prevention, and treatment) and five population-age groups spanning from reproductive and newborn to older adults (≥65 years). Effective coverage indicators were based on intervention coverage or outcome-based measures such as mortality-to-incidence ratios to approximate access to quality care; outcome-based measures were transformed to values on a scale of 0–100 based on the 2·5th and 97·5th percentile of location-year values. We constructed the UHC effective coverage index by weighting each effective coverage indicator relative to its associated potential health gains, as measured by disability-adjusted life-years for each location-year and population-age group. For three tests of validity (content, known-groups, and convergent), UHC effective coverage index performance was generally better than that of other UHC service coverage indices from WHO (ie, the current metric for SDG indicator 3.8.1 on UHC service coverage), the World Bank, and GBD 2017. We quantified frontiers of UHC effective coverage performance on the basis of pooled health spending per capita, representing UHC effective coverage index levels achieved in 2019 relative to country-level government health spending, prepaid private expenditures, and development assistance for health. To assess current trajectories towards the GPW13 UHC billion target—1 billion more people benefiting from UHC by 2023—we estimated additional population equivalents with UHC effective coverage from 2018 to 2023. Findings Globally, performance on the UHC effective coverage index improved from 45·8 (95% uncertainty interval 44·2–47·5) in 1990 to 60·3 (58·7–61·9) in 2019, yet country-level UHC effective coverage in 2019 still spanned from 95 or higher in Japan and Iceland to lower than 25 in Somalia and the Central African Republic. Since 2010, sub-Saharan Africa showed accelerated gains on the UHC effective coverage index (at an average increase of 2·6% [1·9–3·3] per year up to 2019); by contrast, most other GBD super-regions had slowed rates of progress in 2010–2019 relative to 1990–2010. Many countries showed lagging performance on effective coverage indicators for non-communicable diseases relative to those for communicable diseases and maternal and child health, despite non-communicable diseases accounting for a greater proportion of potential health gains in 2019, suggesting that many health systems are not keeping pace with the rising non-communicable disease burden and associated population health needs. In 2019, the UHC effective coverage index was associated with pooled health spending per capita (r=0·79), although countries across the development spectrum had much lower UHC effective coverage than is potentially achievable relative to their health spending. Under maximum efficiency of translating health spending into UHC effective coverage performance, countries would need to reach 1398pooledhealthspendingpercapita(US1398 pooled health spending per capita (US adjusted for purchasing power parity) in order to achieve 80 on the UHC effective coverage index. From 2018 to 2023, an estimated 388·9 million (358·6–421·3) more population equivalents would have UHC effective coverage, falling well short of the GPW13 target of 1 billion more people benefiting from UHC during this time. Current projections point to an estimated 3·1 billion (3·0–3·2) population equivalents still lacking UHC effective coverage in 2023, with nearly a third (968·1 million [903·5–1040·3]) residing in south Asia. Interpretation The present study demonstrates the utility of measuring effective coverage and its role in supporting improved health outcomes for all people—the ultimate goal of UHC and its achievement. Global ambitions to accelerate progress on UHC service coverage are increasingly unlikely unless concerted action on non-communicable diseases occurs and countries can better translate health spending into improved performance. Focusing on effective coverage and accounting for the world's evolving health needs lays the groundwork for better understanding how close—or how far—all populations are in benefiting from UHC
    corecore