10 research outputs found

    Integrated Proteogenomic Characterization across Major Histological Types of Pediatric Brain Cancer

    Get PDF
    We report a comprehensive proteogenomics analysis, including whole-genome sequencing, RNA sequencing, and proteomics and phosphoproteomics profiling, of 218 tumors across 7 histological types of childhood brain cancer: low-grade glioma (n = 93), ependymoma (32), high-grade glioma (25), medulloblastoma (22), ganglioglioma (18), craniopharyngioma (16), and atypical teratoid rhabdoid tumor (12). Proteomics data identify common biological themes that span histological boundaries, suggesting that treatments used for one histological type may be applied effectively to other tumors sharing similar proteomics features. Immune landscape characterization reveals diverse tumor microenvironments across and within diagnoses. Proteomics data further reveal functional effects of somatic mutations and copy number variations (CNVs) not evident in transcriptomics data. Kinase-substrate association and co-expression network analysis identify important biological mechanisms of tumorigenesis. This is the first large-scale proteogenomics analysis across traditional histological boundaries to uncover foundational pediatric brain tumor biology and inform rational treatment selection

    METHODS FOR CLUSTERING TIME SERIES DATA ACQUIRED FROM MOBILE HEALTH APPS

    No full text
    In our recent Asthma Mobile Health Study (AMHS), thousands of asthma patients across the country contributed medical data through the iPhone Asthma Health App on a daily basis for an extended period of time. The collected data included daily self-reported asthma symptoms, symptom triggers, and real time geographic location information. The AMHS is just one of many studies occurring in the context of now many thousands of mobile health apps aimed at improving wellness and better managing chronic disease conditions, leveraging the passive and active collection of data from mobile, handheld smart devices. The ability to identify patient groups or patterns of symptoms that might predict adverse outcomes such as asthma exacerbations or hospitalizations from these types of large, prospectively collected data sets, would be of significant general interest. However, conventional clustering methods cannot be applied to these types of longitudinally collected data, especially survey data actively collected from app users, given heterogeneous patterns of missing values due to: 1) varying survey response rates among different users, 2) varying survey response rates over time of each user, and 3) non-overlapping periods of enrollment among different users. To handle such complicated missing data structure, we proposed a probability imputation model to infer missing data. We also employed a consensus clustering strategy in tandem with the multiple imputation procedure. Through simulation studies under a range of scenarios reflecting real data conditions, we identified favorable performance of the proposed method over other strategies that impute the missing value through low-rank matrix completion. When applying the proposed new method to study asthma triggers and symptoms collected as part of the AMHS, we identified several patient groups with distinct phenotype patterns. Further validation of the methods described in this paper might be used to identify clinically important patterns in large data sets with complicated missing data structure, improving the ability to use such data sets to identify at-risk populations for potential intervention

    Cumulative lifetime maternal stress and epigenome-wide placental DNA methylation in the PRISM cohort

    No full text
    <p>Evolving evidence links maternal stress exposure to changes in placental DNA methylation of specific genes regulating placental function that may have implications for the programming of a host of chronic disorders. Few studies have implemented an epigenome-wide approach. Using the Infinium HumanMethylation450 BeadChip (450K), we investigated epigenome-wide placental DNA methylation in relation to maternal experiences of traumatic and non-traumatic stressors over her lifetime assessed using the Life Stressor Checklist-Revised (LSC-R) survey (n = 207). We found differential DNA methylation at epigenome-wide statistical significance (FDR = 0.05) for 112 CpGs. Additionally, we observed three clusters that exhibited differential methylation in response to high maternal lifetime stress. Enrichment analyses, conducted at an FDR = 0.20, revealed lysine degradation to be the most significant pathway associated with maternal lifetimes stress exposure. Targeted enrichment analyses of the three largest clusters of probes, identified using the gap statistic, were enriched for genes associated with endocytosis (i.e., <i>SMAP1, ANKFY1</i>), tight junctions (i.e., <i>EPB41L4B</i>), and metabolic pathways (i.e., <i>INPP5E, EEF1B2</i>). These pathways, also identified in the top 10 KEGG pathways associated with maternal lifetime stress exposure, play important roles in multiple physiological functions necessary for proper fetal development. Further, two genes were identified to exhibit multiple probes associated with maternal lifetime stress (i.e., <i>ANKFY1, TM6SF1</i>). The methylation status of the probes belonging to each cluster and/or genes exhibiting multiple hits, may play a role in the pathogenesis of adverse health outcomes in children born to mothers with increased lifetime stress exposure.</p

    Pan-cancer proteogenomics connects oncogenic drivers to functional states

    Get PDF
    Cancer driver events refer to key genetic aberrations that drive oncogenesis; however, their exact molecular mechanisms remain insufficiently understood. Here, our multi-omics pan-cancer analysis uncovers insights into the impacts of cancer drivers by identifying their significant cis-effects and distal trans-effects quantified at the RNA, protein, and phosphoprotein levels. Salient observations include the association of point mutations and copy-number alterations with the rewiring of protein interaction networks, and notably, most cancer genes converge toward similar molecular states denoted by sequence-based kinase activity profiles. A correlation between predicted neoantigen burden and measured T cell infiltration suggests potential vulnerabilities for immunotherapies. Patterns of cancer hallmarks vary by polygenic protein abundance ranging from uniform to heterogeneous. Overall, our work demonstrates the value of comprehensive proteogenomics in understanding the functional states of oncogenic drivers and their links to cancer development, surpassing the limitations of studying individual cancer types
    corecore