730 research outputs found

    Gene Co-expression Network and Copy Number Variation Analyses Identify Transcription Factors Associated With Multiple Myeloma Progression

    Get PDF
    Multiple myeloma (MM) has two clinical precursor stages of disease: monoclonal gammopathy of undetermined significance (MGUS) and smoldering multiple myeloma (SMM). However, the mechanism of progression is not well understood. Because gene co-expression network analysis is a well-known method for discovering new gene functions and regulatory relationships, we utilized this framework to conduct differential co-expression analysis to identify interesting transcription factors (TFs) in two publicly available datasets. We then used copy number variation (CNV) data from a third public dataset to validate these TFs. First, we identified co-expressed gene modules in two publicly available datasets each containing three conditions: normal, MGUS, and SMM. These modules were assessed for condition-specific gene expression, and then enrichment analysis was conducted on condition-specific modules to identify their biological function and upstream TFs. TFs were assessed for differential gene expression between normal and MM precursors, then validated with CNV analysis to identify candidate genes. Functional enrichment analysis reaffirmed known functional categories in MM pathology, the main one relating to immune function. Enrichment analysis revealed a handful of differentially expressed TFs between normal and either MGUS or SMM in gene expression and/or CNV. Overall, we identified four genes of interest (MAX, TCF4, ZNF148, and ZNF281) that aid in our understanding of MM initiation and progression

    Changes in energy content of lunchtime purchases from fast food restaurants after introduction of calorie labelling: cross sectional customer surveys

    Get PDF
    Objective To assess the impact of fast food restaurants adding calorie labelling to menu items on the energy content of individual purchases

    Novel components of the Toxoplasma inner membrane complex revealed by BioID.

    Get PDF
    UNLABELLED:The inner membrane complex (IMC) of Toxoplasma gondii is a peripheral membrane system that is composed of flattened alveolar sacs that underlie the plasma membrane, coupled to a supporting cytoskeletal network. The IMC plays important roles in parasite replication, motility, and host cell invasion. Despite these central roles in the biology of the parasite, the proteins that constitute the IMC are largely unknown. In this study, we have adapted a technique named proximity-dependent biotin identification (BioID) for use in T. gondii to identify novel components of the IMC. Using IMC proteins in both the alveoli and the cytoskeletal network as bait, we have uncovered a total of 19 new IMC proteins in both of these suborganellar compartments, two of which we functionally evaluate by gene knockout. Importantly, labeling of IMC proteins using this approach has revealed a group of proteins that localize to the sutures of the alveolar sacs that have been seen in their entirety in Toxoplasma species only by freeze fracture electron microscopy. Collectively, our study greatly expands the repertoire of known proteins in the IMC and experimentally validates BioID as a strategy for discovering novel constituents of specific cellular compartments of T. gondii. IMPORTANCE:The identification of binding partners is critical for determining protein function within cellular compartments. However, discovery of protein-protein interactions within membrane or cytoskeletal compartments is challenging, particularly for transient or unstable interactions that are often disrupted by experimental manipulation of these compartments. To circumvent these problems, we adapted an in vivo biotinylation technique called BioID for Toxoplasma species to identify binding partners and proximal proteins within native cellular environments. We used BioID to identify 19 novel proteins in the parasite IMC, an organelle consisting of fused membrane sacs and an underlying cytoskeleton, whose protein composition is largely unknown. We also demonstrate the power of BioID for targeted discovery of proteins within specific compartments, such as the IMC cytoskeleton. In addition, we uncovered a new group of proteins localizing to the alveolar sutures of the IMC. BioID promises to reveal new insights on protein constituents and interactions within cellular compartments of Toxoplasma

    Gene Co-expression Network and Copy Number Variation Analyses Identify Transcription Factors Associated With Multiple Myeloma Progression

    Get PDF
    Multiple myeloma (MM) has two clinical precursor stages of disease: monoclonal gammopathy of undetermined significance (MGUS) and smoldering multiple myeloma (SMM). However, the mechanism of progression is not well understood. Because gene co-expression network analysis is a well-known method for discovering new gene functions and regulatory relationships, we utilized this framework to conduct differential co-expression analysis to identify interesting transcription factors (TFs) in two publicly available datasets. We then used copy number variation (CNV) data from a third public dataset to validate these TFs. First, we identified co-expressed gene modules in two publicly available datasets each containing three conditions: normal, MGUS, and SMM. These modules were assessed for condition-specific gene expression, and then enrichment analysis was conducted on condition-specific modules to identify their biological function and upstream TFs. TFs were assessed for differential gene expression between normal and MM precursors, then validated with CNV analysis to identify candidate genes. Functional enrichment analysis reaffirmed known functional categories in MM pathology, the main one relating to immune function. Enrichment analysis revealed a handful of differentially expressed TFs between normal and either MGUS or SMM in gene expression and/or CNV. Overall, we identified four genes of interest (MAX, TCF4, ZNF148, and ZNF281) that aid in our understanding of MM initiation and progression

    Deep learning-based cancer survival prognosis from RNA-seq data: approaches and evaluations

    Get PDF
    Background: Recent advances in kernel-based Deep Learning models have introduced a new era in medical research. Originally designed for pattern recognition and image processing, Deep Learning models are now applied to survival prognosis of cancer patients. Specifically, Deep Learning versions of the Cox proportional hazards models are trained with transcriptomic data to predict survival outcomes in cancer patients. Methods: In this study, a broad analysis was performed on TCGA cancers using a variety of Deep Learning-based models, including Cox-nnet, DeepSurv, and a method proposed by our group named AECOX (AutoEncoder with Cox regression network). Concordance index and p-value of the log-rank test are used to evaluate the model performances. Results: All models show competitive results across 12 cancer types. The last hidden layers of the Deep Learning approaches are lower dimensional representations of the input data that can be used for feature reduction and visualization. Furthermore, the prognosis performances reveal a negative correlation between model accuracy, overall survival time statistics, and tumor mutation burden (TMB), suggesting an association among overall survival time, TMB, and prognosis prediction accuracy. Conclusions: Deep Learning based algorithms demonstrate superior performances than traditional machine learning based models. The cancer prognosis results measured in concordance index are indistinguishable across models while are highly variable across cancers. These findings shedding some light into the relationships between patient characteristics and survival learnability on a pan-cancer level

    SALMON: Survival Analysis Learning With Multi-Omics Neural Networks on Breast Cancer

    Get PDF
    Improved cancer prognosis is a central goal for precision health medicine. Though many models can predict differential survival from data, there is a strong need for sophisticated algorithms that can aggregate and filter relevant predictors from increasingly complex data inputs. In turn, these models should provide deeper insight into which types of data are most relevant to improve prognosis. Deep Learning-based neural networks offer a potential solution for both problems because they are highly flexible and account for data complexity in a non-linear fashion. In this study, we implement Deep Learning-based networks to determine how gene expression data predicts Cox regression survival in breast cancer. We accomplish this through an algorithm called SALMON (Survival Analysis Learning with Multi-Omics Neural Networks), which aggregates and simplifies gene expression data and cancer biomarkers to enable prognosis prediction. The results revealed improved performance when more omics data were used in model construction. Rather than use raw gene expression values as model inputs, we innovatively use eigengene modules from the result of gene co-expression network analysis. The corresponding high impact co-expression modules and other omics data are identified by feature selection technique, then examined by conducting enrichment analysis and exploiting biological functions, escalated the interpretation of input feature from gene level to co-expression modules level. Our study shows the feasibility of discovering breast cancer related co-expression modules, sketch a blueprint of future endeavors on Deep Learning-based survival analysis. SALMON source code is available at https://github.com/huangzhii/SALMON/

    TPQCI: A topology potential-based method to quantify functional influence of copy number variations

    Get PDF
    Copy number variation (CNV) is a major type of chromosomal structural variation that play important roles in many diseases including cancers. Due to genome instability, a large number of CNV events can be detected in diseases such as cancer. Therefore, it is important to identify the functionally important CNVs in diseases, which currently still poses a challenge in genomics. One of the critical steps to solve the problem is to define the influence of CNV. In this paper, we provide a topology potential based method, TPQCI, to quantify this kind of influence by integrating statistics, gene regulatory associations, and biological function information. We used this metric to detect functionally enriched genes on genomic segments with CNV in breast cancer and multiple myeloma and discovered biological functions influenced by CNV. Our results demonstrate that, by using our proposed TPQCI metric, we can detect disease-specific genes that are influenced by CNVs. Source codes of TPQCI are provided in Github (https://github.com/usos/TPQCI)

    Single cells from human primary colorectal tumors exhibit polyfunctional heterogeneity in secretions of ELR+ CXC chemokines

    Get PDF
    Cancer is an inflammatory disease of tissue that is largely influenced by the interactions between multiple cell types, secreted factors, and signal transduction pathways. While single-cell sequencing continues to refine our understanding of the clonotypic heterogeneity within tumors, the complex interplay between genetic variations and non-genetic factors ultimately affects therapeutic outcome. Much has been learned through bulk studies of secreted factors in the tumor microenvironment, but the secretory behavior of single cells has been largely uncharacterized. Here we directly profiled the secretions of ELR+ CXC chemokines from thousands of single colorectal tumor and stromal cells, using an array of subnanoliter wells and a technique called microengraving to characterize both the rates of secretion of several factors at once and the numbers of cells secreting each chemokine. The ELR+ CXC chemokines are highly redundant, pro-angiogenic cytokines that signal via the CXCR1 and CXCR2 receptors, influencing tumor growth and progression. We find that human primary colorectal tumor and stromal cells exhibit polyfunctional heterogeneity in the combinations and magnitudes of secretions for these chemokines. In cell lines, we observe similar variance: phenotypes observed in bulk can be largely absent among the majority of single cells, and discordances exist between secretory states measured and gene expression for these chemokines among single cells. Together, these measures suggest secretory states among tumor cells are complex and can evolve dynamically. Most importantly, this study reveals new insight into the intratumoral phenotypic heterogeneity of human primary tumors.Janssen Pharmaceutical Ltd.National Cancer Institute (U.S.) (Cancer Center Support (Core) Grant P30-CA14051)National Science Foundation (U.S.). Graduate Research FellowshipSingapore. Agency for Science, Technology and ResearchSwiss National Science Foundation (Fellowship for Advanced Researchers PA00P3 139659

    A pan-kidney cancer study identifies subtype specific perturbations on pathways with potential drivers in renal cell carcinoma

    Get PDF
    Background: Renal cell carcinoma (RCC) is a complex disease and is comprised of several histological subtypes, the most frequent of which are clear cell renal cell carcinoma (ccRCC), papillary renal cell carcinoma (PRCC) and chromophobe renal cell carcinoma (ChRCC). While lots of studies have been performed to investigate the molecular characterizations of different subtypes of RCC, our knowledge regarding the underlying mechanisms are still incomplete. As molecular alterations are eventually reflected on the pathway level to execute certain biological functions, characterizing the pathway perturbations is crucial for understanding tumorigenesis and development of RCC. Methods: In this study, we investigated the pathway perturbations of various RCC subtype against normal tissue based on differential expressed genes within a certain pathway. We explored the potential upstream regulators of subtype-specific pathways with Ingenuity Pathway Analysis (IPA). We also evaluated the relationships between subtype-specific pathways and clinical outcome with survival analysis. Results: In this study, we carried out a pathway-based analysis to explore the mechanisms of various RCC subtypes with TCGA RNA-seq data. Both commonly altered pathways and subtype-specific pathways were detected. To identify the distinctive characteristics of each subtype, we focused on subtype-specific perturbed pathways. Specifically, we observed that some of the altered pathways were regulated by several recurrent upstream regulators which presenting different expression patterns among distinct RCC subtypes. We also noticed that a large number of perturbed pathways were controlled by the subtype-specific upstream regulators. Moreover, we also evaluated the relationships between perturbed pathways and clinical outcome. Prognostic pathways were identified and their roles in tumor development and progression were inferred. Conclusions: In summary, we evaluated the relationships among pathway perturbations, upstream regulators and clinical outcome for differential subtypes in RCC. We hypothesized that the alterations of common upstream regulators as well as subtype-specific upstream regulators work together to affect the downstream pathway perturbations and drive cancer initialization and prognosis. Our findings not only increase our understanding of the mechanisms of various RCC subtypes, but also provide targets for personalized therapeutic intervention

    TPSC: a module detection method based on topology potential and spectral clustering in weighted networks and its application in gene co-expression module discovery

    Get PDF
    Background: Gene co-expression networks are widely studied in the biomedical field, with algorithms such as WGCNA and lmQCM having been developed to detect co-expressed modules. However, these algorithms have limitations such as insufficient granularity and unbalanced module size, which prevent full acquisition of knowledge from data mining. In addition, it is difficult to incorporate prior knowledge in current co-expression module detection algorithms. Results: In this paper, we propose a novel module detection algorithm based on topology potential and spectral clustering algorithm to detect co-expressed modules in gene co-expression networks. By testing on TCGA data, our novel method can provide more complete coverage of genes, more balanced module size and finer granularity than current methods in detecting modules with significant overall survival difference. In addition, the proposed algorithm can identify modules by incorporating prior knowledge. Conclusion: In summary, we developed a method to obtain as much as possible information from networks with increased input coverage and the ability to detect more size-balanced and granular modules. In addition, our method can integrate data from different sources. Our proposed method performs better than current methods with complete coverage of input genes and finer granularity. Moreover, this method is designed not only for gene co-expression networks but can also be applied to any general fully connected weighted network
    • …
    corecore