37,804 research outputs found
Predictive genomics: A cancer hallmark network framework for predicting tumor clinical phenotypes using genome sequencing data
We discuss a cancer hallmark network framework for modelling
genome-sequencing data to predict cancer clonal evolution and associated
clinical phenotypes. Strategies of using this framework in conjunction with
genome sequencing data in an attempt to predict personalized drug targets, drug
resistance, and metastasis for a cancer patient, as well as cancer risks for a
healthy individual are discussed. Accurate prediction of cancer clonal
evolution and clinical phenotypes will have substantial impact on timely
diagnosis, personalized management and prevention of cancer.Comment: 5 figs, related papers, visit lab homepage:
http://www.cancer-systemsbiology.org, Seminar in Cancer Biology, 201
Study of meta-analysis strategies for network inference using information-theoretic approaches
© 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Reverse engineering of gene regulatory networks (GRNs) from gene expression data is a classical challenge in systems biology. Thanks to high-throughput technologies, a massive amount of gene-expression data has been accumulated in the public repositories. Modelling GRNs from multiple experiments (also called integrative analysis) has; therefore, naturally become a standard procedure in modern computational biology. Indeed, such analysis is usually more robust than the traditional approaches focused on individual datasets, which typically suffer from some experimental bias and a small number of samples.
To date, there are mainly two strategies for the problem of interest: the first one (”data merging”) merges all datasets together and then infers a GRN whereas the other (”networks ensemble”) infers GRNs from every dataset separately and then aggregates them using some ensemble rules (such as ranksum or weightsum). Unfortunately, a thorough comparison of these two approaches is lacking.
In this paper, we evaluate the performances of various metaanalysis approaches mentioned above with a systematic set of experiments based on in silico benchmarks. Furthermore, we present a new meta-analysis approach for inferring GRNs from multiple studies. Our proposed approach, adapted to methods based on pairwise measures such as correlation or mutual information, consists of two steps: aggregating matrices of the pairwise measures from every dataset followed by extracting the network from the meta-matrix.Peer ReviewedPostprint (author's final draft
MicroRNA and transcription factor co-regulatory networks and subtype classification of seminoma and non-seminoma in testicular germ cell tumors
Recent studies have revealed that feed-forward loops (FFLs) as regulatory motifs have synergistic roles in cellular systems and their disruption may cause diseases including cancer. FFLs may include two regulators such as transcription factors (TFs) and microRNAs (miRNAs). In this study, we extensively investigated TF and miRNA regulation pairs, their FFLs, and TF-miRNA mediated regulatory networks in two major types of testicular germ cell tumors (TGCT): seminoma (SE) and non-seminoma (NSE). Specifically, we identified differentially expressed mRNA genes and miRNAs in 103 tumors using the transcriptomic data from The Cancer Genome Atlas. Next, we determined significantly correlated TF-gene/miRNA and miRNA-gene/TF pairs with regulation direction. Subsequently, we determined 288 and 664 dysregulated TF-miRNA-gene FFLs in SE and NSE, respectively. By constructing dysregulated FFL networks, we found that many hub nodes (12 out of 30 for SE and 8 out of 32 for NSE) in the top ranked FFLs could predict subtype-classification (Random Forest classifier, average accuracy ≥90%). These hub molecules were validated by an independent dataset. Our network analysis pinpointed several SE-specific dysregulated miRNAs (miR-200c-3p, miR-25-3p, and miR-302a-3p) and genes (EPHA2, JUN, KLF4, PLXDC2, RND3, SPI1, and TIMP3) and NSE-specific dysregulated miRNAs (miR-367-3p, miR-519d-3p, and miR-96-5p) and genes (NR2F1 and NR2F2). This study is the first systematic investigation of TF and miRNA regulation and their co-regulation in two major TGCT subtypes
Recommended from our members
ZNF750 is a lineage-specific tumour suppressor in squamous cell carcinoma.
ZNF750 controls epithelial homeostasis by regulating epidermal-differentiation genes, a role underscored by its pathogenic mutations in esophageal squamous cell cancers (SCCs). However, the precise role of ZNF750 in SCC cell biology remains unclear. In this study, we report that ZNF750 is exclusively deleted, mutated and underexpressed in human SCCs, and low ZNF750 expression is associated with poor survival. Restoration of wildtype, but not mutant ZNF750 protein uniquely inhibited the malignant phenotypes of SCC cells both in vitro and in vivo. Notably, ZNF750 promoted the expression of a long non-coding RNA (TINCR), which mediated both cancer-inhibition and differentiation-induction effects of ZNF750. In addition, ZNF750 potently suppressed cell migration by directly inhibiting the transactivation of LAMC2. Together, our findings characterize ZNF750 as a crucial SCC-specific suppressor and uncover its novel anticancer-associated functions
Defining a robust biological prior from Pathway Analysis to drive Network Inference
Inferring genetic networks from gene expression data is one of the most
challenging work in the post-genomic era, partly due to the vast space of
possible networks and the relatively small amount of data available. In this
field, Gaussian Graphical Model (GGM) provides a convenient framework for the
discovery of biological networks. In this paper, we propose an original
approach for inferring gene regulation networks using a robust biological prior
on their structure in order to limit the set of candidate networks.
Pathways, that represent biological knowledge on the regulatory networks,
will be used as an informative prior knowledge to drive Network Inference. This
approach is based on the selection of a relevant set of genes, called the
"molecular signature", associated with a condition of interest (for instance,
the genes involved in disease development). In this context, differential
expression analysis is a well established strategy. However outcome signatures
are often not consistent and show little overlap between studies. Thus, we will
dedicate the first part of our work to the improvement of the standard process
of biomarker identification to guarantee the robustness and reproducibility of
the molecular signature.
Our approach enables to compare the networks inferred between two conditions
of interest (for instance case and control networks) and help along the
biological interpretation of results. Thus it allows to identify differential
regulations that occur in these conditions. We illustrate the proposed approach
by applying our method to a study of breast cancer's response to treatment
A model for gene deregulation detection using expression data
In tumoral cells, gene regulation mechanisms are severely altered, and these
modifications in the regulations may be characteristic of different subtypes of
cancer. However, these alterations do not necessarily induce differential
expressions between the subtypes. To answer this question, we propose a
statistical methodology to identify the misregulated genes given a reference
network and gene expression data. Our model is based on a regulatory process in
which all genes are allowed to be deregulated. We derive an EM algorithm where
the hidden variables correspond to the status (under/over/normally expressed)
of the genes and where the E-step is solved thanks to a message passing
algorithm. Our procedure provides posterior probabilities of deregulation in a
given sample for each gene. We assess the performance of our method by
numerical experiments on simulations and on a bladder cancer data set
Proteomic analyses reveal distinct chromatin-associated and soluble transcription factor complexes.
The current knowledge on how transcription factors (TFs), the ultimate targets and executors of cellular signalling pathways, are regulated by protein-protein interactions remains limited. Here, we performed proteomics analyses of soluble and chromatin-associated complexes of 56 TFs, including the targets of many signalling pathways involved in development and cancer, and 37 members of the Forkhead box (FOX) TF family. Using tandem affinity purification followed by mass spectrometry (TAP/MS), we performed 214 purifications and identified 2,156 high-confident protein-protein interactions. We found that most TFs form very distinct protein complexes on and off chromatin. Using this data set, we categorized the transcription-related or unrelated regulators for general or specific TFs. Our study offers a valuable resource of protein-protein interaction networks for a large number of TFs and underscores the general principle that TFs form distinct location-specific protein complexes that are associated with the different regulation and diverse functions of these TFs
- …