312 research outputs found

    Identifying Dynamic Protein Complexes Based on Gene Expression Profiles and PPI Networks

    Get PDF
    Identification of protein complexes fromprotein-protein interaction networks has become a key problem for understanding cellular life in postgenomic era. Many computational methods have been proposed for identifying protein complexes. Up to now, the existing computational methods are mostly applied on static PPI networks. However, proteins and their interactions are dynamic in reality. Identifying dynamic protein complexes is more meaningful and challenging. In this paper, a novel algorithm, named DPC, is proposed to identify dynamic protein complexes by integrating PPI data and gene expression profiles. According to Core-Attachment assumption, these proteins which are always active in the molecular cycle are regarded as core proteins. The protein-complex cores are identified from these always active proteins by detecting dense subgraphs. Final protein complexes are extended from the protein-complex cores by adding attachments based on a topological character of “closeness” and dynamic meaning. The protein complexes produced by our algorithm DPC contain two parts: static core expressed in all the molecular cycle and dynamic attachments short-lived.The proposed algorithm DPC was applied on the data of Saccharomyces cerevisiae and the experimental results show that DPC outperforms CMC, MCL, SPICi, HC-PIN, COACH, and Core-Attachment based on the validation of matching with known complexes and hF-measures

    Computational analysis on the effects of variations in T and B cells. Primary immunodeficiencies and cancer neoepitopes

    Get PDF
    Computational approaches are essential to study the effects of inborn and somatic variations. Results from such studies contribute to better diagnosis and therapies. Primary immunodeficiencies (PIDs) are rare inborn defects of key immune response genes. Somatic variations are main drivers of most cancers. Large and diverse data on PID genes and proteins can enable systems biology studies on their dynamic effects on T and B cells. Amino acid substitutions (AASs) are somatic variations that drive cancers. However, AASs also cause cancer-associated antigens that are recognized by lymphocytes as non-self, and are called neoantigens. Detail analysis these neoantigens can be performed due to the availability of cancer data from many consortia.The purpose of this thesis was to investigate the effects of PIDs on T and B cells and to explore features of neoepitopes in cancers. The object of the first study was to detect the central T cell-specific protein network. The purpose of the second and third studies were to reconstruct the T and B cell network model and simulate the dynamic effects of PID perturbations. The aim of the fourth study was to characterize neoepitopes from pan-cancer datasets.The immunome interactome was reconstructed, and the links weighed with gene expression correlation of integrated, time series data (Paper I). The significance of the weighted links were computed with the Global Statistical Significance (GloSS) method, and the weighted interactome network was filtered to obtain the central T cell network. Next, the T cell network model was reconstructed from literature mining and the core T cell protein interaction network (Paper II). The B cell network model was reconstructed by mining the literature for central B cell interactions (Paper III). The normalized HillCube software was used to study the dynamic effects of PID perturbations in T and B cells. Proteome-wide amino AASs on putatively derived 8-, 9-, 10-, and 11-mer neoepitopes in 30 cancer types were analyzed with the NetMHC 4.0 software (Paper IV).The interconnectedness of the major T cell pathways are maintained in the central T cell PPI network. Empirical evidence from Gene Ontology term and essential genes enrichment analyses were in support for the central T cell network. In the T and B cell simulations for several knockout PIDs correspond to previous results. In the T cell model, simulations for TCR, PTPRC, LCK, ZAP70 and ITK indicated profound disruption in network dynamics. BCL10, CARD11, MALT1, NEMO and MAP3K14 simulations showed significant effects. In B cell, the simulations for LYN, BTK, STIM1, ORAI1, CD19, CD21 and CD81 indicated profound changes to many proteins in the network. Severe effects were observed in the BCL10, IKKB, knockout CARD11, MALT1, NEMO, IKKB and WIPF1 simulations. No major effects were observed for constitutively active PID proteins. The most likely epitopes are those which are detected by several macromolecular histocompartibility complexes (MHCs) and of several peptide lengths. 0.17% of all variants yield more than 100 neoepitopes. Amino acid distributions indicate that variants at all positions in neoepitopes of any length are, on average, more hydrophobic compared to the wild-type.The core T cell network approach is general and applicable to any system with adequate data. The T and B cell models enable the understanding of the dynamic effects of PID disease processes and reveals several novel proteins that may be of interest when diagnosing and treating immunological defects. The neoepitope characteristics can be employed for targeted cancer vaccine applications in personalized therapies
    • …
    corecore