105 research outputs found

    SynSysNet:integration of experimental data on synaptic protein-protein interactions with drug-target relations

    Get PDF
    We created SynSysNet, available online at http://bioinformatics.charite.de/ synsysnet, to provide a platform that creates a comprehensive 4D network of synaptic interactions. Neuronal synapses are fundamental structures linking nerve cells in the brain and they are responsible for neuronal communication and information processing. These processes are dynamically regulated by a network of proteins. New developments in interaction prote-omics and yeast two-hybrid methods allow unbiased detection of interactors. The consolidation of data from different resources and methods is important to understand the relation to human behaviour and disease and to identify new therapeutic approaches. To this end, we established SynSysNet from a set of ∼1000 synapse specific proteins, their structures and small-molecule interactions. For two-thirds of these, 3D structures are provided (from Protein Data Bank and homology modelling). Drug-target interactions for 750 approved drugs and 50000 compounds, as well as 5000 experimentally validated protein-protein interactions, are included. The resulting interaction network and user-selected parts can be viewed interactively and exported in XGMML. Approximately 200 involved pathways can be explored regarding drug-target interactions. Homology-modelled structures are downloadable in Protein Data Bank format, and drugs are available as MOL-files. Protein-protein interactions and drug-target interactions can be viewed as networks; corresponding PubMed IDs or sources are given. © The Author(s) 2012

    Should We Abandon the t-Test in the Analysis of Gene Expression Microarray Data: A Comparison of Variance Modeling Strategies

    Get PDF
    High-throughput post-genomic studies are now routinely and promisingly investigated in biological and biomedical research. The main statistical approach to select genes differentially expressed between two groups is to apply a t-test, which is subject of criticism in the literature. Numerous alternatives have been developed based on different and innovative variance modeling strategies. However, a critical issue is that selecting a different test usually leads to a different gene list. In this context and given the current tendency to apply the t-test, identifying the most efficient approach in practice remains crucial. To provide elements to answer, we conduct a comparison of eight tests representative of variance modeling strategies in gene expression data: Welch's t-test, ANOVA [1], Wilcoxon's test, SAM [2], RVM [3], limma [4], VarMixt [5] and SMVar [6]. Our comparison process relies on four steps (gene list analysis, simulations, spike-in data and re-sampling) to formulate comprehensive and robust conclusions about test performance, in terms of statistical power, false-positive rate, execution time and ease of use. Our results raise concerns about the ability of some methods to control the expected number of false positives at a desirable level. Besides, two tests (limma and VarMixt) show significant improvement compared to the t-test, in particular to deal with small sample sizes. In addition limma presents several practical advantages, so we advocate its application to analyze gene expression data

    Detection and characterization of 3D-signature phosphorylation site motifs and their contribution towards improved phosphorylation site prediction in proteins

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Phosphorylation of proteins plays a crucial role in the regulation and activation of metabolic and signaling pathways and constitutes an important target for pharmaceutical intervention. Central to the phosphorylation process is the recognition of specific target sites by protein kinases followed by the covalent attachment of phosphate groups to the amino acids serine, threonine, or tyrosine. The experimental identification as well as computational prediction of phosphorylation sites (P-sites) has proved to be a challenging problem. Computational methods have focused primarily on extracting predictive features from the local, one-dimensional sequence information surrounding phosphorylation sites.</p> <p>Results</p> <p>We characterized the spatial context of phosphorylation sites and assessed its usability for improved phosphorylation site predictions. We identified 750 non-redundant, experimentally verified sites with three-dimensional (3D) structural information available in the protein data bank (PDB) and grouped them according to their respective kinase family. We studied the spatial distribution of amino acids around phosphorserines, phosphothreonines, and phosphotyrosines to extract signature 3D-profiles. Characteristic spatial distributions of amino acid residue types around phosphorylation sites were indeed discernable, especially when kinase-family-specific target sites were analyzed. To test the added value of using spatial information for the computational prediction of phosphorylation sites, Support Vector Machines were applied using both sequence as well as structural information. When compared to sequence-only based prediction methods, a small but consistent performance improvement was obtained when the prediction was informed by 3D-context information.</p> <p>Conclusion</p> <p>While local one-dimensional amino acid sequence information was observed to harbor most of the discriminatory power, spatial context information was identified as relevant for the recognition of kinases and their cognate target sites and can be used for an improved prediction of phosphorylation sites. A web-based service (Phos3D) implementing the developed structure-based P-site prediction method has been made available at <url>http://phos3d.mpimp-golm.mpg.de</url>.</p

    A verified genomic reference sample for assessing performance of cancer panels detecting small variants of low allele frequency

    Get PDF
    BackgroundOncopanel genomic testing, which identifies important somatic variants, is increasingly common in medical practice and especially in clinical trials. Currently, there is a paucity of reliable genomic reference samples having a suitably large number of pre-identified variants for properly assessing oncopanel assay analytical quality and performance. The FDA-led Sequencing and Quality Control Phase 2 (SEQC2) consortium analyze ten diverse cancer cell lines individually and their pool, termed Sample A, to develop a reference sample with suitably large numbers of coding positions with known (variant) positives and negatives for properly evaluating oncopanel analytical performance.ResultsIn reference Sample A, we identify more than 40,000 variants down to 1% allele frequency with more than 25,000 variants having less than 20% allele frequency with 1653 variants in COSMIC-related genes. This is 5-100x more than existing commercially available samples. We also identify an unprecedented number of negative positions in coding regions, allowing statistical rigor in assessing limit-of-detection, sensitivity, and precision. Over 300 loci are randomly selected and independently verified via droplet digital PCR with 100% concordance. Agilent normal reference Sample B can be admixed with Sample A to create new samples with a similar number of known variants at much lower allele frequency than what exists in Sample A natively, including known variants having allele frequency of 0.02%, a range suitable for assessing liquid biopsy panels.ConclusionThese new reference samples and their admixtures provide superior capability for performing oncopanel quality control, analytical accuracy, and validation for small to large oncopanels and liquid biopsy assays.Peer reviewe

    Investigating rare pathogenic/likely pathogenic exonic variation in bipolar disorder

    Get PDF
    Bipolar disorder (BD) is a serious mental illness with substantial common variant heritability. However, the role of rare coding variation in BD is not well established. We examined the protein-coding (exonic) sequences of 3,987 unrelated individuals with BD and 5,322 controls of predominantly European ancestry across four cohorts from the Bipolar Sequencing Consortium (BSC). We assessed the burden of rare, protein-altering, single nucleotide variants classified as pathogenic or likely pathogenic (P-LP) both exome-wide and within several groups of genes with phenotypic or biologic plausibility in BD. While we observed an increased burden of rare coding P-LP variants within 165 genes identified as BD GWAS regions in 3,987 BD cases (meta-analysis OR = 1.9, 95% CI = 1.3-2.8, one-sided p = 6.0 × 10-4), this enrichment did not replicate in an additional 9,929 BD cases and 14,018 controls (OR = 0.9, one-side p = 0.70). Although BD shares common variant heritability with schizophrenia, in the BSC sample we did not observe a significant enrichment of P-LP variants in SCZ GWAS genes, in two classes of neuronal synaptic genes (RBFOX2 and FMRP) associated with SCZ or in loss-of-function intolerant genes. In this study, the largest analysis of exonic variation in BD, individuals with BD do not carry a replicable enrichment of rare P-LP variants across the exome or in any of several groups of genes with biologic plausibility. Moreover, despite a strong shared susceptibility between BD and SCZ through common genetic variation, we do not observe an association between BD risk and rare P-LP coding variants in genes known to modulate risk for SCZ

    Analysis of Common and Specific Mechanisms of Liver Function Affected by Nitrotoluene Compounds

    Get PDF
    BACKGROUND: Nitrotoluenes are widely used chemical manufacturing and munitions applications. This group of chemicals has been shown to cause a range of effects from anemia and hypercholesterolemia to testicular atrophy. We have examined the molecular and functional effects of five different, but structurally related, nitrotoluenes on using an integrative systems biology approach to gain insight into common and disparate mechanisms underlying effects caused by these chemicals. METHODOLOGY/PRINCIPAL FINDINGS: Sprague-Dawley female rats were exposed via gavage to one of five concentrations of one of five nitrotoluenes [2,4,6-trinitrotoluene (TNT), 2-amino-4,6-dinitrotoluene (2ADNT) 4-amino-2,6-dinitrotoulene (4ADNT), 2,4-dinitrotoluene (2,4DNT) and 2,6-dinitrotoluene (2,6DNT)] with necropsy and tissue collection at 24 or 48 h. Gene expression profile results correlated well with clinical data and liver histopathology that lead to the concept that hematotoxicity was followed by hepatotoxicity. Overall, 2,4DNT, 2,6DNT and TNT had stronger effects than 2ADNT and 4ADNT. Common functional terms, gene expression patterns, pathways and networks were regulated across all nitrotoluenes. These pathways included NRF2-mediated oxidative stress response, aryl hydrocarbon receptor signaling, LPS/IL-1 mediated inhibition of RXR function, xenobiotic metabolism signaling and metabolism of xenobiotics by cytochrome P450. One biological process common to all compounds, lipid metabolism, was found to be impacted both at the transcriptional and lipid production level. CONCLUSIONS/SIGNIFICANCE: A systems biology strategy was used to identify biochemical pathways affected by five nitroaromatic compounds and to integrate data that tie biochemical alterations to pathological changes. An integrative graphical network model was constructed by combining genomic, gene pathway, lipidomic, and physiological endpoint results to better understand mechanisms of liver toxicity and physiological endpoints affected by these compounds

    Cross-oncopanel study reveals high sensitivity and accuracy with overall analytical performance depending on genomic regions

    Get PDF
    BackgroundTargeted sequencing using oncopanels requires comprehensive assessments of accuracy and detection sensitivity to ensure analytical validity. By employing reference materials characterized by the U.S. Food and Drug Administration-led SEquence Quality Control project phase2 (SEQC2) effort, we perform a cross-platform multi-lab evaluation of eight Pan-Cancer panels to assess best practices for oncopanel sequencing.ResultsAll panels demonstrate high sensitivity across targeted high-confidence coding regions and variant types for the variants previously verified to have variant allele frequency (VAF) in the 5-20% range. Sensitivity is reduced by utilizing VAF thresholds due to inherent variability in VAF measurements. Enforcing a VAF threshold for reporting has a positive impact on reducing false positive calls. Importantly, the false positive rate is found to be significantly higher outside the high-confidence coding regions, resulting in lower reproducibility. Thus, region restriction and VAF thresholds lead to low relative technical variability in estimating promising biomarkers and tumor mutational burden.ConclusionThis comprehensive study provides actionable guidelines for oncopanel sequencing and clear evidence that supports a simplified approach to assess the analytical performance of oncopanels. It will facilitate the rapid implementation, validation, and quality control of oncopanels in clinical use.Peer reviewe

    A Comprehensive Microarray-Based DNA Methylation Study of 367 Hematological Neoplasms

    Get PDF
    Background: Alterations in the DNA methylation pattern are a hallmark of leukemias and lymphomas. However, most epigenetic studies in hematologic neoplasms (HNs) have focused either on the analysis of few candidate genes or many genes and few HN entities, and comprehensive studies are required. Methodology/Principal Findings: Here, we report for the first time a microarray-based DNA methylation study of 767 genes in 367 HNs diagnosed with 16 of the most representative B-cell (n = 203), T-cell (n = 30), and myeloid (n = 134) neoplasias, as well as 37 samples from different cell types of the hematopoietic system. Using appropriate controls of B-, T-, or myeloid cellular origin, we identified a total of 220 genes hypermethylated in at least one HN entity. In general, promoter hypermethylation was more frequent in lymphoid malignancies than in myeloid malignancies, being germinal center mature B-cell lymphomas as well as B and T precursor lymphoid neoplasias those entities with highest frequency of gene-associated DNA hypermethylation. We also observed a significant correlation between the number of hypermethylated and hypomethylated genes in several mature B-cell neoplasias, but not in precursor B- and T-cell leukemias. Most of the genes becoming hypermethylated contained promoters with high CpG content, and a significant fraction of them are targets of the polycomb repressor complex. Interestingly, T-cell prolymphocytic leukemias show low levels of DNA hypermethylation and a comparatively large number of hypomethylated genes, many of them showing an increased gene expression. Conclusions/Significance: We have characterized the DNA methylation profile of a wide range of different HNs entities. As well as identifying genes showing aberrant DNA methylation in certain HN subtypes, we also detected six genes DBC1, DIO3, FZD9, HS3ST2, MOS, and MYOD1 that were significantly hypermethylated in B-cell, T-cell, and myeloid malignancies. These might therefore play an important role in the development of different HNs
    • …
    corecore