30 research outputs found

    botXminer: mining biomedical literature with a new web-based application

    Get PDF
    This paper outlines botXminer, a publicly available application to search XML-formatted MEDLINE(ยฎ) data in a complete, object-relational schema implemented in Oracle(ยฎ) XML DB. An advantage offered by botXminer is that it can generate quantitative results with certain queries that are not feasible through the Entrez-PubMed(ยฎ) interface. After retrieving citations associated with user-supplied search terms, MEDLINE fields (title, abstract, journal, MeSH(ยฎ) and chemical) and terms (MeSH qualifiers and descriptors, keywords, author, gene symbol and chemical), these citations are grouped and displayed as tabulated or graphic results. This work represents an extension of previous research for integrating these citations with relational systems. botXminer has a user-friendly, intuitive interface that can be freely accessed at

    Seeking unique and common biological themes in multiple gene lists or datasets: pathway pattern extraction pipeline for pathway-level comparative analysis

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>One of the challenges in the analysis of microarray data is to integrate and compare the selected (e.g., differential) gene lists from multiple experiments for common or unique underlying biological themes. A common way to approach this problem is to extract common genes from these gene lists and then subject these genes to enrichment analysis to reveal the underlying biology. However, the capacity of this approach is largely restricted by the limited number of common genes shared by datasets from multiple experiments, which could be caused by the complexity of the biological system itself.</p> <p>Results</p> <p>We now introduce a new Pathway Pattern Extraction Pipeline (PPEP), which extends the existing WPS application by providing a new pathway-level comparative analysis scheme. To facilitate comparing and correlating results from different studies and sources, PPEP contains new interfaces that allow evaluation of the pathway-level enrichment patterns across multiple gene lists. As an exploratory tool, this analysis pipeline may help reveal the underlying biological themes at both the pathway and gene levels. The analysis scheme provided by PPEP begins with multiple gene lists, which may be derived from different studies in terms of the biological contexts, applied technologies, or methodologies. These lists are then subjected to pathway-level comparative analysis for extraction of pathway-level patterns. This analysis pipeline helps to explore the commonality or uniqueness of these lists at the level of pathways or biological processes from different but relevant biological systems using a combination of statistical enrichment measurements, pathway-level pattern extraction, and graphical display of the relationships of genes and their associated pathways as Gene-Term Association Networks (GTANs) within the WPS platform. As a proof of concept, we have used the new method to analyze many datasets from our collaborators as well as some public microarray datasets.</p> <p>Conclusion</p> <p>This tool provides a new pathway-level analysis scheme for integrative and comparative analysis of data derived from different but relevant systems. The tool is freely available as a Pathway Pattern Extraction Pipeline implemented in our existing software package WPS, which can be obtained at <url>http://www.abcc.ncifcrf.gov/wps/wps_index.php</url></p

    The somatic autosomal mutation matrix in cancer genomes

    Get PDF
    DNA damage in somatic cells originates from both environmental and endogenous sources, giving rise to mutations through multiple mechanisms. When these mutations affect the function of critical genes, cancer may ensue. Although identifying genomic subsets of mutated genes may inform therapeutic options, a systematic survey of tumor mutational spectra is required to improve our understanding of the underlying mechanisms of mutagenesis involved in cancer etiology. Recent studies have presented genome-wide sets of somatic mutations as a 96-element vector, a procedure that only captures the immediate neighbors of the mutated nucleotide. Herein, we present a 32 ร— 12 mutation matrix that captures the nucleotide pattern two nucleotides upstream and downstream of the mutation. A somatic autosomal mutation matrix (SAMM) was constructed from tumor-specific mutations derived from each of 909 individual cancer genomes harboring a total of 10,681,843 single-base substitutions. In addition, mechanistic template mutation matrices (MTMMs) representing oxidative DNA damage, ultraviolet-induced DNA damage, 5mCpG deamination, and APOBEC-mediated cytosine mutation, are presented. MTMMs were mapped to the individual tumor SAMMs to determine the maximum contribution of each mutational mechanism to the overall mutation pattern. A Manhattan distance across all SAMM elements between any two tumor genomes was used to determine their relative distance. Employing this metric, 89.5 % of all tumor genomes were found to have a nearest neighbor from the same tissue of origin. When a distance-dependent 6-nearest neighbor classifier was used, 86.9 % of all SAMMs were assigned to the correct tissue of origin. Thus, although tumors from different tissues may have similar mutation patterns, their SAMMs often display signatures that are characteristic of specific tissues

    Guanine Holes Are Prominent Targets for Mutation in Cancer and Inherited Disease

    Get PDF
    Albino Bacolla, Guliang Wang, Aklank Jain, Karen M. Vasquez, Division of Pharmacology and Toxicology, The University of Texas at Austin, Dell Pediatric Research Institute, Austin, Texas, United States of AmericaAlbino Bacolla, Nuri A. Temiz, Ming Yi, Joseph Ivanic, Regina Z. Cer, Duncan E. Donohue, Uma S. Mudunuri, Natalia Volfovsky, Brian T. Luke, Robert M., Stephens, Jack R. Collins, Advanced Biomedical Computing Center, SAIC-Frederick, Inc., Frederick National Laboratory for Cancer Research, Frederick, Maryland, United States of AmericaEdward V. Ball, David N. Cooper, Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United KingdomSingle base substitutions constitute the most frequent type of human gene mutation and are a leading cause of cancer and inherited disease. These alterations occur non-randomly in DNA, being strongly influenced by the local nucleotide sequence context. However, the molecular mechanisms underlying such sequence context-dependent mutagenesis are not fully understood. Using bioinformatics, computational and molecular modeling analyses, we have determined the frequencies of mutation at Gโ€ขC bp in the context of all 64 5โ€ฒ-NGNN-3โ€ฒ motifs that contain the mutation at the second position. Twenty-four datasets were employed, comprising >530,000 somatic single base substitutions from 21 cancer genomes, >77,000 germline single-base substitutions causing or associated with human inherited disease and 16.7 million benign germline single-nucleotide variants. In several cancer types, the number of mutated motifs correlated both with the free energies of base stacking and the energies required for abstracting an electron from the target guanines (ionization potentials). Similar correlations were also evident for the pathological missense and nonsense germline mutations, but only when the target guanines were located on the non-transcribed DNA strand. Likewise, pathogenic splicing mutations predominantly affected positions in which a purine was located on the non-transcribed DNA strand. Novel candidate driver mutations and tissue-specific mutational patterns were also identified in the cancer datasets. We conclude that electron transfer reactions within the DNA molecule contribute to sequence context-dependent mutagenesis, involving both somatic driver and passenger mutations in cancer, as well as germline alterations causing or associated with inherited disease.This work was supported by grants from the NIH (CA097175 and CA093729) to KMV, NCI/NIH contract HHSN261200800001E to AB and the Frederick National Laboratory for Cancer Research, and CBIIT/caBIG ISRCE yellow task #09-260 to the Frederick National Laboratory for Cancer Research. DNC and EVB received financial support from BIOBASE GmbH through a license agreement (for HGMD) with Cardiff University. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.PharmacyEmail: [email protected]

    The Zinc-Dependent Protease Activity of the Botulinum Neurotoxins

    Get PDF
    The botulinum neurotoxins (BoNT, serotypes A-G) are some of the most toxic proteins known and are the causative agents of botulism. Following exposure, the neurotoxin binds and enters peripheral cholinergic nerve endings and specifically and selectively cleaves one or more SNARE proteins to produce flaccid paralysis. This review centers on the kinetics of the Zn-dependent proteolytic activities of these neurotoxins, and briefly describes inhibitors, activators and factors underlying persistence of toxin action. Some of the structural, enzymatic and inhibitor data that are discussed here are available at the botulinum neurotoxin resource, BotDB (http://botdb.abcc.ncifcrf.gov)

    Immuno-transcriptomic profiling of extracranial pediatric solid malignancies.

    Get PDF
    We perform an immunogenomics analysis utilizing whole-transcriptome sequencing of 657 pediatric extracranial solid cancer samples representing 14 diagnoses, and additionally utilize transcriptomes of 131 pediatric cancer cell lines and 147 normal tissue samples for comparison. We describe patterns of infiltrating immune cells, Tย cell receptor (TCR) clonal expansion, and translationally relevant immune checkpoints. We find that tumor-infiltrating lymphocytes and TCR counts vary widely across cancer types and within each diagnosis, and notably are significantly predictive of survival in osteosarcoma patients. We identify potential cancer-specific immunotherapeutic targets for adoptive cell therapies including cell-surface proteins, tumor germline antigens, and lineage-specific transcription factors. Using an orthogonal immunopeptidomics approach, we find several potential immunotherapeutic targets in osteosarcoma and Ewing sarcoma and validated PRAME as a bona fide multi-pediatric cancer target. Importantly, this work provides a critical framework for immune targeting of extracranial solid tumors using parallel immuno-transcriptomic and -peptidomic approaches

    PanoromiX: a time-course network medicine platform integrating molecular assays and pathophenotypic data

    No full text
    Abstract Background Network medicine aims to map molecular perturbations of any given diseases onto complex networks with functional interdependencies that underlie a pathological phenotype. Furthermore, investigating the time dimension of disease progression from a network perspective is key to gaining key insights to the disease process and to identify diagnostic or therapeutic targets. Existing platforms are ineffective to modularize the large complex systems into subgroups and consolidate heterogeneous data to web-based interactive animation. Results We have developed PanoromiX platform, a data-agnostic dynamic interactive visualization web application, enables the visualization of outputs from genome based molecular assays onto modular and interactive networks that are correlated with any pathophenotypic data (MRI, Xray, behavioral, etc.) over a time course all in one pane. As a result, PanoromiX reveals the complex organizing principles that orchestrate a disease-pathology from a gene regulatory network (nodes, edges, hubs, etc.) perspective instead of snap shots of assays. Without extensive programming experience, users can design, share, and interpret their dynamic networks through the PanoromiX platform with rich built-in functionalities. Conclusions This emergent tool of network medicine is the first to visualize the interconnectedness of tailored genome assays to pathological networks and phenotypes for cells or organisms in a data-agnostic manner. As an advanced network medicine tool, PanoromiX allows monitoring of panel of biomarker perturbations over the progression of diseases, disease classification based on changing network modules that corresponds to specific patho-phenotype as opposed to clinical symptoms, systematic exploration of complex molecular interactions and distinct disease states via regulatory network changes, and the discovery of novel diagnostic and therapeutic targets
    corecore