38 research outputs found

    Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes

    Get PDF
    Although the capability of DNA to form a variety of non-canonical (non-B) structures has long been recognized, the overall significance of these alternate conformations in biology has only recently become accepted en masse. In order to provide access to genome-wide locations of these classes of predicted structures, we have developed non-B DB, a database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides the most complete list of alternative DNA structure predictions available, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database has been built using the latest releases of the human, chimp, dog, macaque and mouse genomes, so that the results can be compared directly with other data sources. In order to make the data interpretable in a genomic context, features such as genes, single-nucleotide polymorphisms and repetitive elements (SINE, LINE, etc.) have also been incorporated. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer. It is freely accessible at http://nonb.abcc.ncifcrf.gov

    Immuno-transcriptomic profiling of extracranial pediatric solid malignancies.

    Full text link
    We perform an immunogenomics analysis utilizing whole-transcriptome sequencing of 657 pediatric extracranial solid cancer samples representing 14 diagnoses, and additionally utilize transcriptomes of 131 pediatric cancer cell lines and 147 normal tissue samples for comparison. We describe patterns of infiltrating immune cells, T cell receptor (TCR) clonal expansion, and translationally relevant immune checkpoints. We find that tumor-infiltrating lymphocytes and TCR counts vary widely across cancer types and within each diagnosis, and notably are significantly predictive of survival in osteosarcoma patients. We identify potential cancer-specific immunotherapeutic targets for adoptive cell therapies including cell-surface proteins, tumor germline antigens, and lineage-specific transcription factors. Using an orthogonal immunopeptidomics approach, we find several potential immunotherapeutic targets in osteosarcoma and Ewing sarcoma and validated PRAME as a bona fide multi-pediatric cancer target. Importantly, this work provides a critical framework for immune targeting of extracranial solid tumors using parallel immuno-transcriptomic and -peptidomic approaches

    pubmed2ensembl: A Resource for Mining the Biological Literature on Genes

    Get PDF
    The last two decades have witnessed a dramatic acceleration in the production of genomic sequence information and publication of biomedical articles. Despite the fact that genome sequence data and publications are two of the most heavily relied-upon sources of information for many biologists, very little effort has been made to systematically integrate data from genomic sequences directly with the biological literature. For a limited number of model organisms dedicated teams manually curate publications about genes; however for species with no such dedicated staff many thousands of articles are never mapped to genes or genomic regions.To overcome the lack of integration between genomic data and biological literature, we have developed pubmed2ensembl (http://www.pubmed2ensembl.org), an extension to the BioMart system that links over 2,000,000 articles in PubMed to nearly 150,000 genes in Ensembl from 50 species. We use several sources of curated (e.g., Entrez Gene) and automatically generated (e.g., gene names extracted through text-mining on MEDLINE records) sources of gene-publication links, allowing users to filter and combine different data sources to suit their individual needs for information extraction and biological discovery. In addition to extending the Ensembl BioMart database to include published information on genes, we also implemented a scripting language for automated BioMart construction and a novel BioMart interface that allows text-based queries to be performed against PubMed and PubMed Central documents in conjunction with constraints on genomic features. Finally, we illustrate the potential of pubmed2ensembl through typical use cases that involve integrated queries across the biomedical literature and genomic data.By allowing biologists to find the relevant literature on specific genomic regions or sets of functionally related genes more easily, pubmed2ensembl offers a much-needed genome informatics inspired solution to accessing the ever-increasing biomedical literature

    The Effect of Iron Limitation on the Transcriptome and Proteome of Pseudomonas fluorescens Pf-5

    Get PDF
    One of the most important micronutrients for bacterial growth is iron, whose bioavailability in soil is limited. Consequently, rhizospheric bacteria such as Pseudomonas fluorescens employ a range of mechanisms to acquire or compete for iron. We investigated the transcriptomic and proteomic effects of iron limitation on P. fluorescens Pf-5 by employing microarray and iTRAQ techniques, respectively. Analysis of this data revealed that genes encoding functions related to iron homeostasis, including pyoverdine and enantio-pyochelin biosynthesis, a number of TonB-dependent receptor systems, as well as some inner-membrane transporters, were significantly up-regulated in response to iron limitation. Transcription of a ribosomal protein L36-encoding gene was also highly up-regulated during iron limitation. Certain genes or proteins involved in biosynthesis of secondary metabolites such as 2,4-diacetylphloroglucinol (DAPG), orfamide A and pyrrolnitrin, as well as a chitinase, were over-expressed under iron-limited conditions. In contrast, we observed that expression of genes involved in hydrogen cyanide production and flagellar biosynthesis were down-regulated in an iron-depleted culture medium. Phenotypic tests revealed that Pf-5 had reduced swarming motility on semi-solid agar in response to iron limitation. Comparison of the transcriptomic data with the proteomic data suggested that iron acquisition is regulated at both the transcriptional and post-transcriptional levels

    Protein Co-Expression Analysis as a Strategy to Complement a Standard Quantitative Proteomics Approach:Case of a Glioblastoma Multiforme Study

    Get PDF
    Although correlation network studies from co-expression analysis are increasingly popular, they are rarely applied to proteomics datasets. Protein co-expression analysis provides a complementary view of underlying trends, which can be overlooked by conventional data analysis. The core of the present study is based on Weighted Gene Co-expression Network Analysis applied to a glioblastoma multiforme proteomic dataset. Using this method, we have identified three main modules which are associated with three different membrane associated groups; mitochondrial, endoplasmic reticulum, and a vesicle fraction. The three networks based on protein co-expression were assessed against a publicly available database (STRING) and show a statistically significant overlap. Each of the three main modules were de-clustered into smaller networks using different strategies based on the identification of highly connected networks, hierarchical clustering and enrichment of Gene Ontology functional terms. Most of the highly connected proteins found in the endoplasmic reticulum module were associated with redox activity while a core of the unfolded protein response was identified in addition to proteins involved in oxidative stress pathways. The proteins composing the electron transfer chain were found differently affected with proteins from mitochondrial Complex I being more down-regulated than proteins from Complex III. Finally, the two pyruvate kinases isoforms show major differences in their co-expressed protein networks suggesting roles in different cellular locations

    Metabolic Turnover of Synaptic Proteins: Kinetics, Interdependencies and Implications for Synaptic Maintenance

    Get PDF
    Chemical synapses contain multitudes of proteins, which in common with all proteins, have finite lifetimes and therefore need to be continuously replaced. Given the huge numbers of synaptic connections typical neurons form, the demand to maintain the protein contents of these connections might be expected to place considerable metabolic demands on each neuron. Moreover, synaptic proteostasis might differ according to distance from global protein synthesis sites, the availability of distributed protein synthesis facilities, trafficking rates and synaptic protein dynamics. To date, the turnover kinetics of synaptic proteins have not been studied or analyzed systematically, and thus metabolic demands or the aforementioned relationships remain largely unknown. In the current study we used dynamic Stable Isotope Labeling with Amino acids in Cell culture (SILAC), mass spectrometry (MS), Fluorescent Non-Canonical Amino acid Tagging (FUNCAT), quantitative immunohistochemistry and bioinformatics to systematically measure the metabolic half-lives of hundreds of synaptic proteins, examine how these depend on their pre/postsynaptic affiliation or their association with particular molecular complexes, and assess the metabolic load of synaptic proteostasis. We found that nearly all synaptic proteins identified here exhibited half-lifetimes in the range of 2-5 days. Unexpectedly, metabolic turnover rates were not significantly different for presynaptic and postsynaptic proteins, or for proteins for which mRNAs are consistently found in dendrites. Some functionally or structurally related proteins exhibited very similar turnover rates, indicating that their biogenesis and degradation might be coupled, a possibility further supported by bioinformatics-based analyses. The relatively low turnover rates measured here (∼0.7% of synaptic protein content per hour) are in good agreement with imaging-based studies of synaptic protein trafficking, yet indicate that the metabolic load synaptic protein turnover places on individual neurons is very substantial
    corecore