96 research outputs found

    Genome-wide polyadenylation site mapping datasets in the rice blast fungus Magnaporthe oryzae

    Get PDF
    Polyadenylation plays an important role in gene regulation, thus affecting a wide variety of biological processes. In the rice blast fungus Magnaporthe oryzae the cleavage factor I protein Rpb35 is required for pre-mRNA polyadenylation and fungal virulence. Here we present the bioinformatic approach and output data related to a global survey of polyadenylation site usage in M. oryzae wild-type and Delta rbp35 strains under a variety of nutrient conditions, some of which simulate the conditions experienced by the fungus during part of its infection cycle

    Bio::Homology::InterologWalk - A Perl module to build putative protein-protein interaction networks through interolog mapping

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Protein-protein interaction (PPI) data are widely used to generate network models that aim to describe the relationships between proteins in biological systems. The fidelity and completeness of such networks is primarily limited by the paucity of protein interaction information and by the restriction of most of these data to just a few widely studied experimental organisms. In order to extend the utility of existing PPIs, computational methods can be used that exploit functional conservation between orthologous proteins across taxa to predict putative PPIs or 'interologs'. To date most interolog prediction efforts have been restricted to specific biological domains with fixed underlying data sources and there are no software tools available that provide a generalised framework for 'on-the-fly' interolog prediction.</p> <p>Results</p> <p>We introduce <monospace>Bio::Homology::InterologWalk</monospace>, a Perl module to retrieve, prioritise and visualise putative protein-protein interactions through an orthology-walk method. The module uses orthology and experimental interaction data to generate putative PPIs and optionally collates meta-data into an Interaction Prioritisation Index that can be used to help prioritise interologs for further analysis. We show the application of our interolog prediction method to the genomic interactome of the fruit fly, <it>Drosophila melanogaster</it>. We analyse the resulting interaction networks and show that the method proposes new interactome members and interactions that are candidates for future experimental investigation.</p> <p>Conclusions</p> <p>Our interolog prediction tool employs the Ensembl Perl API and PSICQUIC enabled protein interaction data sources to generate up to date interologs 'on-the-fly'. This represents a significant advance on previous methods for interolog prediction as it allows the use of the latest orthology and protein interaction data for all of the genomes in Ensembl. The module outputs simple text files, making it easy to customise the results by post-processing, allowing the putative PPI datasets to be easily integrated into existing analysis workflows. The <monospace>Bio::Homology::InterologWalk</monospace> module, sample scripts and full documentation are freely available from the Comprehensive Perl Archive Network (CPAN) under the GNU Public license.</p

    Selective IgA Deficiency

    Get PDF
    Immunoglobulin A (IgA) deficiency is the most common primary immunodeficiency defined as decreased serum level of IgA in the presence of normal levels of other immunoglobulin isotypes. Most individuals with IgA deficiency are asymptomatic and identified coincidentally. However, some patients may present with recurrent infections of the respiratory and gastrointestinal tracts, allergic disorders, and autoimmune manifestations. Although IgA is the most abundant antibody isotype produced in the body, its functions are not clearly understood. Subclass IgA1 in monomeric form is mainly found in the blood circulation, whereas subclass IgA2 in dimeric form is the dominant immunoglobulin in mucosal secretions. Secretory IgA appears to have prime importance in immune exclusion of pathogenic microorganisms and maintenance of intestinal homeostasis. Despite this critical role, there may be some compensatory mechanisms that would prevent disease manifestations in some IgA-deficient individuals. In IgA deficiency, a maturation defect in B cells to produce IgA is commonly observed. Alterations in transmembrane activator and calcium modulator and cyclophilin ligand interactor gene appear to act as disease-modifying mutations in both IgA deficiency and common variable immunodeficiency, two diseases which probably lie in the same spectrum. Certain major histocompatibility complex haplotypes have been associated with susceptibility to IgA deficiency. The genetic basis of IgA deficiency remains to be clarified. Better understanding of the production and function of IgA is essential in elucidating the disease mechanism in IgA deficiency

    High-Throughput Proteomics Detection of Novel Splice Isoforms in Human Platelets

    Get PDF
    Alternative splicing (AS) is an intrinsic regulatory mechanism of all metazoans. Recent findings suggest that 100% of multiexonic human genes give rise to splice isoforms. AS can be specific to tissue type, environment or developmentally regulated. Splice variants have also been implicated in various diseases including cancer. Detection of these variants will enhance our understanding of the complexity of the human genome and provide disease-specific and prognostic biomarkers. We adopted a proteomics approach to identify exon skip events - the most common form of AS. We constructed a database harboring the peptide sequences derived from all hypothetical exon skip junctions in the human genome. Searching tandem mass spectrometry (MS/MS) data against the database allows the detection of exon skip events, directly at the protein level. Here we describe the application of this approach to human platelets, including the mRNA-based verification of novel splice isoforms of ITGA2, NPEPPS and FH. This methodology is applicable to all new or existing MS/MS datasets

    Identification of a Novel Class of Farnesylation Targets by Structure-Based Modeling of Binding Specificity

    Get PDF
    Farnesylation is an important post-translational modification catalyzed by farnesyltransferase (FTase). Until recently it was believed that a C-terminal CaaX motif is required for farnesylation, but recent experiments have revealed larger substrate diversity. In this study, we propose a general structural modeling scheme to account for peptide binding specificity and recapitulate the experimentally derived selectivity profile of FTase in vitro. In addition to highly accurate recovery of known FTase targets, we also identify a range of novel potential targets in the human genome, including a new substrate class with an acidic C-terminal residue (CxxD/E). In vitro experiments verified farnesylation of 26/29 tested peptides, including both novel human targets, as well as peptides predicted to tightly bind FTase. This study extends the putative range of biological farnesylation substrates. Moreover, it suggests that the ability of a peptide to bind FTase is a main determinant for the farnesylation reaction. Finally, simple adaptation of our approach can contribute to more accurate and complete elucidation of peptide-mediated interactions and modifications in the cell

    Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint

    Get PDF
    BACKGROUND: Structural genomics initiatives were established with the aim of solving protein structures on a large-scale. For many initiatives, such as the Protein Structure Initiative (PSI), the primary aim of target selection is focussed towards structurally characterising protein families which, so far, lack a structural representative. It is therefore of considerable interest to gain insights into the number and distribution of these families, and what efforts may be required to achieve a comprehensive structural coverage across all protein families. RESULTS: In this analysis we have derived a comprehensive domain annotation of the genomes using CATH, Pfam-A and Newfam domain families. We consider what proportions of structurally uncharacterised families are accessible to high-throughput structural genomics pipelines, specifically those targeting families containing multiple prokaryotic orthologues. In measuring the domain coverage of the genomes, we show the benefits of selecting targets from both structurally uncharacterised domain families, whilst in addition, pursuing additional targets from large structurally characterised protein superfamilies. CONCLUSION: This work suggests that such a combined approach to target selection is essential if structural genomics is to achieve a comprehensive structural coverage of the genomes, leading to greater insights into structure and the mechanisms that underlie protein evolution

    Plasma Proteome Profiles Associated with Inflammation, Angiogenesis, and Cancer

    Get PDF
    Tumor development is accompanied by a complex host systemic response, which includes inflammatory and angiogenic reactions. Both tumor-derived and systemic response proteins are detected in plasma from cancer patients. However, given their non-specific nature, systemic response proteins can confound the detection or diagnosis of neoplasia. Here, we have applied an in-depth quantitative proteomic approach to analyze plasma protein changes in mouse models of subacute irritant-driven inflammation, autoreactive inflammation, and matrix associated angiogenesis and compared results to previously described findings from mouse models of polyoma middle T-driven breast cancer and Pdx1-Cre KrasG12D Ink4a/Arf lox/lox -induced pancreatic cancer. Among the confounding models, approximately 1/3 of all quantified plasma proteins exhibited a significant change in abundance compared to control mice. Of the proteins that changed in abundance, the majority were unique to each model. Altered proteins included those involved in acute phase response, inflammation, extracellular matrix remodeling, angiogenesis, and TGFβ signaling. Comparison of changes in plasma proteins between the confounder models and the two cancer models revealed proteins that were restricted to the cancer-bearing mice, reflecting the known biology of these tumors. This approach provides a basis for distinguishing between protein changes in plasma that are cancer-related and those that are part of a non-specific host response

    Network-Guided Analysis of Genes with Altered Somatic Copy Number and Gene Expression Reveals Pathways Commonly Perturbed in Metastatic Melanoma

    Get PDF
    Cancer genomes frequently contain somatic copy number alterations (SCNA) that can significantly perturb the expression level of affected genes and thus disrupt pathways controlling normal growth. In melanoma, many studies have focussed on the copy number and gene expression levels of the BRAF, PTEN and MITF genes, but little has been done to identify new genes using these parameters at the genome-wide scale. Using karyotyping, SNP and CGH arrays, and RNA-seq, we have identified SCNA affecting gene expression (‘SCNA-genes’) in seven human metastatic melanoma cell lines. We showed that the combination of these techniques is useful to identify candidate genes potentially involved in tumorigenesis. Since few of these alterations were recurrent across our samples, we used a protein network-guided approach to determine whether any pathways were enriched in SCNA-genes in one or more samples. From this unbiased genome-wide analysis, we identified 28 significantly enriched pathway modules. Comparison with two large, independent melanoma SCNA datasets showed less than 10% overlap at the individual gene level, but network-guided analysis revealed 66% shared pathways, including all but three of the pathways identified in our data. Frequently altered pathways included WNT, cadherin signalling, angiogenesis and melanogenesis. Additionally, our results emphasize the potential of the EPHA3 and FRS2 gene products, involved in angiogenesis and migration, as possible therapeutic targets in melanoma. Our study demonstrates the utility of network-guided approaches, for both large and small datasets, to identify pathways recurrently perturbed in cancer

    Bioinformatics and molecular modeling in glycobiology

    Get PDF
    The field of glycobiology is concerned with the study of the structure, properties, and biological functions of the family of biomolecules called carbohydrates. Bioinformatics for glycobiology is a particularly challenging field, because carbohydrates exhibit a high structural diversity and their chains are often branched. Significant improvements in experimental analytical methods over recent years have led to a tremendous increase in the amount of carbohydrate structure data generated. Consequently, the availability of databases and tools to store, retrieve and analyze these data in an efficient way is of fundamental importance to progress in glycobiology. In this review, the various graphical representations and sequence formats of carbohydrates are introduced, and an overview of newly developed databases, the latest developments in sequence alignment and data mining, and tools to support experimental glycan analysis are presented. Finally, the field of structural glycoinformatics and molecular modeling of carbohydrates, glycoproteins, and protein–carbohydrate interaction are reviewed
    corecore