431 research outputs found

    A large-scale proteogenomics study of apicomplexan pathogens-Toxoplasma gondii and Neospora caninum

    Get PDF
    Proteomics data can supplement genome annotation efforts, for example being used to confirm gene models or correct gene annotation errors. Here, we present a largeā€scale proteogenomics study of two important apicomplexan pathogens: Toxoplasma gondii and Neospora caninum. We queried proteomics data against a panel of official and alternate gene models generated directly from RNASeq data, using several newly generated and some previously published MS datasets for this metaā€analysis. We identified a total of 201 996 and 39 953 peptideā€spectrum matches for T. gondii and N. caninum, respectively, at a 1% peptide FDR threshold. This equated to the identification of 30 494 distinct peptide sequences and 2921 proteins (matches to official gene models) for T. gondii, and 8911 peptides/1273 proteins for N. caninum following stringent proteinā€level thresholding. We have also identified 289 and 140 loci for T. gondii and N. caninum, respectively, which mapped to RNAā€Seqā€derived gene models used in our analysis and apparently absent from the official annotation (release 10 from EuPathDB) of these species. We present several examples in our study where the RNAā€Seq evidence can help in correction of the current gene model and can help in discovery of potential new genes

    K2/Kleisli and GUS: Experiments in Integrated Access to Genomic Data Sources

    Get PDF
    The integration of heterogeneous data sources and software systems is a major issue in the biomed ical community and several approaches have been explored: linking databases, on-the- fly integration through views, and integration through warehousing. In this paper we report on our experiences with two systems that were developed at the University of Pennsylvania: an integration system called K2, which has primarily been used to provide views over multiple external data sources and software systems; and a data warehouse called GUS which downloads, cleans, integrates and annotates data from multiple external data sources. Although the view and warehouse approaches each have their advantages, there is no clear winner . Therefore, users must consider how the data is to be used, what the performance guarantees must be, and how much programmer time and expertise is available to choose the best strategy for a particular application

    A Path Algorithm for Constrained Estimation

    Full text link
    Many least squares problems involve affine equality and inequality constraints. Although there are variety of methods for solving such problems, most statisticians find constrained estimation challenging. The current paper proposes a new path following algorithm for quadratic programming based on exact penalization. Similar penalties arise in l1l_1 regularization in model selection. Classical penalty methods solve a sequence of unconstrained problems that put greater and greater stress on meeting the constraints. In the limit as the penalty constant tends to āˆž\infty, one recovers the constrained solution. In the exact penalty method, squared penalties are replaced by absolute value penalties, and the solution is recovered for a finite value of the penalty constant. The exact path following method starts at the unconstrained solution and follows the solution path as the penalty constant increases. In the process, the solution path hits, slides along, and exits from the various constraints. Path following in lasso penalized regression, in contrast, starts with a large value of the penalty constant and works its way downward. In both settings, inspection of the entire solution path is revealing. Just as with the lasso and generalized lasso, it is possible to plot the effective degrees of freedom along the solution path. For a strictly convex quadratic program, the exact penalty algorithm can be framed entirely in terms of the sweep operator of regression analysis. A few well chosen examples illustrate the mechanics and potential of path following.Comment: 26 pages, 5 figure

    Lysosomes in iron metabolism, ageing and apoptosis

    Get PDF
    The lysosomal compartment is essential for a variety of cellular functions, including the normal turnover of most long-lived proteins and all organelles. The compartment consists of numerous acidic vesicles (pH āˆ¼4 to 5) that constantly fuse and divide. It receives a large number of hydrolases (āˆ¼50) from the trans-Golgi network, and substrates from both the cellsā€™ outside (heterophagy) and inside (autophagy). Many macromolecules contain iron that gives rise to an iron-rich environment in lysosomes that recently have degraded such macromolecules. Iron-rich lysosomes are sensitive to oxidative stress, while ā€˜restingā€™ lysosomes, which have not recently participated in autophagic events, are not. The magnitude of oxidative stress determines the degree of lysosomal destabilization and, consequently, whether arrested growth, reparative autophagy, apoptosis, or necrosis will follow. Heterophagy is the first step in the process by which immunocompetent cells modify antigens and produce antibodies, while exocytosis of lysosomal enzymes may promote tumor invasion, angiogenesis, and metastasis. Apart from being an essential turnover process, autophagy is also a mechanism by which cells will be able to sustain temporary starvation and rid themselves of intracellular organisms that have invaded, although some pathogens have evolved mechanisms to prevent their destruction. Mutated lysosomal enzymes are the underlying cause of a number of lysosomal storage diseases involving the accumulation of materials that would be the substrate for the corresponding hydrolases, were they not defective. The normal, low-level diffusion of hydrogen peroxide into iron-rich lysosomes causes the slow formation of lipofuscin in long-lived postmitotic cells, where it occupies a substantial part of the lysosomal compartment at the end of the life span. This seems to result in the diversion of newly produced lysosomal enzymes away from autophagosomes, leading to the accumulation of malfunctioning mitochondria and proteins with consequent cellular dysfunction. If autophagy were a perfect turnover process, postmitotic ageing and several age-related neurodegenerative diseases would, perhaps, not take place

    Ligand-Receptor Interactions

    Full text link
    The formation and dissociation of specific noncovalent interactions between a variety of macromolecules play a crucial role in the function of biological systems. During the last few years, three main lines of research led to a dramatic improvement of our understanding of these important phenomena. First, combination of genetic engineering and X ray cristallography made available a simultaneous knowledg of the precise structure and affinity of series or related ligand-receptor systems differing by a few well-defined atoms. Second, improvement of computer power and simulation techniques allowed extended exploration of the interaction of realistic macromolecules. Third, simultaneous development of a variety of techniques based on atomic force microscopy, hydrodynamic flow, biomembrane probes, optical tweezers, magnetic fields or flexible transducers yielded direct experimental information of the behavior of single ligand receptor bonds. At the same time, investigation of well defined cellular models raised the interest of biologists to the kinetic and mechanical properties of cell membrane receptors. The aim of this review is to give a description of these advances that benefitted from a largely multidisciplinar approach

    The strategies WDK: a graphical search interface and web development kit for functional genomics databases

    Get PDF
    Web sites associated with the Eukaryotic Pathogen Bioinformatics Resource Center (EuPathDB.org) have recently introduced a graphical user interface, the Strategies WDK, intended to make advanced searching and set and interval operations easy and accessible to all users. With a design guided by usability studies, the system helps motivate researchers to perform dynamic computational experiments and explore relationships across data sets. For example, PlasmoDB users seeking novel therapeutic targets may wish to locate putative enzymes that distinguish pathogens from their hosts, and that are expressed during appropriate developmental stages. When a researcher runs one of the approximately 100 searches available on the site, the search is presented as a first step in a strategy. The strategy is extended by running additional searches, which are combined with set operators (union, intersect or minus), or genomic interval operators (overlap, contains). A graphical display uses Venn diagrams to make the strategyā€™s flow obvious. The interface facilitates interactive adjustment of the component searches with changes propagating forward through the strategy. Users may save their strategies, creating protocols that can be shared with colleagues. The strategy system has now been deployed on all EuPathDB databases, and successfully deployed by other projects. The Strategies WDK uses a configurable MVC architecture that is compatible with most genomics and biological warehouse databases, and is available for download at code.google.com/p/strategies-wdk

    ToxoDB: an integrated Toxoplasma gondii database resource

    Get PDF
    ToxoDB (http://ToxoDB.org) is a genome and functional genomic database for the protozoan parasite Toxoplasma gondii. It incorporates the sequence and annotation of the T. gondii ME49 strain, as well as genome sequences for the GT1, VEG and RH (Chr Ia, Chr Ib) strains. Sequence information is integrated with various other genomic-scale data, including community annotation, ESTs, gene expression and proteomics data. ToxoDB has matured significantly since its initial release. Here we outline the numerous updates with respect to the data and increased functionality available on the website

    FungiDB: an integrated functional genomics database for fungi

    Get PDF
    FungiDB (http://FungiDB.org) is a functional genomic resource for pan-fungal genomes that was developed in partnership with the Eukaryotic Pathogen Bioinformatic resource center (http://EuPathDB.org). FungiDB uses the same infrastructure and user interface as EuPathDB, which allows for sophisticated and integrated searches to be performed using an intuitive graphical system. The current release of FungiDB contains genome sequence and annotation from 18 species spanning several fungal classes, including the Ascomycota classes, Eurotiomycetes, Sordariomycetes, Saccharomycetes and the Basidiomycota orders, Pucciniomycetes and Tremellomycetes, and the basal ā€˜Zygomyceteā€™ lineage Mucormycotina. Additionally, FungiDB contains cell cycle microarray data, hyphal growth RNA-sequence data and yeast two hybrid interaction data. The underlying genomic sequence and annotation combined with functional data, additional data from the FungiDB standard analysis pipeline and the ability to leverage orthology provides a powerful resource for in silico experimentation

    GeneDB--an annotation database for pathogens.

    Get PDF
    GeneDB (http://www.genedb.org) is a genome database for prokaryotic and eukaryotic pathogens and closely related organisms. The resource provides a portal to genome sequence and annotation data, which is primarily generated by the Pathogen Genomics group at the Wellcome Trust Sanger Institute. It combines data from completed and ongoing genome projects with curated annotation, which is readily accessible from a web based resource. The development of the database in recent years has focused on providing database-driven annotation tools and pipelines, as well as catering for increasingly frequent assembly updates. The website has been significantly redesigned to take advantage of current web technologies, and improve usability. The current release stores 41 data sets, of which 17 are manually curated and maintained by biologists, who review and incorporate data from the scientific literature, as well as other sources. GeneDB is primarily a production and annotation database for the genomes of predominantly pathogenic organisms
    • ā€¦
    corecore