180 research outputs found

    The strategies WDK: a graphical search interface and web development kit for functional genomics databases

    Web sites associated with the Eukaryotic Pathogen Bioinformatics Resource Center (EuPathDB.org) have recently introduced a graphical user interface, the Strategies WDK, intended to make advanced searching and set and interval operations easy and accessible to all users. With a design guided by usability studies, the system helps motivate researchers to perform dynamic computational experiments and explore relationships across data sets. For example, PlasmoDB users seeking novel therapeutic targets may wish to locate putative enzymes that distinguish pathogens from their hosts, and that are expressed during appropriate developmental stages. When a researcher runs one of the approximately 100 searches available on the site, the search is presented as a first step in a strategy. The strategy is extended by running additional searches, which are combined with set operators (union, intersect or minus), or genomic interval operators (overlap, contains). A graphical display uses Venn diagrams to make the strategy’s flow obvious. The interface facilitates interactive adjustment of the component searches with changes propagating forward through the strategy. Users may save their strategies, creating protocols that can be shared with colleagues. The strategy system has now been deployed on all EuPathDB databases, and successfully deployed by other projects. The Strategies WDK uses a configurable MVC architecture that is compatible with most genomics and biological warehouse databases, and is available for download at code.google.com/p/strategies-wdk
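
    The way a strategy combines search results can be illustrated with a minimal Python sketch. It is not the Strategies WDK implementation or its API: the Feature class, the function names and the gene IDs below are hypothetical, chosen only to show how the set operators (union, intersect, minus) and the genomic interval operators (overlap, contains) named in the abstract behave when steps are chained.

```python
from dataclasses import dataclass


# Hypothetical data model for illustration only; not part of the Strategies WDK.
@dataclass(frozen=True)
class Feature:
    gene_id: str
    chrom: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def combine_ids(left, right, op):
    """Combine two sets of gene IDs with a set operator."""
    ops = {
        "union": lambda a, b: a | b,
        "intersect": lambda a, b: a & b,
        "minus": lambda a, b: a - b,
    }
    if op not in ops:
        raise ValueError(f"unknown set operator: {op}")
    return ops[op](set(left), set(right))


def overlaps(a, b):
    """True if the two features share at least one base on the same sequence."""
    return a.chrom == b.chrom and a.start <= b.end and b.start <= a.end


def contains(a, b):
    """True if feature a fully contains feature b on the same sequence."""
    return a.chrom == b.chrom and a.start <= b.start and b.end <= a.end


def combine_intervals(left, right, op):
    """Keep the features in `left` related to any feature in `right` by `op`."""
    tests = {"overlap": overlaps, "contains": contains}
    if op not in tests:
        raise ValueError(f"unknown interval operator: {op}")
    test = tests[op]
    return [a for a in left if any(test(a, b) for b in right)]


# Made-up search results standing in for three steps of a strategy.
putative_enzymes = {"g1", "g2", "g3"}
has_host_ortholog = {"g2"}
expressed_in_stage = {"g1", "g3", "g4"}

# Step 1 minus step 2, then intersect with step 3.
pathogen_specific = combine_ids(putative_enzymes, has_host_ortholog, "minus")
candidates = combine_ids(pathogen_specific, expressed_in_stage, "intersect")
```

    Because each step is computed from the previous result, changing an earlier search and re-running the later lines mirrors the forward propagation of changes that the abstract describes.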

    Hydroxychloroquine is associated with a lower risk of polyautoimmunity: data from the RELESSER Registry

    OBJECTIVES: This article estimates the frequency of polyautoimmunity and associated factors in a large retrospective cohort of patients with SLE. METHODS: RELESSER (Spanish Society of Rheumatology Lupus Registry) is a nationwide multicentre, hospital-based registry of SLE patients. This is a cross-sectional study. The main variable was polyautoimmunity, which was defined as the co-occurrence of SLE and another autoimmune disease, such as autoimmune thyroiditis, RA, scleroderma, inflammatory myopathy and MCTD. We also recorded the presence of multiple autoimmune syndrome, secondary SS, secondary APS and a family history of autoimmune disease. Multiple logistic regression analysis was performed to investigate possible risk factors for polyautoimmunity. RESULTS: Of the 3679 patients who fulfilled the criteria for SLE, 502 (13.6%) had polyautoimmunity. The most frequent types were autoimmune thyroiditis (7.9%), other systemic autoimmune diseases (6.2%), secondary SS (14.1%) and secondary APS (13.7%). Multiple autoimmune syndrome accounted for 10.2% of all cases of polyautoimmunity. A family history was recorded in 11.8%. According to the multivariate analysis, the factors associated with polyautoimmunity were female sex [odds ratio (95% CI), 1.72 (1.07, 2.72)], RP [1.63 (1.29, 2.05)], interstitial lung disease [3.35 (1.84, 6.01)], Jaccoud arthropathy [1.92 (1.40, 2.63)], anti-Ro/SSA and/or anti-La/SSB autoantibodies [2.03 (1.55, 2.67)], anti-RNP antibodies [1.48 (1.16, 1.90)], MTX [1.67 (1.26, 2.18)] and antimalarial drugs [0.50 (0.38, 0.67)]. CONCLUSION: Patients with SLE frequently present polyautoimmunity. We observed clinical and analytical characteristics associated with polyautoimmunity. Our finding that antimalarial drugs protected against polyautoimmunity should be verified in future studies

    A Genetic Screen for Attenuated Growth Identifies Genes Crucial for Intraerythrocytic Development of Plasmodium falciparum

    A majority of the Plasmodium falciparum genome codes for genes with unknown functions, which presents a major challenge to understanding the parasite's biology. Large-scale functional analysis of the parasite genome is essential to pave the way for novel therapeutic intervention strategies against the disease and yet difficulties in genetic manipulation of this deadly human malaria parasite have been a major hindrance for functional analysis of its genome. Here, we used a forward functional genomic approach to study P. falciparum and identify genes important for optimal parasite development in the disease-causing, intraerythrocytic stages. We analyzed 123 piggyBac insertion mutants of P. falciparum for proliferation efficiency in the intraerythrocytic stages, in vitro. Almost 50% of the analyzed mutants showed significant reduction in proliferation efficiency, with 20% displaying severe defects. Functional categorization of genes in the severely attenuated mutants revealed significant enrichment for RNA binding proteins, suggesting the significance of post-transcriptional gene regulation in parasite development and emphasizing its importance as an antimalarial target. This study demonstrates the feasibility of much needed forward genetics approaches for P. falciparum to better characterize its genome and accelerate drug and vaccine development

    Ensembl Genomes: Extending Ensembl across the taxonomic space

    Ensembl Genomes (http://www.ensemblgenomes.org) is a new portal offering integrated access to genome-scale data from non-vertebrate species of scientific interest, developed using the Ensembl genome annotation and visualisation platform. Ensembl Genomes consists of five sub-portals (for bacteria, protists, fungi, plants and invertebrate metazoa) designed to complement the availability of vertebrate genomes in Ensembl. Many of the databases supporting the portal have been built in close collaboration with the scientific community, which we consider as essential for maintaining the accuracy and usefulness of the resource. A common set of user interfaces (which include a graphical genome browser, FTP, BLAST search, a query optimised data warehouse, programmatic access, and a Perl API) is provided for all domains. Data types incorporated include annotation of (protein and non-protein coding) genes, cross references to external resources, and high throughput experimental data (e.g. data from large scale studies of gene expression and polymorphism visualised in their genomic context). Additionally, extensive comparative analysis has been performed, both within defined clades and across the wider taxonomy, and sequence alignments and gene trees resulting from this can be accessed through the site

    GiardiaDB and TrichDB : integrated genomic resources for the eukaryotic protist pathogens Giardia lamblia and Trichomonas vaginalis

    © 2008 The Authors. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License. The definitive version was published in Nucleic Acids Research 37 (2009): D526-D530, doi:10.1093/nar/gkn631. GiardiaDB (http://GiardiaDB.org) and TrichDB (http://TrichDB.org) house the genome databases for Giardia lamblia and Trichomonas vaginalis, respectively, and represent the latest additions to the EuPathDB (http://EuPathDB.org) family of functional genomic databases. GiardiaDB and TrichDB employ the same framework as other EuPathDB sites (CryptoDB, PlasmoDB and ToxoDB), supporting fully integrated and searchable databases. Genomic-scale data available via these resources may be queried based on BLAST searches, annotation keywords and gene ID searches, GO terms, sequence motifs and other protein characteristics. Functional queries may also be formulated, based on transcript and protein expression data from a variety of platforms. Phylogenetic relationships may also be interrogated. The ability to combine the results from independent queries, and to store queries and query results for future use facilitates complex, genome-wide mining of functional genomic data. Federal funds from the National Institute of Allergy and Infectious Diseases; Department of Health and Human Services, National Institutes of Health (HHSN266200400037C). Funding for open access charge: National Institutes of Health (HHSN266200400037C)

    A unified framework for managing provenance information in translational research

    BACKGROUND: A critical aspect of the NIH Translational Research roadmap, which seeks to accelerate the delivery of "bench-side" discoveries to patient's "bedside," is the management of the provenance metadata that keeps track of the origin and history of data resources as they traverse the path from the bench to the bedside and back. A comprehensive provenance framework is essential for researchers to verify the quality of data, reproduce scientific results published in peer-reviewed literature, validate scientific process, and associate trust value with data and results. Traditional approaches to provenance management have focused on only partial sections of the translational research life cycle and they do not incorporate "domain semantics", which is essential to support domain-specific querying and analysis by scientists. RESULTS: We identify a common set of challenges in managing provenance information across the pre-publication and post-publication phases of data in the translational research lifecycle. We define the semantic provenance framework (SPF), underpinned by the Provenir upper-level provenance ontology, to address these challenges in the four stages of provenance metadata: (a) Provenance collection - during data generation; (b) Provenance representation - to support interoperability, reasoning, and incorporate domain semantics; (c) Provenance storage and propagation - to allow efficient storage and seamless propagation of provenance as the data is transferred across applications; (d) Provenance query - to support queries with increasing complexity over large data size and also support knowledge discovery applications. We apply the SPF to two exemplar translational research projects, namely the Semantic Problem Solving Environment for Trypanosoma cruzi (T. cruzi SPSE) and the Biomedical Knowledge Repository (BKR) project, to demonstrate its effectiveness. CONCLUSIONS: The SPF provides a unified framework to effectively manage provenance of translational research data during pre and post-publication phases. This framework is underpinned by an upper-level provenance ontology called Provenir that is extended to create domain-specific provenance ontologies to facilitate provenance interoperability, seamless propagation of provenance, automated querying, and analysis.

    RSpred, a set of Hidden Markov Models to detect and classify the RIFIN and STEVOR proteins of Plasmodium falciparum

    BACKGROUND: Many parasites use multicopy protein families to avoid their host's immune system through a strategy called antigenic variation. RIFIN and STEVOR proteins are variable surface antigens uniquely found in the malaria parasites Plasmodium falciparum and P. reichenowi. Although these two protein families are different, they have more similarity to each other than to any other proteins described to date. As a result, they have been grouped together in one Pfam domain. However, a recent study has described the sub-division of the RIFIN protein family into several functionally distinct groups. These sub-groups require phylogenetic analysis to sort out, which is not practical for large-scale projects, such as the sequencing of patient isolates and meta-genomic analysis. RESULTS: We have manually curated the rif and stevor gene repertoires of two Plasmodium falciparum genomes, isolates DD2 and HB3. We have identified 25% of mis-annotated and ~30 missing rif and stevor genes. Using these data sets, as well as sequences from the well curated reference genome (isolate 3D7) and field isolate data from Uniprot, we have developed a tool named RSpred. The tool, based on a set of hidden Markov models and an evaluation program, automatically identifies STEVOR and RIFIN sequences as well as the sub-groups: A-RIFIN, B-RIFIN, B1-RIFIN and B2-RIFIN. In addition to these groups, we distinguish a small subset of STEVOR proteins that we named STEVOR-like, as they either differ remarkably from typical STEVOR proteins or are too fragmented to reach a high enough score. When compared to Pfam and TIGRFAMs, RSpred proves to be a more robust and more sensitive method. We have applied RSpred to the proteomes of several P. falciparum strains, P. reichenowi, P. vivax, P. knowlesi and the rodent malaria species. All groups were found in the P. falciparum strains, and also in the P. reichenowi parasite, whereas none were predicted in the other species. CONCLUSIONS: We have generated a tool for the sorting of RIFIN and STEVOR proteins, large antigenic variant protein groups, into homogeneous sub-families. Assigning functions to such protein families requires their subdivision into meaningful groups such as we have shown for the RIFIN protein family. RSpred removes the need for complicated and time consuming phylogenetic analysis methods. It will benefit both research groups sequencing whole genomes as well as others working with field isolates. RSpred is freely accessible via http://www.ifm.liu.se/bioinfo/.

    A Semantic Problem Solving Environment for Integrative Parasite Research: Identification of Intervention Targets for Trypanosoma cruzi

    Effective research in parasite biology requires analyzing experimental lab data in the context of constantly expanding public data resources. Integrating lab data with public resources is particularly difficult for biologists who may not possess significant computational skills to acquire and process heterogeneous data stored at different locations. Therefore, we develop a semantic problem solving environment (SPSE) that allows parasitologists to query their lab data integrated with public resources using ontologies. An ontology specifies a common vocabulary and formal relationships among the terms that describe an organism, and experimental data and processes in this case. SPSE supports capturing and querying provenance information, which is metadata on the experimental processes and data recorded for reproducibility, and includes a visual query-processing tool to formulate complex queries without learning the query language syntax. We demonstrate the significance of SPSE in identifying gene knockout targets for T. cruzi. The overall goal of SPSE is to help researchers discover new or existing knowledge that is implicitly present in the data but not always easily detected. Results demonstrate improved usefulness of SPSE over existing lab systems and approaches, and support for complex query design that is otherwise difficult to achieve without the knowledge of query language syntax

    clag9 Is Not Essential for PfEMP1 Surface Expression in Non-Cytoadherent Plasmodium falciparum Parasites with a Chromosome 9 Deletion

    BACKGROUND: The expression of the clonally variant virulence factor PfEMP1 mediates the sequestration of Plasmodium falciparum infected erythrocytes in the host vasculature and contributes to chronic infection. Non-cytoadherent parasites with a chromosome 9 deletion lack clag9, a gene linked to cytoadhesion in previous studies. Here we present new clag9 data that challenge this view and show that the non-cytoadherence phenotype is linked to the expression of a non-functional PfEMP1. METHODOLOGY/PRINCIPAL FINDINGS: Loss of adhesion in P. falciparum D10, a parasite line with a large chromosome 9 deletion, was investigated. Surface iodination analysis of non-cytoadherent D10 parasites and COS-7 surface expression of the CD36-binding PfEMP1 CIDR1α domain were performed and showed that these parasites express an unusual trypsin-resistant, non-functional PfEMP1 at the erythrocyte surface. However, the CIDR1α domain of this var gene expressed in COS-7 cells showed strong binding to CD36. Atomic Force Microscopy showed a slightly modified D10 knob morphology compared to adherent parasites. Trafficking of PfEMP1 and KAHRP remained functional in D10. We link the non-cytoadherence phenotype to a chromosome 9 breakage and healing event resulting in the loss of 25 subtelomeric genes including clag9. In contrast to previous studies, knockout of the clag9 gene from 3D7 did not interfere with parasite adhesion to CD36. CONCLUSIONS/SIGNIFICANCE: Our data show the surface expression of non-functional PfEMP1 in D10, strongly indicating that genes other than clag9 deleted from chromosome 9 are involved in this virulence process, possibly via post-translational modifications

    Genome analysis and comparative genomics of a Giardia intestinalis assemblage E isolate

    BACKGROUND: Giardia intestinalis is a protozoan parasite that causes diarrhea in a wide range of mammalian species. To further understand the genetic diversity between the Giardia intestinalis species, we have performed genome sequencing and analysis of a wild-type Giardia intestinalis sample from the assemblage E group, isolated from a pig. RESULTS: We identified 5012 protein coding genes, the majority of which are conserved compared to the previously sequenced genomes of the WB and GS strains in terms of microsynteny and sequence identity. Despite this, there is an unexpectedly large number of chromosomal rearrangements and several smaller structural changes that are present in all chromosomes. Novel members of the VSP, NEK Kinase and HCMP gene families were identified, which may reveal possible mechanisms for host specificity and new avenues for antigenic variation. We used comparative genomics of the three diverse Giardia intestinalis isolates P15, GS and WB to define a core proteome for this species complex and to identify lineage-specific genes. Extensive analyses of polymorphisms in the core proteome of Giardia revealed differential rates of divergence among cellular processes. CONCLUSIONS: Our results indicate that despite a well conserved core of genes there is significant genome variation between Giardia isolates, both in terms of gene content, gene polymorphisms, structural chromosomal variations and surface molecule repertoires. This study improves the annotation of the Giardia genomes and enables the identification of functionally important variation.