56 research outputs found

    EST-PAC a web package for EST annotation and protein sequence prediction

    Get PDF
    With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST) from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST) annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1) searching local or remote biological databases for sequence similarities using Blast services, 2) predicting protein coding sequence from EST data and, 3) annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics

    annot8r: GO, EC and KEGG annotation of EST datasets

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The expressed sequence tag (EST) methodology is an attractive option for the generation of sequence data for species for which no completely sequenced genome is available. The annotation and comparative analysis of such datasets poses a formidable challenge for research groups that do not have the bioinformatics infrastructure of major genome sequencing centres. Therefore, there is a need for user-friendly tools to facilitate the annotation of non-model species EST datasets with well-defined ontologies that enable meaningful cross-species comparisons. To address this, we have developed annot8r, a platform for the rapid annotation of EST datasets with GO-terms, EC-numbers and KEGG-pathways.</p> <p>Results</p> <p>annot8r automatically downloads all files relevant for the annotation process and generates a reference database that stores UniProt entries, their associated Gene Ontology (GO), Enzyme Commission (EC) and Kyoto Encyclopaedia of Genes and Genomes (KEGG) annotation and additional relevant data. For each of GO, EC and KEGG, annot8r extracts a specific sequence subset from the UniProt dataset based on the information stored in the reference database. These three subsets are then formatted for BLAST searches. The user provides the protein or nucleotide sequences to be annotated and annot8r runs BLAST searches against these three subsets. The BLAST results are parsed and the corresponding annotations retrieved from the reference database. The annotations are saved both as flat files and also in a relational postgreSQL results database to facilitate more advanced searches within the results. annot8r is integrated with the PartiGene suite of EST analysis tools.</p> <p>Conclusion</p> <p>annot8r is a tool that assigns GO, EC and KEGG annotations for data sets resulting from EST sequencing projects both rapidly and efficiently. The benefits of an underlying relational database, flexibility and the ease of use of the program make it ideally suited for non-model species EST-sequencing projects.</p

    A transcriptomic analysis of Echinococcus granulosus larval stages:implications for parasite biology and host adaptation

    Get PDF
    The cestode Echinococcus granulosus--the agent of cystic echinococcosis, a zoonosis affecting humans and domestic animals worldwide--is an excellent model for the study of host-parasite cross-talk that interfaces with two mammalian hosts. To develop the molecular analysis of these interactions, we carried out an EST survey of E. granulosus larval stages. We report the salient features of this study with a focus on genes reflecting physiological adaptations of different parasite stages.We generated ~10,000 ESTs from two sets of full-length enriched libraries (derived from oligo-capped and trans-spliced cDNAs) prepared with three parasite materials: hydatid cyst wall, larval worms (protoscoleces), and pepsin/H(+)-activated protoscoleces. The ESTs were clustered into 2700 distinct gene products. In the context of the biology of E. granulosus, our analyses reveal: (i) a diverse group of abundant long non-protein coding transcripts showing homology to a middle repetitive element (EgBRep) that could either be active molecular species or represent precursors of small RNAs (like piRNAs); (ii) an up-regulation of fermentative pathways in the tissue of the cyst wall; (iii) highly expressed thiol- and selenol-dependent antioxidant enzyme targets of thioredoxin glutathione reductase, the functional hub of redox metabolism in parasitic flatworms; (iv) candidate apomucins for the external layer of the tissue-dwelling hydatid cyst, a mucin-rich structure that is critical for survival in the intermediate host; (v) a set of tetraspanins, a protein family that appears to have expanded in the cestode lineage; and (vi) a set of platyhelminth-specific gene products that may offer targets for novel pan-platyhelminth drug development.This survey has greatly increased the quality and the quantity of the molecular information on E. granulosus and constitutes a valuable resource for gene prediction on the parasite genome and for further genomic and proteomic analyses focused on cestodes and platyhelminths

    Clinical Outcomes of 2-Drug Regimens vs 3-Drug Regimens in Antiretroviral Treatment–Experienced People Living With Human Immunodeficiency Virus

    Get PDF
    BACKGROUND: Limited data exist comparing clinical outcomes of two-drug regimens (2DRs) and three-drug regimens (3DRs) in people living with HIV. METHODS: Antiretroviral treatment-experienced individuals in RESPOND switching to a new 2DR or 3DR from 1/1/12-1/10/18 were included. The incidence of clinical events (AIDS, non-AIDS cancer, cardiovascular disease, end-stage liver and renal disease, death) was compared between regimens using Poisson regression. RESULTS: Of 9791 individuals included, 1088 (11.1%) started 2DRs and 8703 (88.9%) 3DRs. The most common 2DRs were dolutegravir plus lamivudine (22.8%) and raltegravir plus boosted darunavir (19.8%); the most common 3DR was dolutegravir plus 2 nucleoside reverse transcriptase inhibitors (46.9%). Individuals on 2DRs were older (median 52.6 years [interquartile range 46.7-59.0] vs 47.7 [39.7-54.3]), and a higher proportion had β‰₯1 comorbidity (81.6% vs 73.9%).There were 619 events during 27,159 person-years of follow-up (PYFU): 540 (incidence rate [IR] 22.5/1000 PYFU [95% CI 20.7-24.5]) on 3DRs, 79 (30.9/1000 PYFU [24.8-38.5]) on 2DRs. The most common events were death (7.5/1000 PYFU [95% CI 6.5-8.6]) and non-AIDS cancer (5.8/1000 PYFU [4.9-6.8]). After adjustment for baseline demographic and clinical characteristics, there was a similar incidence of events on both regimen types (2DRs vs 3DRs IR ratio: 0.92 [0.72-1.19]; p=0.53). CONCLUSIONS: This is the first large, international cohort assessing clinical outcomes on 2DRs. After accounting for baseline characteristics, there was a similar incidence of events on 2DRs and 3DRs. 2DRs appear to be a viable treatment option with regard to clinical outcomes; further research on resistance barriers and long-term durability of 2DRs is needed

    Update of complications and functional outcome of the ileo-pouch anal anastomosis: overview of evidence and meta-analysis of 96 observational studies

    Get PDF
    Item does not contain fulltextOBJECTIVE: The objective of this study is to provide a comprehensive update of the outcome of the ileo-pouch anal anastomosis (IPAA). DATA SOURCES: An extensive search in PubMed, EMBASE, and The Cochrane Library was conducted. STUDY SELECTION AND DATA EXTRACTION: All studies published after 2000 reporting on complications or functional outcome after a primary open IPAA procedure for UC or FAP were selected. Study characteristics, functional outcome, and complications were extracted. DATA SYNTHESIS: A review with similar methodology conducted 10 years earlier was used to evaluate developments in outcome over time. Pooled estimates were compared using a random-effects logistic meta-analyzing technique. Analyses focusing on the effect of time of study conductance, centralization, and variation in surgical techniques were performed. RESULTS: Fifty-three studies including 14,966 patients were included. Pooled rates of pouch failure and pelvic sepsis were 4.3% (95% CI, 3.5-6.3) and 7.5% (95% CI 6.1-9.1), respectively. Compared to studies published before 2000, a reduction of 2.5% was observed in the pouch failure rate (p = 0.0038). Analysis on the effect of the time of study conductance confirmed a decline in pouch failure. Functional outcome remained stable over time, with a 24-h defecation frequency of 5.9 (95% CI, 5.0-6.9). Technical surgery aspects did not have an important effect on outcome. CONCLUSION: This review provides up to date outcome estimates of the IPAA procedure that can be useful as reference values for practice and research. It is also shows a reduction in pouch failure over time.1 juli 201

    TESTLoc: protein subcellular localization prediction from EST data

    Get PDF
    Abstract Background The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to unravelling the function of individual proteins. Computational localization prediction is possible based on sequence information alone, and has been successfully applied to proteins from virtually all subcellular compartments and all domains of life. However, we realized that current prediction tools do not perform well on partial protein sequences such as those inferred from Expressed Sequence Tag (EST) data, limiting the exploitation of the large and taxonomically most comprehensive body of sequence information from eukaryotes. Results We developed a new predictor, TESTLoc, suited for subcellular localization prediction of proteins based on their partial sequence conceptually translated from ESTs (EST-peptides). Support Vector Machine (SVM) is used as computational method and EST-peptides are represented by different features such as amino acid composition and physicochemical properties. When TESTLoc was applied to the most challenging test case (plant data), it yielded high accuracy (~85%). Conclusions TESTLoc is a localization prediction tool tailored for EST data. It provides a variety of models for the users to choose from, and is available for download at http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html</p

    Next generation transcriptomes for next generation genomes using est2assembly

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The decreasing costs of capillary-based Sanger sequencing and next generation technologies, such as 454 pyrosequencing, have prompted an explosion of transcriptome projects in non-model species, where even shallow sequencing of transcriptomes can now be used to examine a range of research questions. This rapid growth in data has outstripped the ability of researchers working on non-model species to analyze and mine transcriptome data efficiently.</p> <p>Results</p> <p>Here we present a semi-automated platform '<it>est2assembly</it>' that processes raw sequence data from Sanger or 454 sequencing into a hybrid <it>de-novo </it>assembly, annotates it and produces GMOD compatible output, including a SeqFeature database suitable for GBrowse. Users are able to parameterize assembler variables, judge assembly quality and determine the optimal assembly for their specific needs. We used <it>est2assembly </it>to process <it>Drosophila </it>and <it>Bicyclus </it>public Sanger EST data and then compared them to published 454 data as well as eight new insect transcriptome collections.</p> <p>Conclusions</p> <p>Analysis of such a wide variety of data allows us to understand how these new technologies can assist EST project design. We determine that assembler parameterization is as essential as standardized methods to judge the output of ESTs projects. Further, even shallow sequencing using 454 produces sufficient data to be of wide use to the community. <it>est2assembly </it>is an important tool to assist manual curation for gene models, an important resource in their own right but especially for species which are due to acquire a genome project using Next Generation Sequencing.</p

    Comparative Genomics of the Apicomplexan Parasites Toxoplasma gondii and Neospora caninum: Coccidia Differing in Host Range and Transmission Strategy

    Get PDF
    Toxoplasma gondii is a zoonotic protozoan parasite which infects nearly one third of the human population and is found in an extraordinary range of vertebrate hosts. Its epidemiology depends heavily on horizontal transmission, especially between rodents and its definitive host, the cat. Neospora caninum is a recently discovered close relative of Toxoplasma, whose definitive host is the dog. Both species are tissue-dwelling Coccidia and members of the phylum Apicomplexa; they share many common features, but Neospora neither infects humans nor shares the same wide host range as Toxoplasma, rather it shows a striking preference for highly efficient vertical transmission in cattle. These species therefore provide a remarkable opportunity to investigate mechanisms of host restriction, transmission strategies, virulence and zoonotic potential. We sequenced the genome of N. caninum and transcriptomes of the invasive stage of both species, undertaking an extensive comparative genomics and transcriptomics analysis. We estimate that these organisms diverged from their common ancestor around 28 million years ago and find that both genomes and gene expression are remarkably conserved. However, in N. caninum we identified an unexpected expansion of surface antigen gene families and the divergence of secreted virulence factors, including rhoptry kinases. Specifically we show that the rhoptry kinase ROP18 is pseudogenised in N. caninum and that, as a possible consequence, Neospora is unable to phosphorylate host immunity-related GTPases, as Toxoplasma does. This defense strategy is thought to be key to virulence in Toxoplasma. We conclude that the ecological niches occupied by these species are influenced by a relatively small number of gene products which operate at the host-parasite interface and that the dominance of vertical transmission in N. caninum may be associated with the evolution of reduced virulence in this species
    • …
    corecore