679 research outputs found
annot8r: GO, EC and KEGG annotation of EST datasets
<p>Abstract</p> <p>Background</p> <p>The expressed sequence tag (EST) methodology is an attractive option for the generation of sequence data for species for which no completely sequenced genome is available. The annotation and comparative analysis of such datasets poses a formidable challenge for research groups that do not have the bioinformatics infrastructure of major genome sequencing centres. Therefore, there is a need for user-friendly tools to facilitate the annotation of non-model species EST datasets with well-defined ontologies that enable meaningful cross-species comparisons. To address this, we have developed annot8r, a platform for the rapid annotation of EST datasets with GO-terms, EC-numbers and KEGG-pathways.</p> <p>Results</p> <p>annot8r automatically downloads all files relevant for the annotation process and generates a reference database that stores UniProt entries, their associated Gene Ontology (GO), Enzyme Commission (EC) and Kyoto Encyclopaedia of Genes and Genomes (KEGG) annotation and additional relevant data. For each of GO, EC and KEGG, annot8r extracts a specific sequence subset from the UniProt dataset based on the information stored in the reference database. These three subsets are then formatted for BLAST searches. The user provides the protein or nucleotide sequences to be annotated and annot8r runs BLAST searches against these three subsets. The BLAST results are parsed and the corresponding annotations retrieved from the reference database. The annotations are saved both as flat files and also in a relational postgreSQL results database to facilitate more advanced searches within the results. annot8r is integrated with the PartiGene suite of EST analysis tools.</p> <p>Conclusion</p> <p>annot8r is a tool that assigns GO, EC and KEGG annotations for data sets resulting from EST sequencing projects both rapidly and efficiently. The benefits of an underlying relational database, flexibility and the ease of use of the program make it ideally suited for non-model species EST-sequencing projects.</p
Recommended from our members
Exploratory Analysis of Mutations in Circulating Tumour DNA as Biomarkers of Treatment Response for Patients with Relapsed High-Grade Serous Ovarian Carcinoma: A Retrospective Study
Circulating tumour DNA (ctDNA) carrying tumour-specific sequence alterations may provide a minimally invasive means to dynamically assess tumour burden and response to treatment in cancer patients. Somatic mutations are a defining feature of high-grade serous ovarian carcinoma (HGSOC). We tested whether these mutations could be used as personalised markers to monitor tumour burden and early changes as a predictor of response and time to progression (TTP).
We performed a retrospective analysis of serial plasma samples collected during routine clinical visits from 40 patients with HGSOC undergoing heterogeneous standard of care treatment. Patient-specific assays were developed for 31 unique mutations identified in formalin-fixed paraffin-embedded tumour DNA from these patients. These assays were used to quantify ctDNA in 318 plasma samples using microfluidic digital PCR. The mutant allele fraction (TP53MAF) was compared to serum CA-125, the current gold-standard response marker for HGSOC in blood, as well as to disease volume on computed tomography scans by volumetric analysis. Changes after one cycle of treatment were compared with TTP. The median TP53MAF prior to treatment in 51 relapsed treatment courses was 8% (interquartile range [IQR] 1.2%-22%) compared to 0.7% (IQR 0.3%-2.0%) for seven untreated newly diagnosed stage IIIC/IV patients. TP53MAF correlated with volumetric measurements (Pearson = 0.59, 32 cm, ctDNA was detected at ≥20 amplifiable copies per millilitre of plasma. In 49 treatment courses for relapsed disease, pre-treatment TP53MAF concentration, but not CA-125, was associated with TTP. Response to chemotherapy was seen earlier with ctDNA, with a median time to nadir of 37 d (IQR 28-54) compared with a median time to nadir of 84 d (IQR 42-116) for CA-125. In 32 relapsed treatment courses evaluable for response after one cycle of chemotherapy, a decrease in TP53MAF of >60% was an independent predictor of TTP in multivariable analysis (hazard ratio 0.22, 95% CI 0.07-0.67, = 0.008). Conversely, a decrease in TP53MAF of ≤60% was associated with poor response and identified cases with TTP < 6 mo with 71% sensitivity (95% CI 42%-92%) and 88% specificity (95% CI 64%-99%). Specificity was improved when patients with recent drainage of ascites were excluded. Ascites drainage led to a reduction of TP53MAF concentration. The limitations of this study include retrospective design, small sample size, and heterogeneity of treatment within the cohort.
In this retrospective study, we demonstrated that ctDNA is correlated with volume of disease at the start of treatment in women with HGSOC and that a decrease of ≤60% in TP53MAF after one cycle of chemotherapy was associated with shorter TTP. These results provide evidence that ctDNA has the potential to be a highly specific early molecular response marker in HGSOC and warrants further investigation in larger cohorts receiving uniform treatment.This work was supported by Cancer Research UK Grant numbers: A15601 (JDB), A11906 (NR), A20240 (NR), A18072 (JDB). JDB was supported by the National Institute for Health Research Cambridge Biomedical Research Centre. CAP was supported in part by the Academy of Medical Sciences, the Wellcome Trust, British Heart Foundation and Arthritis Research UK
EST-PAC a web package for EST annotation and protein sequence prediction
With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST) from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST) annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1) searching local or remote biological databases for sequence similarities using Blast services, 2) predicting protein coding sequence from EST data and, 3) annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics
Genomic-Bioinformatic Analysis of Transcripts Enriched in the Third-Stage Larva of the Parasitic Nematode Ascaris suum
Differential transcription in Ascaris suum was investigated using a genomic-bioinformatic approach. A cDNA archive enriched for molecules in the infective third-stage larva (L3) of A. suum was constructed by suppressive-subtractive hybridization (SSH), and a subset of cDNAs from 3075 clones subjected to microarray analysis using cDNA probes derived from RNA from different developmental stages of A. suum. The cDNAs (n = 498) shown by microarray analysis to be enriched in the L3 were sequenced and subjected to bioinformatic analyses using a semi-automated pipeline (ESTExplorer). Using gene ontology (GO), 235 of these molecules were assigned to ‘biological process’ (n = 68), ‘cellular component’ (n = 50), or ‘molecular function’ (n = 117). Of the 91 clusters assembled, 56 molecules (61.5%) had homologues/orthologues in the free-living nematodes Caenorhabditis elegans and C. briggsae and/or other organisms, whereas 35 (38.5%) had no significant similarity to any sequences available in current gene databases. Transcripts encoding protein kinases, protein phosphatases (and their precursors), and enolases were abundantly represented in the L3 of A. suum, as were molecules involved in cellular processes, such as ubiquitination and proteasome function, gene transcription, protein–protein interactions, and function. In silico analyses inferred the C. elegans orthologues/homologues (n = 50) to be involved in apoptosis and insulin signaling (2%), ATP synthesis (2%), carbon metabolism (6%), fatty acid biosynthesis (2%), gap junction (2%), glucose metabolism (6%), or porphyrin metabolism (2%), although 34 (68%) of them could not be mapped to a specific metabolic pathway. Small numbers of these 50 molecules were predicted to be secreted (10%), anchored (2%), and/or transmembrane (12%) proteins. Functionally, 17 (34%) of them were predicted to be associated with (non-wild-type) RNAi phenotypes in C. elegans, the majority being embryonic lethality (Emb) (13 types; 58.8%), larval arrest (Lva) (23.5%) and larval lethality (Lvl) (47%). A genetic interaction network was predicted for these 17 C. elegans orthologues, revealing highly significant interactions for nine molecules associated with embryonic and larval development (66.9%), information storage and processing (5.1%), cellular processing and signaling (15.2%), metabolism (6.1%), and unknown function (6.7%). The potential roles of these molecules in development are discussed in relation to the known roles of their homologues/orthologues in C. elegans and some other nematodes. The results of the present study provide a basis for future functional genomic studies to elucidate molecular aspects governing larval developmental processes in A. suum and/or the transition to parasitism
Effort-related functions of nucleus accumbens dopamine and associated forebrain circuits
Background
Over the last several years, it has become apparent that there are critical problems with the hypothesis that brain dopamine (DA) systems, particularly in the nucleus accumbens, directly mediate the rewarding or primary motivational characteristics of natural stimuli such as food. Hypotheses related to DA function are undergoing a substantial restructuring, such that the classic emphasis on hedonia and primary reward is giving way to diverse lines of research that focus on aspects of instrumental learning, reward prediction, incentive motivation, and behavioral activation.
Objective
The present review discusses dopaminergic involvement in behavioral activation and, in particular, emphasizes the effort-related functions of nucleus accumbens DA and associated forebrain circuitry.
Results
The effects of accumbens DA depletions on food-seeking behavior are critically dependent upon the work requirements of the task. Lever pressing schedules that have minimal work requirements are largely unaffected by accumbens DA depletions, whereas reinforcement schedules that have high work (e.g., ratio) requirements are substantially impaired by accumbens DA depletions. Moreover, interference with accumbens DA transmission exerts a powerful influence over effort-related decision making. Rats with accumbens DA depletions reallocate their instrumental behavior away from food-reinforced tasks that have high response requirements, and instead, these rats select a less-effortful type of food-seeking behavior.
Conclusions
Along with prefrontal cortex and the amygdala, nucleus accumbens is a component of the brain circuitry regulating effort-related functions. Studies of the brain systems regulating effort-based processes may have implications for understanding drug abuse, as well as energy-related disorders such as psychomotor slowing, fatigue, or anergia in depression
Sequence Diversity in the Dickeya fliC Gene: Phylogeny of the Dickeya Genus and TaqMan® PCR for 'D. solani', New Biovar 3 Variant on Potato in Europe
Worldwide, Dickeya (formerly Erwinia chrysanthemi) is causing soft rot diseases on a large diversity of crops and ornamental plants. Strains affecting potato are mainly found in D. dadantii, D. dianthicola and D. zeae, which appear to have a marked geographical distribution. Furthermore, a few Dickeya isolates from potato are attributed to D. chrysanthemi and D. dieffenbachiae. In Europe, isolates of Erwinia chrysanthemi biovar 1 and biovar 7 from potato are now classified in D. dianthicola. However, in the past few years, a new Dickeya biovar 3 variant, tentatively named ‘Dickeya solani’, has emerged as a common major threat, in particular in seed potatoes. Sequences of a fliC gene fragment were used to generate a phylogeny of Dickeya reference strains from culture collections and with this reference backbone, to classify pectinolytic isolates, i.e. Dickeya spp. from potato and ornamental plants. The reference strains of the currently recognized Dickeya species and ‘D. solani’ were unambiguously delineated in the fliC phylogram. D. dadantii, D. dianthicola and ‘D. solani’ displayed unbranched clades, while D. chrysanthemi, D. zeae and D. dieffenbachiae branched into subclades and lineages. Moreover, Dickeya isolates from diagnostic samples, in particular biovar 3 isolates from greenhouse ornamentals, formed several new lineages. Most of these isolates were positioned between the clade of ‘D. solani’ and D. dadantii as transition variants. New lineages also appeared in D. dieffenbachiae and in D. zeae. The strains and isolates of D. dianthicola and ‘D. solani’ were differentiated by a fliC sequence useful for barcode identification. A fliC TaqMan®real-time PCR was developed for ‘D. solani’ and the assay was provisionally evaluated in direct analysis of diagnostic potato samples. This molecular tool can support the efforts to control this particular phytopathogen in seed potato certification
A transcriptomic analysis of Echinococcus granulosus larval stages:implications for parasite biology and host adaptation
The cestode Echinococcus granulosus--the agent of cystic echinococcosis, a zoonosis affecting humans and domestic animals worldwide--is an excellent model for the study of host-parasite cross-talk that interfaces with two mammalian hosts. To develop the molecular analysis of these interactions, we carried out an EST survey of E. granulosus larval stages. We report the salient features of this study with a focus on genes reflecting physiological adaptations of different parasite stages.We generated ~10,000 ESTs from two sets of full-length enriched libraries (derived from oligo-capped and trans-spliced cDNAs) prepared with three parasite materials: hydatid cyst wall, larval worms (protoscoleces), and pepsin/H(+)-activated protoscoleces. The ESTs were clustered into 2700 distinct gene products. In the context of the biology of E. granulosus, our analyses reveal: (i) a diverse group of abundant long non-protein coding transcripts showing homology to a middle repetitive element (EgBRep) that could either be active molecular species or represent precursors of small RNAs (like piRNAs); (ii) an up-regulation of fermentative pathways in the tissue of the cyst wall; (iii) highly expressed thiol- and selenol-dependent antioxidant enzyme targets of thioredoxin glutathione reductase, the functional hub of redox metabolism in parasitic flatworms; (iv) candidate apomucins for the external layer of the tissue-dwelling hydatid cyst, a mucin-rich structure that is critical for survival in the intermediate host; (v) a set of tetraspanins, a protein family that appears to have expanded in the cestode lineage; and (vi) a set of platyhelminth-specific gene products that may offer targets for novel pan-platyhelminth drug development.This survey has greatly increased the quality and the quantity of the molecular information on E. granulosus and constitutes a valuable resource for gene prediction on the parasite genome and for further genomic and proteomic analyses focused on cestodes and platyhelminths
A Family of Diverse Kunitz Inhibitors from Echinococcus granulosus Potentially Involved in Host-Parasite Cross-Talk
The cestode Echinococcus granulosus, the agent of hydatidosis/echinococcosis, is remarkably well adapted to its definitive host. However, the molecular mechanisms underlying the successful establishment of larval worms (protoscoleces) in the dog duodenum are unknown. With the aim of identifying molecules participating in the E. granulosus-dog cross-talk, we surveyed the transcriptomes of protoscoleces and protoscoleces treated with pepsin at pH 2. This analysis identified a multigene family of secreted monodomain Kunitz proteins associated mostly with pepsin/H+-treated worms, suggesting that they play a role at the onset of infection. We present the relevant molecular features of eight members of the E. granulosus Kunitz family (EgKU-1 – EgKU-8). Although diverse, the family includes three pairs of close paralogs (EgKU-1/EgKU-4; EgKU-3/EgKU-8; EgKU-6/EgKU-7), which would be the products of recent gene duplications. In addition, we describe the purification of EgKU-1 and EgKU-8 from larval worms, and provide data indicating that some members of the family (notably, EgKU-3 and EgKU-8) are secreted by protoscoleces. Detailed kinetic studies with native EgKU-1 and EgKU-8 highlighted their functional diversity. Like most monodomain Kunitz proteins, EgKU-8 behaved as a slow, tight-binding inhibitor of serine proteases, with global inhibition constants (KI*) versus trypsins in the picomolar range. In sharp contrast, EgKU-1 did not inhibit any of the assayed peptidases. Interestingly, molecular modeling revealed structural elements associated with activity in Kunitz cation-channel blockers. We propose that this family of inhibitors has the potential to act at the E. granulosus-dog interface and interfere with host physiological processes at the initial stages of infection
Detection of Gamma-Ray Emission from the Starburst Galaxies M82 and NGC 253 with the Large Area Telescope on Fermi
We report the detection of high-energy gamma-ray emission from two starburst
galaxies using data obtained with the Large Area Telescope on board the Fermi
Gamma-ray Space Telescope. Steady point-like emission above 200 MeV has been
detected at significance levels of 6.8 sigma and 4.8 sigma respectively, from
sources positionally coincident with locations of the starburst galaxies M82
and NGC 253. The total fluxes of the sources are consistent with gamma-ray
emission originating from the interaction of cosmic rays with local
interstellar gas and radiation fields and constitute evidence for a link
between massive star formation and gamma-ray emission in star-forming galaxies.Comment: Submitted to ApJ Letter
Fermi Gamma-ray Imaging of a Radio Galaxy
The Fermi Gamma-ray Space Telescope has detected the gamma-ray glow emanating
from the giant radio lobes of the radio galaxy Centaurus A. The resolved
gamma-ray image shows the lobes clearly separated from the central active
source. In contrast to all other active galaxies detected so far in high-energy
gamma-rays, the lobe flux constitutes a considerable portion (>1/2) of the
total source emission. The gamma-ray emission from the lobes is interpreted as
inverse Compton scattered relic radiation from the cosmic microwave background
(CMB), with additional contribution at higher energies from the
infrared-to-optical extragalactic background light (EBL). These measurements
provide gamma-ray constraints on the magnetic field and particle energy content
in radio galaxy lobes, and a promising method to probe the cosmic relic photon
fields.Comment: 27 pages, includes Supplementary Online Material; corresponding
authors: C.C. Cheung, Y. Fukazawa, J. Knodlseder, L. Stawar
- …