211 research outputs found

    MetaMine – A tool to detect and analyse gene patterns in their environmental context

    Get PDF
    Background Modern sequencing technologies allow rapid sequencing and bioinformatic analysis of genomes and metagenomes. With every new sequencing project a vast number of new proteins become available with many genes remaining functionally unclassified based on evidences from sequence similarities alone. Extending similarity searches with gene pattern approaches, defined as genes sharing a distinct genomic neighbourhood, have shown to significantly improve the number of functional assignments. Further functional evidences can be gained by correlating these gene patterns with prevailing environmental parameters. MetaMine was developed to approach the large pool of unclassified proteins by searching for recurrent gene patterns across habitats based on key genes. Results MetaMine is an interactive data mining tool which enables the detection of gene patterns in an environmental context. The gene pattern search starts with a user defined environmentally interesting key gene. With this gene a BLAST search is carried out against the Microbial Ecological Genomics DataBase (MEGDB) containing marine genomic and metagenomic sequences. This is followed by the determination of all neighbouring genes within a given distance and a search for functionally equivalent genes. In the final step a set of common genes present in a defined number of distinct genomes is determined. The gene patterns found are associated with their individual pattern instances describing gene order and directions. They are presented together with information about the sample and the habitat. MetaMine is implemented in Java and provided as a client/server application with a user-friendly graphical user interface. The system was evaluated with environmentally relevant genes related to the methane-cycle and carbon monoxide oxidation. Conclusion MetaMine offers a targeted, semi-automatic search for gene patterns based on expert input. The graphical user interface of MetaMine provides a user-friendly overview of the computed gene patterns for further inspection in an ecological context. Prevailing biological processes associated with a key gene can be used to infer new annotations and shape hypotheses to guide further analyses. The use-cases demonstrate that meaningful gene patterns can be quickly detected using MetaMine

    Short clones or long clones? A simulation study on the use of paired reads in metagenomics

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Metagenomics is the study of environmental samples using sequencing. Rapid advances in sequencing technology are fueling a vast increase in the number and scope of metagenomics projects. Most metagenome sequencing projects so far have been based on Sanger or Roche-454 sequencing, as only these technologies provide long enough reads, while Illumina sequencing has not been considered suitable for metagenomic studies due to a short read length of only 35 bp. However, now that reads of length 75 bp can be sequenced in pairs, Illumina sequencing has become a viable option for metagenome studies.</p> <p>Results</p> <p>This paper addresses the problem of taxonomical analysis of paired reads. We describe a new feature of our metagenome analysis software MEGAN that allows one to process sequencing reads in pairs and makes assignments of such reads based on the combined bit scores of their matches to reference sequences. Using this new software in a simulation study, we investigate the use of Illumina paired-sequencing in taxonomical analysis and compare the performance of single reads, short clones and long clones. In addition, we also compare against simulated Roche-454 sequencing runs.</p> <p>Conclusion</p> <p>This work shows that paired reads perform better than single reads, as expected, but also, perhaps slightly less obviously, that long clones allow more specific assignments than short ones. A new version of the program MEGAN that explicitly takes paired reads into account is available from our website.</p

    Psychological distress in newly diagnosed colorectal cancer patients following microsatellite instability testing for Lynch syndrome on the pathologist’s initiative

    Get PDF
    According to the Dutch Guideline on Hereditary Colorectal Cancer published in 2008, patients with recently diagnosed colorectal cancer (CRC) should undergo microsatellite instability (MSI) testing by a pathologist immediately after tumour resection if they are younger than 50 years, or if a second CRC has been diagnosed before the age of 70 years, owing to the high risk of Lynch syndrome (MIPA). The aim of the present MIPAPS study was to investigate general distress and cancer-specific distress following MSI testing. From March 2007 to September 2009, 400 patients who had been tested for MSI after newly diagnosed CRC were recruited from 30 Dutch hospitals. Levels of general distress (SCL-90) and cancer-specific distress (IES) were assessed immediately after MSI result disclosure (T1) and 6 months later (T2). Response rates were 23/77 (30%) in the MSI-positive patients and 58/323 (18%) in the MSI-negative patients. Levels of general distress and cancer-specific distress were moderate. In the MSI-positive group, 27% of the patients had high general distress at T1 versus 18% at T2 (p = 0.5), whereas in the MSI-negative group, these percentage were 14 and 18% (p = 0.6), respectively. At T1 and T2, cancer-specific distress rates in the MSI-positive group and MSI-negative group were 39 versus 27% (p = 0.3) and 38 versus 36% (p = 1.0), respectively. High levels of general distress were correlated with female gender, low social support and high perceived cancer risk. Moderate levels of distress were observed after MSI testing, similar to those found in other patients diagnosed with CRC. Immediately after result disclosure, high cancer-specific distress was observed in 40% of the MSI-positive patients

    Predicting protein linkages in bacteria: Which method is best depends on task

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Applications of computational methods for predicting protein functional linkages are increasing. In recent years, several bacteria-specific methods for predicting linkages have been developed. The four major genomic context methods are: Gene cluster, Gene neighbor, Rosetta Stone, and Phylogenetic profiles. These methods have been shown to be powerful tools and this paper provides guidelines for when each method is appropriate by exploring different features of each method and potential improvements offered by their combination. We also review many previous treatments of these prediction methods, use the latest available annotations, and offer a number of new observations.</p> <p>Results</p> <p>Using <it>Escherichia coli </it>K12 and <it>Bacillus subtilis</it>, linkage predictions made by each of these methods were evaluated against three benchmarks: functional categories defined by COG and KEGG, known pathways listed in EcoCyc, and known operons listed in RegulonDB. Each evaluated method had strengths and weaknesses, with no one method dominating all aspects of predictive ability studied. For functional categories, as previous studies have shown, the Rosetta Stone method was individually best at detecting linkages and predicting functions among proteins with shared KEGG categories while the Phylogenetic profile method was best for linkage detection and function prediction among proteins with common COG functions. Differences in performance under COG versus KEGG may be attributable to the presence of paralogs. Better function prediction was observed when using a weighted combination of linkages based on reliability versus using a simple unweighted union of the linkage sets. For pathway reconstruction, 99 complete metabolic pathways in <it>E. coli </it>K12 (out of the 209 known, non-trivial pathways) and 193 pathways with 50% of their proteins were covered by linkages from at least one method. Gene neighbor was most effective individually on pathway reconstruction, with 48 complete pathways reconstructed. For operon prediction, Gene cluster predicted completely 59% of the known operons in <it>E. coli </it>K12 and 88% (333/418)in <it>B. subtilis</it>. Comparing two versions of the <it>E. coli </it>K12 operon database, many of the unannotated predictions in the earlier version were updated to true predictions in the later version. Using only linkages found by both Gene Cluster and Gene Neighbor improved the precision of operon predictions. Additionally, as previous studies have shown, combining features based on intergenic region and protein function improved the specificity of operon prediction.</p> <p>Conclusion</p> <p>A common problem for computational methods is the generation of a large number of false positives that might be caused by an incomplete source of validation. By comparing two versions of a database, we demonstrated the dramatic differences on reported results. We used several benchmarks on which we have shown the comparative effectiveness of each prediction method, as well as provided guidelines as to which method is most appropriate for a given prediction task.</p

    Methods for comparative metagenomics

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Metagenomics is a rapidly growing field of research that aims at studying uncultured organisms to understand the true diversity of microbes, their functions, cooperation and evolution, in environments such as soil, water, ancient remains of animals, or the digestive system of animals and humans. The recent development of ultra-high throughput sequencing technologies, which do not require cloning or PCR amplification, and can produce huge numbers of DNA reads at an affordable cost, has boosted the number and scope of metagenomic sequencing projects. Increasingly, there is a need for new ways of comparing multiple metagenomics datasets, and for fast and user-friendly implementations of such approaches.</p> <p>Results</p> <p>This paper introduces a number of new methods for interactively exploring, analyzing and comparing multiple metagenomic datasets, which will be made freely available in a new, comparative version 2.0 of the stand-alone metagenome analysis tool MEGAN.</p> <p>Conclusion</p> <p>There is a great need for powerful and user-friendly tools for comparative analysis of metagenomic data and MEGAN 2.0 will help to fill this gap.</p

    Tolerance and rebound with zafirlukast in patients with persistent asthma

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The potential for tolerance to develop to zafirlukast, a cysteinyl leukotriene (CysLT) receptor antagonist (LRA) in persistent asthma, has not been specifically examined.</p> <p>Objective</p> <p>To look for any evidence of tolerance and potential for short-term clinical worsening on LRA withdrawal. Outcome measures included changes in; airway hyperresponsiveness to inhaled methacholine (PD<sub>20</sub>FEV<sub>1</sub>), daily symptoms and peak expiratory flows (PEF), sputum and blood cell profiles, sputum CysLT and prostaglandin (PG)E<sub>2 </sub>and exhaled nitric oxide (eNO) levels.</p> <p>Methods</p> <p>A double blind, placebo-controlled study of zafirlukast, 20 mg twice daily over 12 weeks in 21 asthmatics taking β<sub>2</sub>-agonists only (Group I), and 24 subjects treated with ICS (Group II).</p> <p>Results</p> <p>In Group I, zafirlukast significantly improved morning PEF and FEV<sub>1</sub>compared to placebo (p < 0.01), and reduced morning waking with asthma from baseline after two weeks (p < 0.05). Similarly in Group II, FEV<sub>1 </sub>improved compared to placebo (p < 0.05), and there were early within-treatment group improvements in morning PEF, β<sub>2</sub>-agonist use and asthma severity scores (p < 0.05). However, most improvements with zafirlukast in Group I and to a lesser extent in Group II deteriorated toward baseline values over 12 weeks. In both groups, one week following zafirlukast withdrawal there were significant deteriorations in morning and evening PEFs and FEV<sub>1 </sub>compared with placebo (p ≤ 0.05) and increased nocturnal awakenings in Group II (p < 0.05). There were no changes in PD<sub>20</sub>FEV<sub>1</sub>, sputum CysLT concentrations or exhaled nitric oxide (eNO) levels. However, blood neutrophils significantly increased in both groups following zafirlukast withdrawal compared to placebo (p = 0.007).</p> <p>Conclusion</p> <p>Tolerance appears to develop to zafirlukast and there is rebound clinical deterioration on drug withdrawal, accompanied by a blood neutrophilia.</p

    A fluorescence approach to investigate repartitioning of coalescing agents in acrylic polymer emulsions

    Get PDF
    Repartitioning of co-solvents between particles of latex emulsions was investigated by means of a fluorescence method based on the detection of the amount of co-solvent via the solvatochromic shift of the emission maximum of a fluorescent probe, copolymerized at a low concentration. Complete repartitioning of co-solvents between particles of latex materials with a low Tg (ca. 25 °C) occurred within minutes. For a hydrophilic latex with a Tg of 68 °C, equilibration was achieved within an hour. Repartitioning was faster for more hydrophobic co-solvents. For a hydrophobic latex of similar Tg, co-solvent repartitioning took place on the same time scale, but complete equilibration was not reached. Possibly, there is an additional slow component in the repartitioning, or the prolonged presence of co-solvent causes a structural change in the latex particles that affects the outcome of the experiment

    Predicted Relative Metabolomic Turnover (PRMT): determining metabolic turnover from a coastal marine metagenomic dataset

    Get PDF
    We present an approach in which the semantics of an XML language is defined by means of a transformation from an XML document model (an XML schema) to an application specific model. The application specific model implements the intended behavior of documents written in the language. A transformation is specified in a model transformation language used in the Model Driven Architecture (MDA) approach for software development. Our approach provides a better separation of three concerns found in XML applications: syntax, syntax processing logic and intended meaning of the syntax. It frees the developer of low-level syntactical details and improves the adaptability and reusability of XML applications. Declarative transformation rules and the explicit application model provide a finer control over the application parts affected by adaptations. Transformation rules and the application model for an XML language may be composed with the corresponding rules and application models defined for other XML languages. In that way we achieve reuse and composition of XML applications
    corecore