    Short clones or long clones? A simulation study on the use of paired reads in metagenomics

    BACKGROUND: Metagenomics is the study of environmental samples using sequencing. Rapid advances in sequencing technology are fueling a vast increase in the number and scope of metagenomics projects. Most metagenome sequencing projects so far have been based on Sanger or Roche-454 sequencing, as only these technologies provide long enough reads, while Illumina sequencing has not been considered suitable for metagenomic studies due to a short read length of only 35 bp. However, now that reads of length 75 bp can be sequenced in pairs, Illumina sequencing has become a viable option for metagenome studies. RESULTS: This paper addresses the problem of taxonomical analysis of paired reads. We describe a new feature of our metagenome analysis software MEGAN that allows one to process sequencing reads in pairs and makes assignments of such reads based on the combined bit scores of their matches to reference sequences. Using this new software in a simulation study, we investigate the use of Illumina paired-sequencing in taxonomical analysis and compare the performance of single reads, short clones and long clones. In addition, we also compare against simulated Roche-454 sequencing runs. CONCLUSION: This work shows that paired reads perform better than single reads, as expected, but also, perhaps slightly less obviously, that long clones allow more specific assignments than short ones. A new version of the program MEGAN that explicitly takes paired reads into account is available from our website.
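
The idea of assigning a read pair from the combined bit scores of both mates can be illustrated with a minimal Python sketch. This is not MEGAN's code: the function names, the `top_fraction` cutoff and the example scores are all assumptions made for illustration, and a real tool would go on to place the retained references in a taxonomy.

```python
def combine_pair_scores(hits_a, hits_b):
    """Sum bit scores per reference over both mates of a read pair.

    hits_a and hits_b map a reference name to the bit score of that
    mate's match; a reference hit by only one mate keeps its single score.
    """
    combined = dict(hits_a)
    for ref, score in hits_b.items():
        combined[ref] = combined.get(ref, 0.0) + score
    return combined


def best_references(scores, top_fraction=0.9):
    """Keep references scoring within top_fraction of the best score.

    top_fraction is a hypothetical cutoff chosen for this sketch.
    """
    if not scores:
        return []
    best = max(scores.values())
    return sorted(ref for ref, s in scores.items() if s >= top_fraction * best)


# Made-up scores: mate A matches two references almost equally well,
# mate B matches only one of them.
mate_a = {"E. coli": 60.0, "Salmonella": 58.0}
mate_b = {"E. coli": 70.0}

single = best_references(mate_a)                               # ['E. coli', 'Salmonella']
paired = best_references(combine_pair_scores(mate_a, mate_b))  # ['E. coli']
```

In this toy case the single read is ambiguous between two references, while the pair supports only one, which is the kind of gain in specificity the abstract reports for paired reads.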

    Analysis and comparison of very large metagenomes with fast clustering and functional annotation

    BACKGROUND: The remarkable advance of metagenomics presents significant new challenges in data analysis. Metagenomic datasets (metagenomes) are large collections of sequencing reads from anonymous species within particular environments. Computational analyses for very large metagenomes are extremely time-consuming, and there are often many novel sequences in these metagenomes that are not fully utilized. The number of available metagenomes is rapidly increasing, so fast and efficient metagenome comparison methods are in great demand. RESULTS: The new metagenomic data analysis method Rapid Analysis of Multiple Metagenomes with a Clustering and Annotation Pipeline (RAMMCAP) was developed using an ultra-fast sequence clustering algorithm, fast protein family annotation tools, and a novel statistical metagenome comparison method that employs a unique graphic interface. RAMMCAP processes extremely large datasets with only moderate computational effort. It identifies raw read clusters and protein clusters that may include novel gene families, and compares metagenomes using clusters or functional annotations calculated by RAMMCAP. In this study, RAMMCAP was applied to the two largest available metagenomic collections, the "Global Ocean Sampling" and the "Metagenomic Profiling of Nine Biomes". CONCLUSION: RAMMCAP is a very fast method that can cluster and annotate one million metagenomic reads in only hundreds of CPU hours. It is available from http://tools.camera.calit2.net/camera/rammcap/.

    Hippocampal volume in early onset depression

    BACKGROUND: Abnormalities in limbic structures have been implicated in major depressive disorder (MDD). Although MDD is as common in adolescence as in adulthood, few studies have examined youth near illness onset in order to determine the possible influence of atypical development on the pathophysiology of this disorder. METHODS: Hippocampal volumes were measured in 17 MDD subjects (age = 16.67 ± 1.83 years [mean ± SD]; range = 13 – 18 years) and 17 age- and sex-matched healthy controls (16.23 ± 1.61 years [mean ± SD]; 13 – 18 years) using magnetic resonance imaging (MRI). RESULTS: An analysis of covariance revealed a significant difference between MDD and control subjects (F = 8.66, df = 1, 29, P = 0.006). This was more strongly localized to the left hippocampus (P = 0.001) than the right hippocampus (P = 0.047). CONCLUSIONS: Our findings provide new evidence of abnormalities in the hippocampus in early onset depression. However, our results should be considered preliminary given the small sample size studied.

    Malignant mesothelioma

    Malignant mesothelioma is a fatal asbestos-associated malignancy originating from the lining cells (mesothelium) of the pleural and peritoneal cavities, as well as the pericardium and the tunica vaginalis. The exact prevalence is unknown but it is estimated that mesotheliomas represent less than 1% of all cancers. Its incidence is increasing, with an expected peak in the next 10–20 years. Pleural malignant mesothelioma is the most common form of mesothelioma. Typical presenting features are chest pain and dyspnoea. Breathlessness due to a pleural effusion without chest pain is reported in about 30% of patients. A chest wall mass, weight loss, sweating, abdominal pain and ascites (due to peritoneal involvement) are less common presentations. Mesothelioma is directly attributable to occupational asbestos exposure, with a history of exposure in over 90% of cases. There is also evidence that mesothelioma may result from both para-occupational exposure and non-occupational "environmental" exposure. Idiopathic or spontaneous mesothelioma can also occur in the absence of any exposure to asbestos, with a spontaneous rate in humans of around one per million. An accurate exposure history, together with examination, radiology and pathology, is essential to make the diagnosis. Distinguishing malignant from benign pleural disease can be challenging. The most helpful CT findings suggesting malignant pleural disease are 1) a circumferential pleural rind, 2) nodular pleural thickening, 3) pleural thickening of > 1 cm and 4) mediastinal pleural involvement. Involvement of a multidisciplinary team is recommended to ensure prompt and appropriate management, using a framework of radiotherapy, chemotherapy, surgery and symptom palliation with end of life care. Compensation issues must also be considered. Life expectancy in malignant mesothelioma is poor, with a median survival of about one year following diagnosis.

    The effect of tidal forcing on biogeochemical processes in intertidal salt marsh sediments

    BACKGROUND: Early diagenetic processes involved in natural organic matter (NOM) oxidation in marine sediments have for the most part been characterized after collecting sediment cores and extracting porewaters. These techniques have proven useful for deep-sea sediments, where biogeochemical processes are limited to aerobic respiration, denitrification, and manganese reduction and span several centimeters. In coastal marine sediments, however, the concentration of NOM is so high that the spatial resolution needed to characterize these processes cannot be achieved with conventional sampling techniques. In addition, coastal sediments are influenced by tidal forcing that likely affects the processes involved in carbon oxidation. RESULTS: In this study, we used in situ voltammetry to determine the role of tidal forcing on early diagenetic processes in intertidal salt marsh sediments. We compare ex situ measurements collected seasonally, in situ profiling measurements, and in situ time series collected at several depths in the sediment during tidal cycles at two distinct stations, a small perennial creek and a mud flat. Our results indicate that the tides coupled to the salt marsh topography drastically influence the distribution of redox geochemical species and may be responsible for local differences noted year-round in the same sediments. Monitoring wells deployed to observe the effects of the tides on the vertical component of porewater transport reveal that creek sediments, because of their confinement, are exposed to much higher hydrostatic pressure gradients than mud flats. CONCLUSION: Our study indicates that iron reduction can be sustained in intertidal creek sediments by a combination of physical forcing and chemical oxidation, while intertidal mud flat sediments are mainly subject to sulfate reduction. These processes likely allow microbial iron reduction to be an important terminal electron accepting process in intertidal coastal sediments.

    Declining mortality following acute myocardial infarction in the Department of Veterans Affairs Health Care System

    BACKGROUND: Mortality from acute myocardial infarction (AMI) is declining worldwide. We sought to determine if mortality in the Veterans Health Administration (VHA) has also been declining. METHODS: We calculated 30-day mortality rates between 2004 and 2006 using data from the VHA External Peer Review Program (EPRP), which entails detailed abstraction of records of all patients with AMI. To compare trends within VHA with other systems of care, we estimated relative mortality rates between 2000 and 2005 for all males 65 years and older with a primary diagnosis of AMI using administrative data from the VHA Patient Treatment File and the Medicare Provider Analysis and Review (MedPAR) files. RESULTS: Using EPRP data on 11,609 patients, we observed a statistically significant decline in adjusted 30-day mortality following AMI in VHA from 16.3% in 2004 to 13.9% in 2006, a relative decrease of 15% and a decrease in the odds of dying of 10% per year (p = .011). Similar declines were found for in-hospital and 90-day mortality. Based on administrative data on 27,494 VHA patients age 65 years and older and 789,400 Medicare patients, 30-day mortality following AMI declined from 16.0% during 2000-2001 to 15.7% during 2004-June 2005 in VHA and from 16.7% to 15.5% in private sector hospitals. After adjusting for patient characteristics and hospital effects, the overall relative odds of death were similar for VHA and Medicare (odds ratio 1.02, 95% C.I. 0.96-1.08). CONCLUSION: Mortality following AMI within VHA has declined significantly since 2003 at a rate that parallels that in Medicare-funded hospitals.
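
The "relative decrease of 15%" quoted in this abstract follows directly from the two mortality rates. A minimal Python sketch of the arithmetic is given here; the helper name `relative_decrease` is an assumption for illustration, and the inputs are the percentages reported in the abstract.

```python
def relative_decrease(before, after):
    """Fractional drop from `before` to `after`, relative to `before`."""
    return (before - after) / before


# Adjusted 30-day mortality in VHA (EPRP data): 16.3% in 2004, 13.9% in 2006.
eprp = relative_decrease(16.3, 13.9)      # about 0.147, which rounds to 15%

# Administrative data, 2000-2001 versus 2004-June 2005.
vha = relative_decrease(16.0, 15.7)       # about 0.019
medicare = relative_decrease(16.7, 15.5)  # about 0.072
```

These are simple ratios of the reported rates. The adjusted odds figures in the abstract come from the authors' models and cannot be reproduced from the percentages alone.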

    Accurate Genome Relative Abundance Estimation Based on Shotgun Metagenomic Reads

    Accurate estimation of microbial community composition based on metagenomic sequencing data is fundamental for subsequent metagenomics analysis. Prevalent estimation methods, which are mainly based on directly summarizing alignment results or variants of them, often produce biased and/or unstable estimates. We have developed a unified probabilistic framework (named GRAMMy) by explicitly modeling read assignment ambiguities, genome size biases and read distributions along the genomes. A maximum likelihood method is employed to compute the Genome Relative Abundance of microbial communities using Mixture Model theory (GRAMMy). GRAMMy has been demonstrated to give estimates that are accurate and robust across both simulated and real read benchmark datasets. We applied GRAMMy to a collection of 34 metagenomic read sets from four metagenomics projects and identified 99 frequent species (minimally 0.5% abundant in at least 50% of the datasets) in the human gut samples. Our results show substantial improvements over previous studies, such as adjusting the over-estimated abundance for Bacteroides species for human gut samples, by providing a new reference-based strategy for metagenomic sample comparisons. GRAMMy can be used flexibly with many read assignment tools (mapping, alignment or composition-based) even with low-sensitivity mapping results from huge short-read datasets. It will be increasingly useful as an accurate and robust tool for abundance estimation with the growing size of read sets and the expanding database of reference genomes.
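
The combination of ambiguous read assignments, a mixture model and genome size correction described in this abstract can be sketched with a generic expectation-maximization loop. This is an illustration under stated assumptions, not the GRAMMy implementation: the function names, the 0/1 read-to-genome scores, the genome lengths and the choice to divide mixing proportions by genome length are all hypothetical.

```python
def em_mixing_proportions(read_scores, n_iter=200):
    """Estimate mixture proportions by EM.

    read_scores[i][j] is a non-negative score that read i came from
    genome j (here simply 1 for a hit and 0 for no hit).
    """
    n_genomes = len(read_scores[0])
    pi = [1.0 / n_genomes] * n_genomes
    for _ in range(n_iter):
        totals = [0.0] * n_genomes
        for row in read_scores:
            weights = [p * s for p, s in zip(pi, row)]
            norm = sum(weights)
            if norm == 0:
                continue  # read matches no genome; ignore it
            # E-step: share the read among genomes in proportion to weight
            for j, w in enumerate(weights):
                totals[j] += w / norm
        total = sum(totals)
        if total == 0:
            return pi
        # M-step: new proportions are the normalized expected read counts
        pi = [t / total for t in totals]
    return pi


def size_corrected_abundance(pi, genome_lengths):
    """Turn read proportions into genome abundances by dividing by length."""
    raw = [p / length for p, length in zip(pi, genome_lengths)]
    total = sum(raw)
    return [r / total for r in raw]


# Toy data: 6 reads hit only genome 0, 2 hit only genome 1, 2 hit both.
reads = [[1, 0]] * 6 + [[0, 1]] * 2 + [[1, 1]] * 2
pi = em_mixing_proportions(reads)
abundance = size_corrected_abundance(pi, [3_000_000, 1_000_000])
```

In this toy case the read proportions converge to 0.75 and 0.25, but because the first genome is assumed to be three times longer, the size-corrected abundances come out equal. This is the kind of genome size bias the abstract says naive read counting does not account for.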

    How Spatial Heterogeneity of Cover Affects Patterns of Shrub Encroachment into Mesic Grasslands

    We used a multi-method approach to analyze the spatial patterns of shrubs and cover types (plant species, litter or bare soil) in grassland-shrubland ecotones. This approach allows us to assess how fine-scale spatial heterogeneity of cover types affects the patterns of Cytisus balansae shrub encroachment into mesic mountain grasslands (Catalan Pyrenees, Spain). Spatial patterns and the spatial associations between juvenile shrubs and different cover types were assessed in mesic grasslands dominated by species with different palatabilities (palatable grass Festuca nigrescens and unpalatable grass Festuca eskia). A new index, called RISES (“Relative Index of Shrub Encroachment Susceptibility”), was proposed to calculate the chances of shrub encroachment into a given grassland, combining the magnitude of the spatial associations and the surface area for each cover type. Overall, juveniles showed positive associations with palatable F. nigrescens and negative associations with unpalatable F. eskia, although these associations shifted with shrub development stage. In F. eskia grasslands, bare soil showed a low scale of pattern and positive associations with juveniles. Although the highest RISES values were found in F. nigrescens plots, the number of juvenile Cytisus was similar in both types of grasslands. However, F. nigrescens grasslands showed the greatest number of juveniles in early development stages (i.e. height < 10 cm), whereas F. eskia grasslands showed the greatest number of juveniles in late development stages (i.e. height > 30 cm). We concluded that in F. eskia grasslands, where establishment may be constrained by the dominant cover type, the low scale of pattern on bare soil may result in higher chances of shrub establishment and survival. In contrast, although grasslands dominated by the palatable F. nigrescens may be more susceptible to shrub establishment, current grazing rates may reduce juvenile survival.

    Unusual Regulation of a Leaderless Operon Involved in the Catabolism of Dimethylsulfoniopropionate in Rhodobacter sphaeroides

    Rhodobacter sphaeroides strain 2.4.1 is a widely studied bacterium that has recently been shown to cleave the abundant marine anti-stress molecule dimethylsulfoniopropionate (DMSP) into acrylate plus gaseous dimethyl sulfide. It does so by using a lyase encoded by dddL, the promoter-distal gene of a three-gene operon, acuR-acuI-dddL. Transcription of the operon was enhanced when cells were pre-grown with the substrate DMSP, but this induction is indirect and requires the conversion of DMSP to the product acrylate, the bona fide co-inducer. This regulation is mediated by the product of the promoter-proximal gene acuR, a transcriptional regulator in the TetR family. AcuR represses the operon in the absence of acrylate, but this repression is relieved by the presence of the co-inducer. Another unusual regulatory feature is that the acuR-acuI-dddL mRNA transcript is leaderless, such that acuR lacks a Shine-Dalgarno ribosomal binding site and 5′-UTR, and is translated at a lower level compared to the downstream genes. This regulatory unit may be quite widespread in bacteria, since several other taxonomically diverse lineages have adjacent acuR-like and acuI-like genes; these operons also have no 5′ leader sequences or ribosomal binding sites and their predicted cis-acting regulatory sequences resemble those of R. sphaeroides acuR-acuI-dddL.

    Evaluating the Fidelity of De Novo Short Read Metagenomic Assembly Using Simulated Data

    A frequent step in metagenomic data analysis is the assembly of the sequenced reads. Many assembly tools targeting data from next-generation sequencing (NGS) technologies have been published in recent years, but these assemblers have not been designed for or tested in the multi-genome scenarios that characterize metagenomic studies. Here we provide a critical assessment of current de novo short-read assembly tools in multi-genome scenarios using complex simulated metagenomic data. With this approach we tested the fidelity of different assemblers in metagenomic studies, demonstrating that even under the simplest compositions the number of chimeric contigs involving different species is noticeable. We further showed that the assembly process reduces the accuracy of the functional classification of the metagenomic data and that these errors can be overcome by raising the coverage of the studied metagenome. The results presented here highlight the particular difficulties that de novo genome assemblers face in multi-genome scenarios, demonstrating that these difficulties, which often compromise the functional classification of the analyzed data, can be overcome with a high sequencing effort.