31 research outputs found

    Metassembler: merging and optimizing de novo genome assemblies

    Get PDF
    Genome assembly projects typically run multiple algorithms in an attempt to find the single best assembly, although those assemblies often have complementary, if untapped, strengths and weaknesses. We present our metassembler algorithm that merges multiple assemblies of a genome into a single superior sequence. We apply it to the four genomes from the Assemblathon competitions and show it consistently and substantially improves the contiguity and quality of each assembly. We also develop guidelines for meta-assembly by systematically evaluating 120 permutations of merging the top 5 assemblies of the first Assemblathon competition. The software is open-source at http://metassembler.sourceforge.net

    Estimation of the national disease burden of influenza-associated severe acute respiratory illness in Kenya and Guatemala : a novel methodology

    Get PDF
    Background: Knowing the national disease burden of severe influenza in low-income countries can inform policy decisions around influenza treatment and prevention. We present a novel methodology using locally generated data for estimating this burden. Methods and Findings: This method begins with calculating the hospitalized severe acute respiratory illness (SARI) incidence for children <5 years old and persons ≥5 years old from population-based surveillance in one province. This base rate of SARI is then adjusted for each province based on the prevalence of risk factors and healthcare-seeking behavior. The percentage of SARI with influenza virus detected is determined from provincial-level sentinel surveillance and applied to the adjusted provincial rates of hospitalized SARI. Healthcare-seeking data from healthcare utilization surveys is used to estimate non-hospitalized influenza-associated SARI. Rates of hospitalized and non-hospitalized influenza-associated SARI are applied to census data to calculate the national number of cases. The method was field-tested in Kenya, and validated in Guatemala, using data from August 2009–July 2011. In Kenya (2009 population 38.6 million persons), the annual number of hospitalized influenza-associated SARI cases ranged from 17,129–27,659 for children <5 years old (2.9–4.7 per 1,000 persons) and 6,882–7,836 for persons ≥5 years old (0.21–0.24 per 1,000 persons), depending on year and base rate used. In Guatemala (2011 population 14.7 million persons), the annual number of hospitalized cases of influenza-associated pneumonia ranged from 1,065–2,259 (0.5–1.0 per 1,000 persons) among children <5 years old and 779–2,252 cases (0.1–0.2 per 1,000 persons) for persons ≥5 years old, depending on year and base rate used. In both countries, the number of non-hospitalized influenza-associated cases was several-fold higher than the hospitalized cases. Conclusions: Influenza virus was associated with a substantial amount of severe disease in Kenya and Guatemala. This method can be performed in most low and lower-middle income countries

    Mammalian microRNA: an important modulator of host-pathogen interactions in human viral infections

    Get PDF
    MicroRNAs (miRNAs), which are small non-coding RNAs expressed by almost all metazoans, have key roles in the regulation of cell differentiation, organism development and gene expression. Thousands of miRNAs regulating approximately 60æ% of the total human genome have been identified. They regulate genetic expression either by direct cleavage or by translational repression of the target mRNAs recognized through partial complementary base pairing. The active and functional unit of miRNA is its complex with Argonaute proteins known as the microRNA-induced silencing complex (miRISC). De-regulated miRNA expression in the human cell may contribute to a diverse group of disorders including cancer, cardiovascular dysfunctions, liver damage, immunological dysfunction, metabolic syndromes and pathogenic infections. Current day studies have revealed that miRNAs are indeed a pivotal component of host-pathogen interactions and host immune responses toward microorganisms. miRNA is emerging as a tool for genetic study, therapeutic development and diagnosis for human pathogenic infections caused by viruses, bacteria, parasites and fungi. Many pathogens can exploit the host miRNA system for their own benefit such as surviving inside the host cell, replication, pathogenesis and bypassing some host immune barriers, while some express pathogen-encoded miRNA inside the host contributing to their replication, survival and/or latency. In this review, we discuss the role and significance of miRNA in relation to some pathogenic viruses

    The CWP object‐oriented optimization library

    No full text

    Whole genome de novo assemblies of three divergent strains of rice, Oryza sativa, document novel gene space of aus and indica

    Get PDF
    BACKGROUND: The use of high throughput genome-sequencing technologies has uncovered a large extent of structural variation in eukaryotic genomes that makes important contributions to genomic diversity and phenotypic variation. When the genomes of different strains of a given organism are compared, whole genome resequencing data are typically aligned to an established reference sequence. However, when the reference differs in significant structural ways from the individuals under study, the analysis is often incomplete or inaccurate. RESULTS: Here, we use rice as a model to demonstrate how improvements in sequencing and assembly technology allow rapid and inexpensive de novo assembly of next generation sequence data into high-quality assemblies that can be directly compared using whole genome alignment to provide an unbiased assessment. Using this approach, we are able to accurately assess the "pan-genome" of three divergent rice varieties and document several megabases of each genome absent in the other two. CONCLUSIONS: Many of the genome-specific loci are annotated to contain genes, reflecting the potential for new biological properties that would be missed by standard reference-mapping approaches. We further provide a detailed analysis of several loci associated with agriculturally important traits, including the S5 hybrid sterility locus, the Sub1 submergence tolerance locus, the LRK gene cluster associated with improved yield, and the Pup1 cluster associated with phosphorus deficiency, illustrating the utility of our approach for biological discovery. All of the data and software are openly available to support further breeding and functional studies of rice and other species
    corecore