104 research outputs found

    LOCAS – A Low Coverage Assembly Tool for Resequencing Projects

    Get PDF
    Motivation: Next Generation Sequencing (NGS) is a frequently applied approach to detect sequence variations between highly related genomes. Recent large-scale re-sequencing studies as the Human 1000 Genomes Project utilize NGS data of low coverage to afford sequencing of hundreds of individuals. Here, SNPs and micro-indels can be detected by applying an alignment-consensus approach. However, computational methods capable of discovering other variations such as novel insertions or highly diverged sequence from low coverage NGS data are still lacking. Results: We present LOCAS, a new NGS assembler particularly designed for low coverage assembly of eukaryotic genomes using a mismatch sensitive overlap-layout-consensus approach. LOCAS assembles homologous regions in a homologyguided manner while it performs de novo assemblies of insertions and highly polymorphic target regions subsequently to an alignment-consensus approach. LOCAS has been evaluated in homology-guided assembly scenarios with low sequence coverage of Arabidopsis thaliana strains sequenced as part of the Arabidopsis 1001 Genomes Project. While assembling the same amount of long insertions as state-of-the-art NGS assemblers, LOCAS showed best results regarding contig size, error rate and runtime. Conclusion: LOCAS produces excellent results for homology-guided assembly of eukaryotic genomes with short reads and low sequencing depth, and therefore appears to be the assembly tool of choice for the detection of novel sequenc

    Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads

    Get PDF
    Recent improvements in technology have made DNA sequencing dramatically faster and more efficient than ever before. The new technologies produce highly accurate sequences, but one drawback is that the most efficient technology produces the shortest read lengths. Short-read sequencing has been applied successfully to resequence the human genome and those of other species but not to whole-genome sequencing of novel organisms. Here we describe the sequencing and assembly of a novel clinical isolate of Pseudomonas aeruginosa, strain PAb1, using very short read technology. From 8,627,900 reads, each 33 nucleotides in length, we assembled the genome into one scaffold of 76 ordered contiguous sequences containing 6,290,005 nucleotides, including one contig spanning 512,638 nucleotides, plus an additional 436 unordered contigs containing 416,897 nucleotides. Our method includes a novel gene-boosting algorithm that uses amino acid sequences from predicted proteins to build a better assembly. This study demonstrates the feasibility of very short read sequencing for the sequencing of bacterial genomes, particularly those for which a related species has been sequenced previously, and expands the potential application of this new technology to most known prokaryotic species

    Mindful Parenting in Mental Health Care

    Get PDF
    Mindfulness is a form of meditation based on the Buddhist tradition, which has been used over the last two decades to successfully treat a multitude of mental health problems. Bringing mindfulness into parenting (“mindful parenting”) is one of the applications of mindfulness. Mindful parenting interventions are increasingly being used to help prevent and treat mental disorders in children, parenting problems, and prevent intergenerational transmission of mental disorders from parents to children. However, to date, few studies have examined the hypothesized mechanisms of change brought about by mindful parenting. We discuss six possible mechanisms through which mindful parenting may bring about change in parent–child interactions in the context of child and parent mental health problems. These mechanisms are hypothesized to be mediated by the effects of mindfulness on parental attention by: (1) reducing parental stress and resulting parental reactivity; (2) reducing parental preoccupation resulting from parental and/or child psychopathology; (3) improving parental executive functioning in impulsive parents; (4) breaking the cycle of intergenerational transmission of dysfunctional parenting schemas and habits; (5) increasing self-nourishing attention; and (6) improving marital functioning and co-parenting. We review research that has applied mindful parenting in mental health settings, with a focus on evidence for these six mechanisms. Finally, we discuss directions for future research into mindful parenting and the crucial questions that this research should strive to answer

    Fast splice site detection using information content and feature reduction

    Get PDF
    Background: Accurate identification of splice sites in DNA sequences plays a key role in the prediction of gene structure in eukaryotes. Already many computational methods have been proposed for the detection of splice sites and some of them showed high prediction accuracy. However, most of these methods are limited in terms of their long computation time when applied to whole genome sequence data. Results: In this paper we propose a hybrid algorithm which combines several effective and informative input features with the state of the art support vector machine (SVM). To obtain the input features we employ information content method based on Shannon\u27s information theory, Shapiro\u27s score scheme, and Markovian probabilities. We also use a feature elimination scheme to reduce the less informative features from the input data. Conclusion: In this study we propose a new feature based splice site detection method that shows improved acceptor and donor splice site detection in DNA sequences when the performance is compared with various state of the art and well known method

    Characterization of the Influenza A H5N1 Viruses of the 2008-09 Outbreaks in India Reveals a Third Introduction and Possible Endemicity

    Get PDF
    Widespread infection of highly pathogenic avian influenza A H5N1 was reported from backyard and commercial poultry in West Bengal (WB), an eastern state of India in early 2008. Infection gradually spread to Tripura, Assam and Sikkim, the northeastern states, with 70 outbreaks reported between January 2008 and May 2009. Whole genome sequence analysis of three isolates from WB, one isolate from Tripura along with the analysis of hemagglutinin (HA) and neuraminidase (NA) genes of 17 other isolates was performed during this study. In the HA gene phylogenetic tree, all the 2008-09 Indian isolates belonged to EMA3 sublineage of clade 2.2. The closest phylogenetic relationship was found to be with the 2007-09 isolates from Bangladesh and not with the earlier 2006 and 2007 Indian isolates implying a third introduction into the country. The receptor-binding pocket of HA1 of two isolates from WB showed S221P mutation, one of the markers predicted to be associated with human receptor specificity. Two substitutions E119A (2 isolates of WB) and N294S (2 other isolates of WB) known to confer resistance to NA inhibitors were observed in the active site of neuraminidase. Several additional mutations were observed within the 2008-09 Indian isolates indicating genetic diversification. Overall, the study is indicative of a possible endemicity in the eastern and northeastern parts of the country, demanding active surveillance specifically in view of the critical mutations that have been observed in the influenza A H5N1 viruses

    Genome and Transcriptome Analysis of the Food-Yeast Candida utilis

    Get PDF
    The industrially important food-yeast Candida utilis is a Crabtree effect-negative yeast used to produce valuable chemicals and recombinant proteins. In the present study, we conducted whole genome sequencing and phylogenetic analysis of C. utilis, which showed that this yeast diverged long before the formation of the CUG and Saccharomyces/Kluyveromyces clades. In addition, we performed comparative genome and transcriptome analyses using next-generation sequencing, which resulted in the identification of genes important for characteristic phenotypes of C. utilis such as those involved in nitrate assimilation, in addition to the gene encoding the functional hexose transporter. We also found that an antisense transcript of the alcohol dehydrogenase gene, which in silico analysis did not predict to be a functional gene, was transcribed in the stationary-phase, suggesting a novel system of repression of ethanol production. These findings should facilitate the development of more sophisticated systems for the production of useful reagents using C. utilis

    LTC: a novel algorithm to improve the efficiency of contig assembly for physical mapping in complex genomes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Physical maps are the substrate of genome sequencing and map-based cloning and their construction relies on the accurate assembly of BAC clones into large contigs that are then anchored to genetic maps with molecular markers. High Information Content Fingerprinting has become the method of choice for large and repetitive genomes such as those of maize, barley, and wheat. However, the high level of repeated DNA present in these genomes requires the application of very stringent criteria to ensure a reliable assembly with the FingerPrinted Contig (FPC) software, which often results in short contig lengths (of 3-5 clones before merging) as well as an unreliable assembly in some difficult regions. Difficulties can originate from a non-linear topological structure of clone overlaps, low power of clone ordering algorithms, and the absence of tools to identify sources of gaps in Minimal Tiling Paths (MTPs).</p> <p>Results</p> <p>To address these problems, we propose a novel approach that: (i) reduces the rate of false connections and Q-clones by using a new cutoff calculation method; (ii) obtains reliable clusters robust to the exclusion of single clone or clone overlap; (iii) explores the topological contig structure by considering contigs as networks of clones connected by significant overlaps; (iv) performs iterative clone clustering combined with ordering and order verification using re-sampling methods; and (v) uses global optimization methods for clone ordering and Band Map construction. The elements of this new analytical framework called Linear Topological Contig (LTC) were applied on datasets used previously for the construction of the physical map of wheat chromosome 3B with FPC. The performance of LTC vs. FPC was compared also on the simulated BAC libraries based on the known genome sequences for chromosome 1 of rice and chromosome 1 of maize.</p> <p>Conclusions</p> <p>The results show that compared to other methods, LTC enables the construction of highly reliable and longer contigs (5-12 clones before merging), the detection of "weak" connections in contigs and their "repair", and the elongation of contigs obtained by other assembly methods.</p

    Recovering complete and draft population genomes from metagenome datasets

    Get PDF
    Assembly of metagenomic sequence data into microbial genomes is of fundamental value to improving our understanding of microbial ecology and metabolism by elucidating the functional potential of hard-to-culture microorganisms. Here, we provide a synthesis of available methods to bin metagenomic contigs into species-level groups and highlight how genetic diversity, sequencing depth, and coverage influence binning success. Despite the computational cost on application to deeply sequenced complex metagenomes (e.g., soil), covarying patterns of contig coverage across multiple datasets significantly improves the binning process. We also discuss and compare current genome validation methods and reveal how these methods tackle the problem of chimeric genome bins i.e., sequences from multiple species. Finally, we explore how population genome assembly can be used to uncover biogeographic trends and to characterize the effect of in situ functional constraints on the genome-wide evolution
    corecore