6,654 research outputs found

    T-IDBA: A de novo Iterative de Bruijn Graph Assembler for Transcriptome

    Get PDF
    LNCS v. 6577 entitled: Research in computational molecular biology: 15th annual international conference, RECOMB 2011 ... : proceedingsRNA-seq data produced by next-generation sequencing technology is a useful tool for analyzing transcriptomes. However, existing de novo transcriptome assemblers do not fully utilize the properties of transcriptomes and may result in short contigs because of the splicing nature (shared exons) of the genes. We propose the T-IDBA algorithm to reconstruct expressed isoforms without reference genome. By using pair-end information to solve the problem of long repeats in different genes and branching in the same gene due to alternative splicing, the graph can be decomposed into small components, each corresponds to a gene. The most possible isoforms with sufficient support from the pair-end reads will be found heuristically. In practice, our de novo transcriptome assembler, T-IDBA, outperforms Abyss substantially in terms of sensitivity and precision for both simulated and real data. T-IDBA is available at http://www.cs.hku.hk/~alse/ tidba/. © 2011 Springer-Verlag.postprin

    IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth

    Get PDF
    MOTIVATION: Next-generation sequencing allows us to sequence reads from a microbial environment using single-cell sequencing or metagenomic sequencing technologies. However, both technologies suffer from the problem that sequencing depth of different regions of a genome or genomes from different species are highly uneven. Most existing genome assemblers usually have an assumption that sequencing depths are even. These assemblers fail to construct correct long contigs. RESULTS: We introduce the IDBA-UD algorithm that is based on the de Bruijn graph approach for assembling reads from single-cell sequencing or metagenomic sequencing technologies with uneven sequencing depths. Several non-trivial techniques have been employed to tackle the problems. Instead of using a simple threshold, we use multiple depthrelative thresholds to remove erroneous k-mers in both low-depth and high-depth regions. The technique of local assembly with paired-end information is used to solve the branch problem of low-depth short repeat regions. To speed up the process, an error correction step is conducted to correct reads of high-depth regions that can be aligned to highconfident contigs. Comparison of the performances of IDBA-UD and existing assemblers (Velvet, Velvet-SC, SOAPdenovo and Meta-IDBA) for different datasets, shows that IDBA-UD can reconstruct longer contigs with higher accuracy. AVAILABILITY: The IDBA-UD toolkit is available at our website http://www.cs.hku.hk/alse/idba_udpostprin

    Risk factors of developmental defects of enamel - A prospective cohort study

    Get PDF
    BACKGROUND AND OBJECTIVE: Current studies on the aetiology of developmental defects of enamel (DDE) are subject to recall bias because of the retrospective collection of information. Our objective was to investigate potential risk factors associated with the occurrence of DDE through a prospective cohort study. METHODS: Using a random community sample of Hong Kong children born in 1997, we performed a cohort study in which the subjects' background information, medical and dental records were prospectively collected. A clinical examination to identify DDE was conducted in 2010 when the subjects were 12 years old. The central incisor, lateral incisor and first molar in each quadrant were chosen as the index teeth and were examined 'wet' by two trained and calibrated examiners using the modified FDI (DDE) Index. RESULTS: With a response rate of 74.9%, the 514 examined subjects had matched data for background information. Diffuse opacites were the most common type of DDE. Of the various possible aetiological factors considered, only experience of severe diseases during the period 0-3 years was associated with the occurrence of 'any defect' (p = 0.017) and diffuse opacities (p = 0.044). The children with experience of severe diseases before 3 years of age were 7.89 times more likely to be affected by 'any defect' compared with those who did not have the experience (OR 7.89; 95% CI 1.07, 58.14; p = 0.043). However, after adjusting for confounding factors, the association no longer existed. CONCLUSION: No variables could be identified as risk factors of DDE in this Hong Kong birth cohort.published_or_final_versio

    Demystifying construction project time-effort distribution curves: a BIM and non-BIM comparison

    Get PDF
    MacLeamy's time-effort distribution curves are among the most oft-cited sources for researchers interested in mainstreaming building information modeling (BIM) implementation in the architecture, engineering, and construction (AEC) industry. Succinctly, the curves offer a clever answer to the question: How can BIM benefit AEC processes? However, despite their significant theoretical and practical value, little previous research has been conducted to elaborate the time-effort distribution curves of any real-life projects. This research aims to demystify the time-effort distribution curves through comparison of a representative BIM project and a non-BIM project. Applying a set of innovative approaches, the actual time-effort distribution curves of two public housing construction projects in Hong Kong are produced and analyzed in-depth. The curves vividly show that BIM implementation increases the effort spent at design stage - that is, throughout the architecture and engineering processes - but the extra effort pays off at the building stage. Further, the curves are found to be a useful graphical analytic tool for other purposes, such as adjusting the fee structure among AEC processes and informing improved BIM adoption.postprin

    Tooth eruption and obesity in 12-year-old children

    Get PDF
    published_or_final_versio

    Unsupervised binning of environmental genomic fragments based on an error robust selection of l-mers

    Get PDF
    BACKGROUND: With the rapid development of genome sequencing techniques, traditional research methods based on the isolation and cultivation of microorganisms are being gradually replaced by metagenomics, which is also known as environmental genomics. The first step, which is still a major bottleneck, of metagenomics is the taxonomic characterization of DNA fragments (reads) resulting from sequencing a sample of mixed species. This step is usually referred as 'binning'. Existing binning methods are based on supervised or semi-supervised approaches which rely heavily on reference genomes of known microorganisms and phylogenetic marker genes. Due to the limited availability of reference genomes and the bias and instability of marker genes, existing binning methods may not be applicable in many cases. RESULTS: In this paper, we present an unsupervised binning method based on the distribution of a carefully selected set of l-mers (substrings of length l in DNA fragments). From our experiments, we show that our method can accurately bin DNA fragments with various lengths and relative species abundance ratios without using any reference and training datasets. Another feature of our method is its error robustness. The binning accuracy decreases by less than 1% when the sequencing error rate increases from 0% to 5%. Note that the typical sequencing error rate of existing commercial sequencing platforms is less than 2%. CONCLUSIONS: We provide a new and effective tool to solve the metagenome binning problem without using any reference datasets or markers information of any known reference genomes (species). The source code of our software tool, the reference genomes of the species for generating the test datasets and the corresponding test datasets are available at http://i.cs.hku.hk/alse/MetaCluster/.published_or_final_versio

    Classical and Quantum Equations of Motion for a BTZ Black String in AdS Space

    Full text link
    We investigate gravitational collapse of a (3+1)(3+1)-dimensional BTZ black string in AdS space in the context of both classical and quantum mechanics. This is done by first deriving the conserved mass per unit length of the cylindrically symmetric domain wall, which is taken as the classical Hamiltonian of the black string. In the quantum mechanical context, we take primary interest in the behavior of the collapse near the horizon and near the origin (classical singularity) from the point of view of an infalling observer. In the absence of radiation, quantum effects near the horizon do not change the classical conclusions for an infalling observer, meaning that the horizon is not an obstacle for him/her. The most interesting quantum mechanical effect comes in when investigating near the origin. First, quantum effects are able to remove the classical singularity at the origin, since the wave function is non-singular at the origin. Second, the Schr\"odinger equation describing the behavior near the origin displays non-local effects, which depend on the energy density of the domain wall. This is manifest in that derivatives of the wavefunction at one point are related to the value of the wavefunction at some other distant point.Comment: 9 pages, 1 figure. Minor Clarification and corrections. Accepted for Publication in JHE

    A robust and accurate binning algorithm for metagenomic sequences with arbitrary species abundance ratio

    Get PDF
    Motivation: With the rapid development of next-generation sequencing techniques, metagenomics, also known as environmental genomics, has emerged as an exciting research area that enables us to analyze the microbial environment in which we live. An important step for metagenomic data analysis is the identification and taxonomic characterization of DNA fragments (reads or contigs) resulting from sequencing a sample of mixed species. This step is referred to as 'binning'. Binning algorithms that are based on sequence similarity and sequence composition markers rely heavily on the reference genomes of known microorganisms or phylogenetic markers. Due to the limited availability of reference genomes and the bias and low availability of markers, these algorithms may not be applicable in all cases. Unsupervised binning algorithms which can handle fragments from unknown species provide an alternative approach. However, existing unsupervised binning algorithms only work on datasets either with balanced species abundance ratios or rather different abundance ratios, but not both. Results: In this article, we present MetaCluster 3.0, an integrated binning method based on the unsupervised top-down separation and bottom-up merging strategy, which can bin metagenomic fragments of species with very balanced abundance ratios (say 1:1) to very different abundance ratios (e.g. 1:24) with consistently higher accuracy than existing methods. © The Author 2011. Published by Oxford University Press. All rights reserved.postprin

    Synthesis and Photoluminescence Properties of Porous Silicon Nanowire Arrays

    Get PDF
    Herein, we prepare vertical and single crystalline porous silicon nanowires (SiNWs) via a two-step metal-assisted electroless etching method. The porosity of the nanowires is restricted by etchant concentration, etching time and doping lever of the silicon wafer. The diffusion of silver ions could lead to the nucleation of silver nanoparticles on the nanowires and open new etching ways. Like porous silicon (PS), these porous nanowires also show excellent photoluminescence (PL) properties. The PL intensity increases with porosity, with an enhancement of about 100 times observed in our condition experiments. A “red-shift” of the PL peak is also found. Further studies prove that the PL spectrum should be decomposed into two elementary PL bands. The peak at 850 nm is the emission of the localized excitation in the nanoporous structure, while the 750-nm peak should be attributed to the surface-oxidized nanostructure. It could be confirmed from the Fourier transform infrared spectroscopy analyses. These porous SiNW arrays may be useful as the nanoscale optoelectronic devices
    • …
    corecore