31 research outputs found

    ALLPATHS 2: Small Genomes Assembled Accurately and with High Continuity from Short Paired Reads

    Get PDF
    We demonstrate that genome sequences approaching finished quality can be generated from short paired reads. Using 36 base (fragment) and 26 base (jumping) reads from five microbial genomes of varied GC composition and sizes up to 40 Mb, ALLPATHS2 generated assemblies with long, accurate contigs and scaffolds. Velvet and EULER-SR were less accurate. For example, for Escherichia coli, the fraction of 10-kb stretches that were perfect was 99.8% (ALLPATHS2), 68.7% (Velvet), and 42.1% (EULER-SR).Organismic and Evolutionary Biolog

    Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species

    Get PDF
    Background: The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results: In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions: Many current genome assemblers produced useful assemblies, containing a significant representation of their genes and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another

    The genomic substrate for adaptive radiation in African cichlid fish

    Get PDF
    Cichlid fishes are famous for large, diverse and replicated adaptive radiations in the Great Lakes of East Africa. To understand the molecular mechanisms underlying cichlid phenotypic diversity, we sequenced the genomes and transcriptomes of five lineages of African cichlids: the Nile tilapia (Oreochromis niloticus), an ancestral lineage with low diversity; and four members of the East African lineage: Neolamprologus brichardi/pulcher (older radiation, Lake Tanganyika), Metriaclima zebra (recent radiation, Lake Malawi), Pundamilia nyererei (very recent radiation, Lake Victoria), and Astatotilapia burtoni (riverine species around Lake Tanganyika). We found an excess of gene duplications in the East African lineage compared to tilapia and other teleosts, an abundance of non-coding element divergence, accelerated coding sequence evolution, expression divergence associated with transposable element insertions, and regulation by novel microRNAs. In addition, we analysed sequence data from sixty individuals representing six closely related species from Lake Victoria, and show genome-wide diversifying selection on coding and regulatory variants, some of which were recruited from ancient polymorphisms. We conclude that a number of molecular mechanisms shaped East African cichlid genomes, and that amassing of standing variation during periods of relaxed purifying selection may have been important in facilitating subsequent evolutionary diversification

    Measurement and modelling of phytoplankton light scattering

    No full text
    Available from British Library Document Supply Centre- DSC:DXN055940 / BLDSC - British Library Document Supply CentreSIGLEGBUnited Kingdo

    Round-Pound Arithmetic

    No full text
    Rounded Arithmetic is the traditional way of presenting previous year accounts. The method of showing rounded sums of vectors in which the arithmetic still "adds up" and the errors in the display of the components are bounded by twice the maximum rounding error have been well known for at least 30 years in spite of there being little or nothing in the literature and no attempt to implement this in currently available spread-sheets. This paper extends the method from vectors to accumulator trees, and suggests a heuristic algorithm for the more general accumulator graphs in which nodes accumulate into more than one accumulator. The special case of crosstabulations, in which all nodes accumulate to 2 accumulators represents is considered by both recursive and non-recursive methods. 1 Introduction Rounded Arithmetic is the traditional way of presenting previous year accounts. The method of showing rounded sums in which the arithmetic still "adds up" and the errors in the display of the co..

    Measurement and modelling of light scattering by phytoplankton cells at narrow forward angles

    No full text
    A procedure has been devised for measuring the angular dependence of light scattering from suspensions of phytoplankton cells at forward angles from 0.25â—¦ to 8â—¦. The cells were illuminated with a spatially-filtered laser beam and the angular distribution of scattered light measured by tracking a photodetector across the Fourier plane of a collecting lens using a stepper-motor driven stage. The procedure was calibrated by measuring scattering from latex bead suspensions with known size distributions. It was then used to examine the scattering from cultures of the unicellular algae Isochrysis galbana (4 ÎĽm Ă— 5 ÎĽm), Dunaliella primolecta (6 ÎĽm Ă— 7 ÎĽm) and Rhinomonas reticulata (5 ÎĽm Ă— 11 ÎĽm). The results were compared with the predictions of Mie theory. Excellent agreement was obtained for spherical particles. A suitable choice of spherical-equivalent scattering parameters was required to enable reasonable agreement within the first diffraction lobe for ellipsoidal particles

    ALLPATHS: De novo assembly of whole-genome shotgun microreads

    Get PDF
    New DNA sequencing technologies deliver data at dramatically lower costs but demand new analytical methods to take full advantage of the very short reads that they produce. We provide an initial, theoretical solution to the challenge of de novo assembly from whole-genome shotgun “microreads.” For 11 genomes of sizes up to 39 Mb, we generated high-quality assemblies from 80× coverage by paired 30-base simulated reads modeled after real Illumina-Solexa reads. The bacterial genomes of Campylobacter jejuni and Escherichia coli assemble optimally, yielding single perfect contigs, and larger genomes yield assemblies that are highly connected and accurate. Assemblies are presented in a graph form that retains intrinsic ambiguities such as those arising from polymorphism, thereby providing information that has been absent from previous genome assemblies. For both C. jejuni and E. coli, this assembly graph is a single edge encompassing the entire genome. Larger genomes produce more complicated graphs, but the vast majority of the bases in their assemblies are present in long edges that are nearly always perfect. We describe a general method for genome assembly that can be applied to all types of DNA sequence data, not only short read data, but also conventional sequence reads
    corecore