11 research outputs found

    RLZAP: Relative Lempel-Ziv with Adaptive Pointers

    Full text link
    Relative Lempel-Ziv (RLZ) is a popular algorithm for compressing databases of genomes from individuals of the same species when fast random access is desired. With Kuruppu et al.'s (SPIRE 2010) original implementation, a reference genome is selected and then the other genomes are greedily parsed into phrases exactly matching substrings of the reference. Deorowicz and Grabowski (Bioinformatics, 2011) pointed out that letting each phrase end with a mismatch character usually gives better compression because many of the differences between individuals' genomes are single-nucleotide substitutions. Ferrada et al. (SPIRE 2014) then pointed out that also using relative pointers and run-length compressing them usually gives even better compression. In this paper we generalize Ferrada et al.'s idea to handle well also short insertions, deletions and multi-character substitutions. We show experimentally that our generalization achieves better compression than Ferrada et al.'s implementation with comparable random-access times

    Relative Lempel-Ziv Compression of Suffix Arrays

    Get PDF
    We show that a combination of differential encoding, random sampling, and relative Lempel-Ziv (RLZ) parsing is effective for compressing suffix arrays, while simultaneously allowing very fast decompression of arbitrary suffix array intervals, facilitating pattern matching. The resulting text index, while somewhat larger (5-10x) than the recent r-index of Gagie, Navarro, and Prezza (Proc. SODA ’18)—still provides significant compression, and allows pattern location queries to be answered more than two orders of magnitude faster in practice.Peer reviewe

    Prioritising catchment management projects to improve marine water quality

    No full text
    Runoff from human land-uses is one of the most significant threats to some coastal marine environments. Initiatives to reduce that runoff usually set runoff reduction targets but do not give guidance on how to prioritize the different options that exist to achieve them. This paper demonstrates an easy to interpret economic framework to prioritise investment for conservation projects that aim to reduce pollution of marine ecosystems caused by runoff from agricultural land-uses. We demonstrate how to apply this framework using data on project cost, benefit and feasibility with a subset of projects that have been funded to reduce runoff from subcatchments adjacent to the Great Barrier Reef. Our analysis provides a graphical overview of the cost-effectiveness of the investment options, enables transparent planning for different budgets, assesses the existence of trends in the cost-effectiveness of different categories, and can test if the results are robust under uncertainty in one or more of the parameters. The framework provided solutions that were up to 4 times more efficient than when omitting information on cost or benefit. The presented framework can be used as a benchmark for evaluating results from a range of prioritisation processes against the best possible conservation outcomes

    Physical characterisation of high amylose maize starch and acylated high amylose maize starches

    Get PDF
    The particle size, water sorption properties and molecular mobility of high amylose maize starch (HAMS) and high amylose maize starch acylated with acetate (HAMSA), propionate (HAMSP) and butyrate (HAMSB) were investigated. Acylation increased the mean particle size (D(4,3)) and lowered the specific gravity (G) of the starch granules with an inverse relationship between the length of the fatty acid chain and particle size. Acylation of HAMS with fatty acids lowered the monolayer moisture content with the trend being HAMSB<HAMSA<HAMSP<HAMS, showing that the decrease is affected by factors other than the length of the fatty acid chain. Measurement of molecular mobility of the starch granules by NMR spectroscopy with Carr-Purcell-Meiboom-Gill (CMPG) experiments showed that T2 long was reduced in acylated starches and that drying and storage of the starch granules further reduced T2 long. Analysis of the Free Induction Decay (FID) focussing on the short components of T2 (correlated to the solid matrix), indicated that drying and subsequent storage resulted in alterations of starch at 0.33a(w) and that these changes were reduced with acylation. In vitro enzymatic digestibility of heated starch dispersions by bacterial α-amylase was increased by acylation (HAMS<HAMSB<HAMSP≤HAMSA) showing that the trend was not related to the length of the fatty acid chain. Digestibility was enhanced with an increase in particle size, or decrease in G, and inversely proportional to the total T2 signal. It is suggested that both external surface area and an internal network of pores and channels collectively influence the digestibility of starch

    Quantitative comparison of preparation methodologies for x-ray fluorescence microscopy of brain tissue

    No full text
    C1 - Journal Articles RefereedX-ray fluorescence microscopy (XFM) facilitates high-sensitivity quantitative imaging of trace metals at high spatial resolution over large sample areas and can be applied to a diverse range of biological samples. Accurate determination of elemental content from recorded spectra requires proper calibration of the XFM instrument under the relevant operating conditions. Here, we describe the manufacture, characterization, and utilization of multi-element thin-film reference foils for use in calibration of XFM measurements of biological and other specimens. We have used these internal standards to assess the two-dimensional distribution of trace metals in a thin tissue section of a rat hippocampus. The data used in this study was acquired at the XFM beamline of the Australian Synchrotron using a new 384-element array detector (Maia) and at beamline 2-ID-E at the Advanced Photon Source. Post-processing of samples by different fixation techniques was investigated, with the conclusion that differences in solvent type and sample handling can significantly alter elemental content. The present study highlights the quantitative capability, high statistical power, and versatility of the XFM technique for mapping trace metals in biological samples, e.g., brain tissue samples in order to help understand neurological processes, especially when implemented in conjunction with a high-performance detector such as Maia
    corecore