1,805 research outputs found

    RLZAP: Relative Lempel-Ziv with Adaptive Pointers

    Full text link
    Relative Lempel-Ziv (RLZ) is a popular algorithm for compressing databases of genomes from individuals of the same species when fast random access is desired. With Kuruppu et al.'s (SPIRE 2010) original implementation, a reference genome is selected and then the other genomes are greedily parsed into phrases exactly matching substrings of the reference. Deorowicz and Grabowski (Bioinformatics, 2011) pointed out that letting each phrase end with a mismatch character usually gives better compression because many of the differences between individuals' genomes are single-nucleotide substitutions. Ferrada et al. (SPIRE 2014) then pointed out that also using relative pointers and run-length compressing them usually gives even better compression. In this paper we generalize Ferrada et al.'s idea to handle well also short insertions, deletions and multi-character substitutions. We show experimentally that our generalization achieves better compression than Ferrada et al.'s implementation with comparable random-access times

    Composite repetition-aware data structures

    Get PDF
    In highly repetitive strings, like collections of genomes from the same species, distinct measures of repetition all grow sublinearly in the length of the text, and indexes targeted to such strings typically depend only on one of these measures. We describe two data structures whose size depends on multiple measures of repetition at once, and that provide competitive tradeoffs between the time for counting and reporting all the exact occurrences of a pattern, and the space taken by the structure. The key component of our constructions is the run-length encoded BWT (RLBWT), which takes space proportional to the number of BWT runs: rather than augmenting RLBWT with suffix array samples, we combine it with data structures from LZ77 indexes, which take space proportional to the number of LZ77 factors, and with the compact directed acyclic word graph (CDAWG), which takes space proportional to the number of extensions of maximal repeats. The combination of CDAWG and RLBWT enables also a new representation of the suffix tree, whose size depends again on the number of extensions of maximal repeats, and that is powerful enough to support matching statistics and constant-space traversal.Comment: (the name of the third co-author was inadvertently omitted from previous version

    Dynamic Fluctuation Phenomena in Double Membrane Films

    Full text link
    Dynamics of double membrane films is investigated in the long-wavelength limit including the overdamped squeezing mode. We demonstrate that thermal fluctuations essentially modify the character of the mode due to its nonlinear coupling to the transversal shear hydrodynamic mode. The corresponding Green function acquires as a function of the frequency a cut along the imaginary semi-axis. Fluctuations lead to increasing the attenuation of the squeezing mode it becomes larger than the `bare' value.Comment: 7 pages, Revte

    Time-lapsing biodiversity: an open source method for measuring diversity changes by remote sensing

    Get PDF
    Understanding biodiversity changes in time is crucial to promptly provide management practices against diversity loss. This is overall true when considering global scales, since human-induced global change is expected to make significant changes on the Earth's biota. Biodiversity management and planning is mainly based on field observations related to community diversity, considering different taxa. However, such methods are time and cost demanding and do not allow in most cases to get temporal replicates. In this view, remote sensing can provide a wide data coverage in a short period of time. Recently, the use of Rao's Q diversity as a measure of spectral diversity has been proposed in order to explicitly take into account differences in a neighbourhood considering abundance and relative distance among pixels. The aim of this paper was to extend such a measure over the temporal dimension and to present an innovative approach to calculate remotely sensed temporal diversity. We demonstrated that temporal beta-diversity (spectral turnover) can be calculated pixel-wise in terms of both slope and coefficient of variation and further plotted over the whole matrix / image. From an ecological and operational point of view, for prioritisation practices in biodiversity protection, temporal variability could be beneficial in order to plan more efficient conservation practices starting from spectral diversity hotspots in space and time. In this paper, we delivered a highly reproducible approach to calculate spatio-temporal diversity in a robust and straightforward manner. Since it is based on open source code, we expect that our method will be further used by several researchers and landscape managers

    Classification of Dust Days by Satellite Remotely Sensed Aerosol Products

    Get PDF
    Considerable progress in satellite remote sensing (SRS) of dust particles has been seen in the last decade. From an environmental health perspective, such an event detection, after linking it to ground particulate matter (PM) concentrations, can proxy acute exposure to respirable particles of certain properties (i.e. size, composition, and toxicity). Being affected considerably by atmospheric dust, previous studies in the Eastern Mediterranean, and in Israel in particular, have focused on mechanistic and synoptic prediction, classification, and characterization of dust events. In particular, a scheme for identifying dust days (DD) in Israel based on ground PM10 (particulate matter of size smaller than 10 nm) measurements has been suggested, which has been validated by compositional analysis. This scheme requires information regarding ground PM10 levels, which is naturally limited in places with sparse ground-monitoring coverage. In such cases, SRS may be an efficient and cost-effective alternative to ground measurements. This work demonstrates a new model for identifying DD and non-DD (NDD) over Israel based on an integration of aerosol products from different satellite platforms (Moderate Resolution Imaging Spectroradiometer (MODIS) and Ozone Monitoring Instrument (OMI)). Analysis of ground-monitoring data from 2007 to 2008 in southern Israel revealed 67 DD, with more than 88 percent occurring during winter and spring. A Classification and Regression Tree (CART) model that was applied to a database containing ground monitoring (the dependent variable) and SRS aerosol product (the independent variables) records revealed an optimal set of binary variables for the identification of DD. These variables are combinations of the following primary variables: the calendar month, ground-level relative humidity (RH), the aerosol optical depth (AOD) from MODIS, and the aerosol absorbing index (AAI) from OMI. A logistic regression that uses these variables, coded as binary variables, demonstrated 93.2 percent correct classifications of DD and NDD. Evaluation of the combined CART-logistic regression scheme in an adjacent geographical region (Gush Dan) demonstrated good results. Using SRS aerosol products for DD and NDD, identification may enable us to distinguish between health, ecological, and environmental effects that result from exposure to these distinct particle populations

    Cutaneous manifestations of COVID-19: Report of three cases and a review of literature

    Get PDF
    Background: Various cutaneous manifestations have been observed in patients with COVID-19 infection. However, overall similarities in the clinical presentation of these dermatological manifestations have not yet been summarized. Objective: This review aims to provide an overview of various cutaneous manifestations in patients with COVID-19 through three case reports and a literature review. Methods: A literature search was conducted using PubMed, OVID, and Google search engines for original and review articles. Studies written in the English language that mentioned cutaneous symptoms and COVID-19 were included. Results: Eighteen articles and three additional cases reported in this paper were included in this review. Of these studies, 6 are case series and 12 are case report studies. The most common cutaneous manifestation of COVID-19 was found to be maculopapular exanthem (morbilliform), presenting in 36.1% (26/72) patients. The other cutaneous manifestations included: a papulovesicular rash (34.7%, 25/72), urticaria (9.7%, 7/72), painful acral red purple papules (15.3%, 11/72) of patients, livedo reticularis lesions (2.8%, 2/72) and petechiae (1.4%, 1/72). Majority of lesions were localized on the trunk (66.7%, 50/72), however, 19.4% (14/72) of patients experienced cutaneous manifestations in the hands and feet. Skin lesion development occurred before the onset of respiratory symptoms or COVID-19 diagnosis in 12.5% (9/72) of the patients, and lesions spontaneously healed in all patients within 10 days. Majority of the studies reported no correlation between COVID-19 severity and skin lesions. Conclusion: Infection with COVID-19 may result in dermatological manifestations with various clinical presentations, which may aid in the timely diagnosis of this infection

    Factorised Steady States in Mass Transport Models

    Get PDF
    We study a class of mass transport models where mass is transported in a preferred direction around a one-dimensional periodic lattice and is globally conserved. The model encompasses both discrete and continuous masses and parallel and random sequential dynamics and includes models such as the Zero-range process and Asymmetric random average process as special cases. We derive a necessary and sufficient condition for the steady state to factorise, which takes a rather simple form.Comment: 6 page

    On the suitability of suffix arrays for lempel-ziv data compression

    Get PDF
    Lossless compression algorithms of the Lempel-Ziv (LZ) family are widely used nowadays. Regarding time and memory requirements, LZ encoding is much more demanding than decoding. In order to speed up the encoding process, efficient data structures, like suffix trees, have been used. In this paper, we explore the use of suffix arrays to hold the dictionary of the LZ encoder, and propose an algorithm to search over it. We show that the resulting encoder attains roughly the same compression ratios as those based on suffix trees. However, the amount of memory required by the suffix array is fixed, and much lower than the variable amount of memory used by encoders based on suffix trees (which depends on the text to encode). We conclude that suffix arrays, when compared to suffix trees in terms of the trade-off among time, memory, and compression ratio, may be preferable in scenarios (e.g., embedded systems) where memory is at a premium and high speed is not critical

    Fingerprints in Compressed Strings

    Get PDF
    The Karp-Rabin fingerprint of a string is a type of hash value that due to its strong properties has been used in many string algorithms. In this paper we show how to construct a data structure for a string S of size N compressed by a context-free grammar of size n that answers fingerprint queries. That is, given indices i and j, the answer to a query is the fingerprint of the substring S[i,j]. We present the first O(n) space data structures that answer fingerprint queries without decompressing any characters. For Straight Line Programs (SLP) we get O(logN) query time, and for Linear SLPs (an SLP derivative that captures LZ78 compression and its variations) we get O(log log N) query time. Hence, our data structures has the same time and space complexity as for random access in SLPs. We utilize the fingerprint data structures to solve the longest common extension problem in query time O(log N log l) and O(log l log log l + log log N) for SLPs and Linear SLPs, respectively. Here, l denotes the length of the LCE

    Instability of Myelin Tubes under Dehydration: deswelling of layered cylindrical structures

    Full text link
    We report experimental observations of an undulational instability of myelin figures. Motivated by this, we examine theoretically the deformation and possible instability of concentric, cylindrical, multi-lamellar membrane structures. Under conditions of osmotic stress (swelling or dehydration), we find a stable, deformed state in which the layer deformation is given by \delta R ~ r^{\sqrt{B_A/(hB)}}, where B_A is the area compression modulus, B is the inter-layer compression modulus, and h is the repeat distance of layers. Also, above a finite threshold of dehydration (or osmotic stress), we find that the system becomes unstable to undulations, first with a characteristic wavelength of order \sqrt{xi d_0}, where xi is the standard smectic penetration depth and d_0 is the thickness of dehydrated region.Comment: 5 pages + 3 figures [revtex 4
    • 

    corecore