2,116 research outputs found

    Compressed Data Structures for Dynamic Sequences

    Full text link
    We consider the problem of storing a dynamic string SS over an alphabet Σ={1,,σ}\Sigma=\{\,1,\ldots,\sigma\,\} in compressed form. Our representation supports insertions and deletions of symbols and answers three fundamental queries: access(i,S)\mathrm{access}(i,S) returns the ii-th symbol in SS, ranka(i,S)\mathrm{rank}_a(i,S) counts how many times a symbol aa occurs among the first ii positions in SS, and selecta(i,S)\mathrm{select}_a(i,S) finds the position where a symbol aa occurs for the ii-th time. We present the first fully-dynamic data structure for arbitrarily large alphabets that achieves optimal query times for all three operations and supports updates with worst-case time guarantees. Ours is also the first fully-dynamic data structure that needs only nHk+o(nlogσ)nH_k+o(n\log\sigma) bits, where HkH_k is the kk-th order entropy and nn is the string length. Moreover our representation supports extraction of a substring S[i..i+]S[i..i+\ell] in optimal O(logn/loglogn+/logσn)O(\log n/\log\log n + \ell/\log_{\sigma}n) time

    Economic evidence for the prevention and treatment of atopic eczema: a protocol for a systematic review

    Get PDF
    Background: Eczema, synonymous with atopic eczema or atopic dermatitis, is a chronic skin disease that has a similar impact on health-related quality of life as other chronic diseases. The proposed research aims to provide a comprehensive systematic assessment of the economic evidence base available to inform economic modelling and decision making on interventions to prevent and treat eczema at any stage of the life course. Whilst the Global Resource of Eczema Trials (GREAT) database collects together the effectiveness evidence for eczema there is currently no such systematic resource on the economics of eczema. It is important to gain an overview of the current state of the art of economic methods in the field of eczema in order to strengthen the economic evidence base further. Methods/design: The proposed study is a systematic review of the economic evidence surrounding interventions for the prevention and treatment of eczema. Relevant search terms will be used to search MEDLINE, EMBASE, Database of Abstracts of Reviews of Effects, Cochrane Database of Systematic Reviews, Cochrane Central Register of Controlled Trials, NHS Economic Evaluation Database, Health Technology Assessment, Cumulative Index to Nursing and Allied Health Literature, Econ Lit, Scopus, Cost-Effectiveness Analysis Registry and Web of Science in order to identify relevant evidence. To be eligible for inclusion studies will be primary empirical studies evaluating the cost, utility or full economic evaluation of interventions for preventing or treating eczema. Two reviewers will independently assess studies for eligibility and perform data abstraction. Evidence tables will be produced presenting details of study characteristics, costing methods, outcome methods and quality assessment. The methodological quality of studies will be assessed using accepted checklists. Discussion: The systematic review is being undertaken to identify the type of economic evidence available, summarise the results of the available economic evidence and critically appraise the quality of economic evidence currently available to inform future economic modelling and resource allocation decisions about interventions to prevent or treat eczema. We aim to use the review to offer guidance about how to gather economic evidence in studies of eczema and/or what further research is necessary in order to inform this

    Efficient Representation of Multidimensional Data over Hierarchical Domains

    Get PDF
    The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-46049-9_19[Abstract] We consider the problem of representing multidimensional data where the domain of each dimension is organized hierarchically, and the queries require summary information at a different node in the hierarchy of each dimension. This is the typical case of OLAP databases. A basic approach is to represent each hierarchy as a one-dimensional line and recast the queries as multidimensional range queries. This approach can be implemented compactly by generalizing to more dimensions the k2k2 -treap, a compact representation of two-dimensional points that allows for efficient summarization queries along generic ranges. Instead, we propose a more flexible generalization, which instead of a generic quadtree-like partition of the space, follows the domain hierarchies across each dimension to organize the partitioning. The resulting structure is much more efficient than a generic multidimensional structure, since queries are resolved by aggregating much fewer nodes of the tree.Ministerio de Economía, Industria y Competitividad; TIN2013-46238-C4-3-RMinisterio de Economía, Industria y Competitividad; IDI-20141259Ministerio de Economía, Industria y Competitividad; ITC-20151305Ministerio de Economía y Competitividad; ITC-20151247Xunta de Galicia; GRC2013/053Chile.Fondo Nacional de Desarrollo Científico y Tecnológico; 1-140796COST. IC130

    Reconstructing complex regions of genomes using long-read sequencing technology

    Get PDF
    Cataloged from PDF version of article.Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger shotgun sequencing of clone inserts, however, has now been largely abandoned, leaving most of these regions unresolved in newer genome assemblies generated primarily by next-generation sequencing hybrid approaches. Here we show that it is possible to resolve regions that are complex in a genome-wide context but simple in isolation for a fraction of the time and cost of traditional methods using long-read single molecule, real-time (SMRT) sequencing and assembly technology from Pacific Biosciences (PacBio). We sequenced and assembled BAC clones corresponding to a 1.3-Mbp complex region of chromosome 17q21.31, demonstrating 99.994% identity to Sanger assemblies of the same clones. We targeted 44 differences using Illumina sequencing and find that PacBio and Sanger assemblies share a comparable number of validated variants, albeit with different sequence context biases. Finally, we targeted a poorly assembled 766-kbp duplicated region of the chimpanzee genome and resolved the structure and organization for a fraction of the cost and time of traditional finishing approaches. Our data suggest a straightforward path for upgrading genomes to a higher quality finished state

    Effects of broadening and electron overheating in tunnel structures based on metallic clusters

    Full text link
    We study the influence of energy levels broadening and electron subsystem overheating in island electrode (cluster) on current-voltage characteristics of three-electrode structure. A calculation scheme for broadening effect in one-dimensional case is suggested. Estimation of broadening is performed for electron levels in disc-like and spherical gold clusters. Within the two-temperature model of metallic cluster and by using a size dependence of the Debye frequency the effective electron temperature as a function of bias voltage is found approximately. We suggest that the effects of broadening and electron overheating are responsible for the strong smoothing of current-voltage curves, which is observed experimentally at low temperatures in structures based on clusters consisting of accountable number of atoms.Comment: 8 pages, 5 figure
    corecore