788 research outputs found

    Efficient Management of Short-Lived Data

    Full text link
    Motivated by the increasing prominence of loosely-coupled systems, such as mobile and sensor networks, which are characterised by intermittent connectivity and volatile data, we study the tagging of data with so-called expiration times. More specifically, when data are inserted into a database, they may be tagged with time values indicating when they expire, i.e., when they are regarded as stale or invalid and thus are no longer considered part of the database. In a number of applications, expiration times are known and can be assigned at insertion time. We present data structures and algorithms for online management of data tagged with expiration times. The algorithms are based on fully functional, persistent treaps, which are a combination of binary search trees with respect to a primary attribute and heaps with respect to a secondary attribute. The primary attribute implements primary keys, and the secondary attribute stores expiration times in a minimum heap, thus keeping a priority queue of tuples to expire. A detailed and comprehensive experimental study demonstrates the well-behavedness and scalability of the approach as well as its efficiency with respect to a number of competitors.Comment: switched to TimeCenter latex styl

    O2-tree: a shared memory resident index in multicore architectures

    Get PDF
    Shared memory multicore computer architectures are now commonplace in computing. These can be found in modern desktops and workstation computers and also in High Performance Computing (HPC) systems. Recent advances in memory architecture and in 64-bit addressing, allow such systems to have memory sizes of the order of hundreds of gigabytes and beyond. This now allows for realistic development of main memory resident database systems. This still requires the use of a memory resident index such as T-Tree, and the B+-Tree for fast access to the data items. This thesis proposes a new indexing structure, called the O2-Tree, which is essentially an augmented Red-Black Tree in which the leaf nodes are index data blocks that store multiple pairs of key and value referred to as \key-value" pairs. The value is either the entire record associated with the key or a pointer to the location of the record. The internal nodes contain copies of the keys that split blocks of the leaf nodes in a manner similar to the B+-Tree. O2-Tree structure has the advantage that: it can be easily reconstructed by reading only the lowest value of the key of each leaf node page. The size is su ciently small and thus can be dumped and restored much faster. Analysis and comparative experimental study show that the performance of the O2-Tree is superior to other tree-based index structures with respect to various query operations for large datasets. We also present results which indicate that the O2-Tree outperforms popular key-value stores such as BerkelyDB and TreeDB of Kyoto Cabinet for various workloads. The thesis addresses various concurrent access techniques for the O2-Tree for shared memory multicore architecture and gives analysis of the O2-Tree with respect to query operations, storage utilization, failover and recovery

    Scapegoat trees: a comparative performance assessment

    Get PDF
    Η εργασία αυτή παρουσιάζει και αναλύει μια εναλλακτική μέθοδο για την αποδοτική εξισορρόπηση δυαδικών δέντρων αναζήτησης. Τα δέντρα scapegoat είναι ελαφρώς ισορροπημένα και ανακατασκευάζουν μέρη τους υπό ορισμένες συνθήκες. Η amortized χρονική πολυπλοκότητα για κάθε λειτουργία INSERT ή REMOVE είναι Ο(logn), ενώ η worst-case πραγματική χρονική πολυπλοκότητα μιας FIND είναι Ο(logn). Τα δέντρα scapegoat, σε αντίθεση με τις περισσότερες υλοποιήσεις αυτεξισορροπούμενων BST, δεν απαιτούν επιπλέον δεδομένα (π.χ. χρώματα, βάρη, ύψη) στους κόμβους των δέντρων. Χρησιμοποιούνται διάφορες μετρικές και δοκιμές για τη σύγκριση των δέντρων AVL και scapegoat ως προς τη λειτουργικότητά τους. Τέλος, παρέχουμε δεδομένα ως προς την επίδοση των δέντρων scapegoat σε πιθανές εφαρμογές πραγματικού χρόνου.This Thesis presents and analyzes an alternative tree scheme for the efficient balancing of binary search trees. Scapegoat trees are loosely balanced and restructure parts of themselves on certain conditions. The amortized time complexity for each INSERT or REMOVE operation is Ο(logn), while the worst-case real-time complexity of a FIND one is Ο(logn). Scapegoat trees, unlike most self-balancing BST implementations, do not require extra data (e.g. colors, weights, heights) in the tree nodes. Various metrics and tests are used to compare AVL and scapegoat trees on their functionality. Finally, we provide data as to the performance of scapegoat trees in potential real-time applications

    Interactive visualization of information hierarchies and applications on the web

    Get PDF
    The visualization of information hierarchies is concerned with the presentation of abstract hierarchical information about relationships between various entities. It has many applications in diverse domains such as software engineering, information systems, biology, and chemistry. Information hierarchies are typically modeled by an abstract tree, where vertices are entities and edges represent relationships between entities. The aim of visualizing tree drawings is to automatically produce drawings of trees which clearly reflect the relationships of the information hierarchy. This thesis is primarily concerned with problems related to the automatic generation of area-efficient grid drawings of trees, interactively visualizing information hierarchies, and applying our techniques on Web data. The main achievements of this thesis include: 1. An experimental study on algorithms that produce planar straight-line grid drawings of binary trees, 2. An experimental study that shows the algorithm for producing planar straight-line grid drawings of degree-d trees with n nodes with optimal linear area and with user-defined arbitrary aspect ratio, works well in practice, 3. A rings-based technique for interactively visualizing information hierarchies, in real-time, 4. A survey of Web visualization systems developed to address the lost in cyberspace problem, 5. A separation-based Web visualization system that we present as a viable solution to the lost in cyberspace problem, 6. A rings-based Web visualization system that we propose as a solution to the lost in cyberspace problem

    Phylogenetic and Functional Diversity of Saprolegniales and Fungi Isolated from Temperate Lakes in Northeast Germany

    Get PDF
    Funding: This research was funded by the Iranian Ministry of Science, Research, and Technology and DAAD (German Academic Exchange Service). The APC was funded by the Leibniz Association. Acknowledgments: The publication of this article was funded by the Open Access Fund of the Leibniz Association.Peer reviewedPublisher PD

    Search Tree Data Structures and Their Applications

    Get PDF
    This study concerns the discussion of search tree data structures and their applications. The thesis presents three new top-down updating algorithms for the concurrent data processing environment.Computing and Information Scienc

    A Smartphone-based Connected Vehicle Solution for Winter Road Surface Condition Monitoring

    Get PDF
    The monitoring of winter road surface conditions (RSCs) is essential to transportation agencies and the traveling public, since the former needs to be aware of the location and severity of existing RSCs in order to effectively maintain safe roadways with minimal environmental impact, while the latter uses RSC information to make informed travel decisions. However, current RSC monitoring practice still relies on methods that are time-consuming, labour-intensive and lacking in objectivity, therefore limiting their ability to provide sufficient spatial and temporal coverage across a road network. This research was motivated by the need for accurate, timely and reliable RSC monitoring for winter maintenance personnel and the travelling public. To achieve this objective, the field performance of a smartphone-based RSC monitoring system was evaluated on a section of Highway 6 in Ontario, Canada during the winter of 2014. A comparison between this system and current monitoring methods indicated that the former was capable of providing reliable results particularly at the maintenance route level; however, classification accuracy was found to vary according to RSC type. To improve the results produced by the smartphone-based system, this thesis proposes a connected- vehicle (CV) based RSC monitoring system that utilizes Road Weather Information System (RWIS) data in addition to the smartphone-based system’s data. Three techniques in artificial neural networks (ANNs), random trees (RTs), and random forests (RFs) were tested as the underlying models of the CV system, and the results indicated that all three models successfully increased the classification accuracy of the smartphone-based system. RFs were found to provide the most accurate RSC classifications for the standard (three-class) classification scheme while RTs were found to be most accurate when using a more detailed (five-class) classification scheme. Model transferability was also tested using data captured from a different test site during the winter of 2015; and it was found that although the proposed CV system significantly increased the reliability of RSC classifications, the underlying models were non-transferable and would therefore require local calibration before being used at different sites across a road network
    corecore