134,644 research outputs found

    A random forest system combination approach for error detection in digital dictionaries

    Full text link
    When digitizing a print bilingual dictionary, whether via optical character recognition or manual entry, it is inevitable that errors are introduced into the electronic version that is created. We investigate automating the process of detecting errors in an XML representation of a digitized print dictionary using a hybrid approach that combines rule-based, feature-based, and language model-based methods. We investigate combining methods and show that using random forests is a promising approach. We find that in isolation, unsupervised methods rival the performance of supervised methods. Random forests typically require training data so we investigate how we can apply random forests to combine individual base methods that are themselves unsupervised without requiring large amounts of training data. Experiments reveal empirically that a relatively small amount of data is sufficient and can potentially be further reduced through specific selection criteria.Comment: 9 pages, 7 figures, 10 tables; appeared in Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, April 201

    The FORS Deep Field: Field selection, photometric observations and photometric catalog

    Get PDF
    The FORS Deep Field project is a multi-colour, multi-object spectroscopic investigation of an approx. 7 times 7 region near the south galactic pole based mostly on observations carried out with the FORS instruments attached to the VLT telescopes. It includes the QSO Q 0103-260 (z = 3.36). The goal of this study is to improve our understanding of the formation and evolution of galaxies in the young Universe. In this paper the field selection, the photometric observations, and the data reduction are described. The source detection and photometry of objects in the FORS Deep Field is discussed in detail. A combined B and I selected UBgRIJKs photometric catalog of 8753 objects in the FDF is presented and its properties are briefly discussed. The formal 50% completeness limits for point sources, derived from the co-added images, are 25.64, 27.69, 26.86, 26.68, 26.37, 23.60 and 21.57 in U, B, g, R, I, J and Ks (Vega-system), respectively. A comparison of the number counts in the FORS Deep Field to those derived in other deep field surveys shows very good agreement.Comment: 15 pages, 11 figures (included), accepted for publication in A&

    A methodology for the generation of efficient error detection mechanisms

    Get PDF
    A dependable software system must contain error detection mechanisms and error recovery mechanisms. Software components for the detection of errors are typically designed based on a system specification or the experience of software engineers, with their efficiency typically being measured using fault injection and metrics such as coverage and latency. In this paper, we introduce a methodology for the design of highly efficient error detection mechanisms. The proposed methodology combines fault injection analysis and data mining techniques in order to generate predicates for efficient error detection mechanisms. The results presented demonstrate the viability of the methodology as an approach for the development of efficient error detection mechanisms, as the predicates generated yield a true positive rate of almost 100% and a false positive rate very close to 0% for the detection of failure-inducing states. The main advantage of the proposed methodology over current state-of-the-art approaches is that efficient detectors are obtained by design, rather than by using specification-based detector design or the experience of software engineers

    Evolution of shuttle avionics redundancy management/fault tolerance

    Get PDF
    The challenge of providing redundancy management (RM) and fault tolerance to meet the Shuttle Program requirements of fail operational/fail safe for the avionics systems was complicated by the critical program constraints of weight, cost, and schedule. The basic and sometimes false effectivity of less than pure RM designs is addressed. Evolution of the multiple input selection filter (the heart of the RM function) is discussed with emphasis on the subtle interactions of the flight control system that were found to be potentially catastrophic. Several other general RM development problems are discussed, with particular emphasis on the inertial measurement unit RM, indicative of the complexity of managing that three string system and its critical interfaces with the guidance and control systems

    System Description for a Scalable, Fault-Tolerant, Distributed Garbage Collector

    Full text link
    We describe an efficient and fault-tolerant algorithm for distributed cyclic garbage collection. The algorithm imposes few requirements on the local machines and allows for flexibility in the choice of local collector and distributed acyclic garbage collector to use with it. We have emphasized reducing the number and size of network messages without sacrificing the promptness of collection throughout the algorithm. Our proposed collector is a variant of back tracing to avoid extensive synchronization between machines. We have added an explicit forward tracing stage to the standard back tracing stage and designed a tuned heuristic to reduce the total amount of work done by the collector. Of particular note is the development of fault-tolerant cooperation between traces and a heuristic that aggressively reduces the set of suspect objects.Comment: 47 pages, LaTe

    Fault-Tolerant Dot-Product Engines

    Full text link
    Coding schemes are presented that provide the ability to correct and detect computational errors while using dot-product engines for integer vector--matrix multiplication. Both the L1L_1-metric and the Hamming metric are considered

    Publications of the Jet Propulsion Laboratory, July 1961 through June 1962

    Get PDF
    Jpl bibliography on space science, 1961-196
    corecore