101,078 research outputs found

    Survivable algorithms and redundancy management in NASA's distributed computing systems

    Get PDF
    The design of survivable algorithms requires a solid foundation for executing them. While hardware techniques for fault-tolerant computing are relatively well understood, fault-tolerant operating systems, as well as fault-tolerant applications (survivable algorithms), are, by contrast, little understood, and much more work in this field is required. We outline some of our work that contributes to the foundation of ultrareliable operating systems and fault-tolerant algorithm design. We introduce our consensus-based framework for fault-tolerant system design. This is followed by a description of a hierarchical partitioning method for efficient consensus. A scheduler for redundancy management is introduced, and application-specific fault tolerance is described. We give an overview of our hybrid algorithm technique, which is an alternative to the formal approach given

    Fault diagnosis and fault-tolerant control for nonlinear systems with linear output structure

    Get PDF
    Article describes the process of fault diagnosis and fault-tolerant control for nonlinear systems with linear output structure

    A droplet routing technique for fault-tolerant digital microfluidic devices

    Get PDF
    Abstract—Efficient droplet routing is one of the key approaches for realizing fault-tolerant microfluidic biochips. It requires that run-time diagnosis and fault recovery can be made possible in such systems. This paper describes a droplet routing technique for a fault-tolerant digital microfluidic platform. This technique features handling of many microfluidic operations simultaneously and uses on-chip sensors for diagnosis at run-time.\ud Once a fault is detected during the droplet routing, recovery procedures will be started-up immediately. Faulty units on the chip will be marked and isolated from the array so that the remaining droplets can still be routed along a fault-free path to their destinations. This method guarantees a non-stop fault-tolerant operation for very large microfluidic arrays.\u
    • …
    corecore