Passive Fault-Tolerance Management in Component-Based Embedded Systems

Authors: Coelho Jorge; Nogueira Luís
Publication venue: Institute of Informatics, Slovak Academy of Sciences
Publication date: 01/01/2015

It is imperative to accept that failures can and will occur even in meticulously designed distributed systems and to design proper measures to counter those failures. Passive replication minimizes resource consumption by only activating redundant replicas in case of failures, as typically, providing and applying state updates is less resource demanding than requesting execution. However, most existing solutions for passive fault tolerance are designed and configured at design time, explicitly and statically identifying the most critical components and their number of replicas, and so lack the flexibility needed to handle the runtime dynamics of distributed component-based embedded systems. This paper proposes a cost-effective adaptive fault tolerance solution with a significantly lower overhead compared to a strict active redundancy-based approach, achieving a high error coverage with a minimum amount of redundancy. The activation of passive replicas is coordinated through a feedback-based coordination model that reduces the complexity of the needed interactions among components until a new collective global service solution is determined, hence improving the overall maintainability and robustness of the system.
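The passive replication idea in this abstract (only the primary executes requests, backups merely apply state updates, and a backup is activated on failure) can be illustrated with a minimal Python sketch. This is not the paper's adaptive, feedback-based coordination model; the names `Replica`, `PassiveGroup`, and the counter-style state are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """Hypothetical component copy; only the primary executes requests."""
    name: str
    state: dict = field(default_factory=dict)
    alive: bool = True

    def execute(self, key: str, value: int) -> dict:
        # Executing a request is the expensive step, done by the primary only.
        if not self.alive:
            raise RuntimeError(f"{self.name} has failed")
        self.state[key] = self.state.get(key, 0) + value
        return dict(self.state)  # snapshot to ship to the backups

    def apply_update(self, snapshot: dict) -> None:
        # Backups only copy the primary's state, which is the cheap step.
        self.state = dict(snapshot)


class PassiveGroup:
    """Assumed coordinator: one primary plus an ordered list of backups."""

    def __init__(self, primary: Replica, backups: list):
        self.primary = primary
        self.backups = list(backups)

    def request(self, key: str, value: int) -> dict:
        try:
            snapshot = self.primary.execute(key, value)
        except RuntimeError:
            # Failure detected: activate the first passive replica.
            # (Raises IndexError if no backup is left.)
            self.primary = self.backups.pop(0)
            snapshot = self.primary.execute(key, value)
        for backup in self.backups:
            backup.apply_update(snapshot)
        return snapshot


group = PassiveGroup(Replica("primary"), [Replica("b1"), Replica("b2")])
group.request("x", 2)
group.primary.alive = False      # simulate a crash of the primary
result = group.request("x", 3)   # b1 takes over from the replicated state
```

Because every backup already holds the last snapshot, the promoted replica continues from the replicated state rather than starting from scratch.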

Repositório Científico do Instituto Politécnico do Porto

Computing and Informatics (E-Journal - Institute of Informatics, SAS, Bratislava)

Implementing fault tolerant applications using reflective object-oriented programming

Authors: Fabre Jean-Charles; Nicomette Vincent; Pérennou Tanguy; Stroud Robert; Wu Zhixue
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/06/1995

This paper shows how reflection and object-oriented programming can be used to ease the implementation of classical fault tolerance mechanisms in distributed applications. When the underlying runtime system does not provide fault tolerance transparently, classical approaches to implementing fault tolerance mechanisms often imply mixing functional programming with non-functional programming (e.g. error processing mechanisms). The use of reflection improves the transparency of fault tolerance mechanisms to the programmer and more generally provides a clearer separation between functional and non-functional programming. The implementations of some classical replication techniques using a reflective approach are presented in detail and illustrated by several examples, which have been prototyped on a network of Unix workstations. Lessons learnt from the experiments are drawn and future work is discussed.

Open Archive Toulouse Archive Ouverte

Time-efficient fault detection and diagnosis system for analog circuits

Authors: He Yigang; Luo Qiwu; Sun Yichuang
Publication venue: Informa UK Limited
Publication date: 01/01/2018

Time-efficient fault analysis and diagnosis of analog circuits are the most important prerequisites for online health monitoring of electronic equipment, which faces continuing challenges of ultra-large-scale integration, component tolerance, and limited test points but multiple faults. This work reports an FPGA (field programmable gate array)-based analog fault diagnostic system that applies two-dimensional information fusion, two-port network analysis and interval math theory. The proposed system has three advantages over traditional ones. First, it possesses high processing speed and smart circuit size, as the embedded algorithms execute in parallel on the FPGA. Second, the hardware structure is compatible with other diagnostic algorithms. Third, the built-in Ethernet interface enhances its flexibility for remote monitoring and control. The experimental results obtained from two realistic example circuits indicate that the proposed methodology yields competitive performance in both diagnosis accuracy and time-effectiveness, with about 96% accuracy within 60 ms of computation time.

Directory of Open Access Journals

University of Hertfordshire Research Archive

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Real-time and fault tolerance in distributed control software

Authors: Broenink J.F.; Orlic B.
Publication venue: IOS Press
Publication date: 01/01/2003

Closed-loop control systems typically contain a multitude of spatially distributed sensors and actuators operated simultaneously, so those systems are parallel and distributed in their essence. But mapping this parallelism onto the given distributed hardware architecture brings in additional requirements: safe multithreading, optimal process allocation, and real-time scheduling of bus and network resources. Nowadays, fault tolerance methods and fast, even online, reconfiguration are becoming increasingly important. All those often conflicting requirements make the design and implementation of real-time distributed control systems an extremely difficult task that requires substantial knowledge in several areas of control and computer science. Although many design methods have been proposed so far, none of them has succeeded in covering all important aspects of the problem at hand. [1] The continuous increase of production in the embedded market makes a simple and natural design methodology for real-time systems needed more than ever.

CiteSeerX

University of Twente Research Information

Space Station Freedom data management system growth and evolution report

Authors: Bartlett R.; Davis G.; Gibson J.; Grant T. L.; Hedges R.; Johnson M. J.; Liu Y. K.; Patterson-Hine A.; Sliwa N.; Sowizral H.

The Information Sciences Division at the NASA Ames Research Center has completed a 6-month study of portions of the Space Station Freedom Data Management System (DMS). This study looked at the present capabilities and future growth potential of the DMS, and the results are documented in this report. Issues have been raised that were discussed with the appropriate Johnson Space Center (JSC) management and Work Package-2 contractor organizations. Areas requiring additional study have been identified and suggestions for long-term upgrades have been proposed. This activity has allowed the Ames personnel to develop a rapport with the JSC civil service and contractor teams that permits an independent check and balance technique for the DMS.

NASA Technical Reports Server

Flexible and dynamic replication control for interdependent distributed real-time embedded systems

Authors: D. Gelernter; H. Kopetz; J. Dowling; L. Nogueira; L.M. Pinho; M. Shankar; R. Guerraoui; R. Rajkumar; Z. Cai
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2010

Replication is a proven concept for increasing the availability of distributed systems. However, actively replicating every software component in distributed embedded systems may not be a feasible approach. Not only are the available resources often limited, but the imposed overhead could also significantly degrade the system’s performance. This paper proposes heuristics to dynamically determine which components to replicate based on their significance to the system as a whole, their consequent number of passive replicas, and where to place those replicas in the network. The activation of passive replicas is coordinated through a fast convergence protocol that reduces the complexity of the needed interactions among nodes until a new collective global service solution is determined.

Repositório Científico do Instituto Politécnico do Porto

Crossref

Design of an integrated airframe/propulsion control system architecture

Authors: Cohen Gerald C.; Lee C. William; Strickland Michael J.; Torkelson Thomas C.

The design of an integrated airframe/propulsion control system architecture is described. The design is based on a prevalidation methodology that uses both reliability and performance. A detailed account is given of the testing associated with a subset of the architecture, followed by general observations on applying the methodology to the architecture.

NASA Technical Reports Server

The integration of on-line monitoring and reconfiguration functions using IEEE1149.4 into a safety critical automotive electronic control unit.

Authors: Cutajar R.; Jeffery C.; Lickess M.; Prosser S.; Richardson Andrew M. D.; Riches S.
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2005

This paper presents an innovative application of IEEE 1149.4 and the integrated diagnostic reconfiguration (IDR) as tools for the implementation of an embedded test solution for an automotive electronic control unit, implemented as a fully integrated mixed signal system. The paper describes how the test architecture can be used for fault avoidance and presents results from a hardware prototype. The paper concludes that fault avoidance can be integrated into mixed signal electronic systems to handle key failure modes.

Lancaster E-Prints