Prototype of Fault Adaptive Embedded Software for Large-Scale Real-Time Systems

Author: Haney Michael
Jung Mina
Messie Derek
Nordstrom Steven
Oh Jae C.
Shetty Shweta
Publication date: 01/01/2005

This paper describes a comprehensive prototype of large-scale fault adaptive embedded software developed for the proposed Fermilab BTeV high energy physics experiment. Lightweight self-optimizing agents embedded within Level 1 of the prototype are responsible for proactive and reactive monitoring and mitigation based on specified layers of competence. The agents are self-protecting, detecting cascading failures using a distributed approach. Adaptive, reconfigurable, and mobile objects for reliability are designed to be self-configuring to adapt automatically to dynamically changing environments. These objects provide a self-healing layer with the ability to discover, diagnose, and react to discontinuities in real-time processing. A generic modeling environment was developed to facilitate design and implementation of hardware resource specifications, application data flow, and failure mitigation strategies. Level 1 of the planned BTeV trigger system alone will consist of 2500 DSPs, so the number of components and intractable fault scenarios involved make it impossible to design an “expert system” that applies traditional centralized mitigative strategies based on rules capturing every possible system state. Instead, a distributed reactive approach is implemented using the tools and methodologies developed by the Real-Time Embedded Systems group. Comment: 2nd Workshop on Engineering of Autonomic Systems (EASe), in the 12th Annual IEEE International Conference and Workshop on the Engineering of Computer Based Systems (ECBS), Washington, DC, April, 200

arXiv.org e-Print Archive

CiteSeerX

Syracuse University Research Facility and Collaborative Environment

Fuzzy Scheduling Applied on Hydroelectric Power Generation

Author: Alejandro Diaz-Sanchez
Carlos Gracios-Marin
Eduardo Lebano-Perez
Esteban Molina Flores
Gerardo Mino-Aguilar
German A. Munoz-Hernandez
José Fermi Guerrero-Castellanos
Publication venue: IntechOpen
Publication date: 09/03/2012

IntechOpen

Quantum information and statistical mechanics: an introduction to frontier

Author: Fujii Keisuke
Publication date: 28/06/2013

This is a short review of an interdisciplinary field spanning quantum information science and statistical mechanics. We first give a pedagogical introduction to the stabilizer formalism, which is an efficient way to describe an important class of quantum states, the so-called stabilizer states, and quantum operations on them. Furthermore, graph states, which are a class of stabilizer states associated with graphs, and their applications to measurement-based quantum computation are also mentioned. Based on the stabilizer formalism, we review two interdisciplinary topics. One is the relation between quantum error correction codes and spin glass models, which allows us to analyze the performance of quantum error correction codes by using knowledge about phases in statistical models. The other is the relation between the stabilizer formalism and partition functions of classical spin models, which provides new quantum and classical algorithms to evaluate partition functions of classical spin models. Comment: 15 pages, 4 figures, to appear in Proceedings of 4th YSM-SPIP (Sendai, 14-16 December 2012)

arXiv.org e-Print Archive

Tohoku University Repository (TOUR) / 東北大学機関リポジトリ

Keeping checkpoint/restart viable for exascale systems

Author: Ferreira Kurt
Publication venue: UNM Digital Repository
Publication date: 01/12/2011

Next-generation exascale systems, those capable of performing a quintillion operations per second, are expected to be delivered in the next 8-10 years. These systems, which will be 1,000 times faster than current systems, will be of unprecedented scale. As these systems continue to grow in size, faults will become increasingly common, even over the course of small calculations. Therefore, issues such as fault tolerance and reliability will limit application scalability. Current techniques for ensuring progress across faults, such as checkpoint/restart, the dominant fault tolerance mechanism for the last 25 years, are increasingly problematic at the scales of future systems due to their excessive overheads. In this work, we evaluate a number of techniques to decrease the overhead of checkpoint/restart and keep this method viable for future exascale systems. More specifically, this work evaluates state-machine replication to dramatically increase the checkpoint interval (the time between successive checkpoints) and hash-based, probabilistic incremental checkpointing using graphics processing units to decrease the checkpoint commit time (the time to save one checkpoint). Using a combination of empirical analysis, modeling, and simulation, we study the costs and benefits of these approaches over a wide range of parameters. These results, which cover a number of high-performance computing capability workloads, different failure distributions, hardware mean times to failure, and I/O bandwidths, show the potential benefits of these techniques for meeting the reliability demands of future exascale platforms.

Prototype of Fault Adaptive Embedded Software for Large-Scale Real-Time Systems

Author: Haney Michael
Jung Mina
Messie Derek
Nordstrom Steven
Oh Jae C.
Shetty Shweta
Publication venue: SURFACE at Syracuse University
Publication date: 01/01/2005

This paper describes a comprehensive prototype of large-scale fault adaptive embedded software developed for the proposed Fermilab BTeV high energy physics experiment. Lightweight self-optimizing agents embedded within Level 1 of the prototype are responsible for proactive and reactive monitoring and mitigation based on specified layers of competence. The agents are self-protecting, detecting cascading failures using a distributed approach. Adaptive, reconfigurable, and mobile objects for reliability are designed to be self-configuring to adapt automatically to dynamically changing environments. These objects provide a self-healing layer with the ability to discover, diagnose, and react to discontinuities in real-time processing. A generic modeling environment was developed to facilitate design and implementation of hardware resource specifications, application data flow, and failure mitigation strategies. Level 1 of the planned BTeV trigger system alone will consist of 2500 DSPs, so the number of components and intractable fault scenarios involved make it impossible to design an “expert system” that applies traditional centralized mitigative strategies based on rules capturing every possible system state. Instead, a distributed reactive approach is implemented using the tools and methodologies developed by the Real-Time Embedded Systems group.

Syracuse University Research Facility and Collaborative Environment

Nanofabric Power Analysis: Biosequence Alignment Case of Study

Author: Amarù L.G.
Frache Stefano
Graziano Mariagrazia
Zamboni Maurizio
Publication venue: IEEE/ACM
Publication date: 01/01/2011

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino