45 research outputs found

    Mixed-mode multicore reliability

    No full text

    Fingerprinting: Bounding Soft-Error-Detection Latency and Bandwidth

    No full text
    Recent studies have suggested that the soft-error rate in microprocessor logic will become a reliability concern by 2010. This paper proposes an efficient error-detection technique, called fingerprinting, that detects differences in execution across a dual modular redundant (DMR) processor pair. Fingerprinting summarizes a processor's execution history in a hash-based signature; differences between two mirrored processors are exposed by comparing their fingerprints. Fingerprinting tightly bounds detection latency and greatly reduces the interprocessor communication bandwidth required for checking. This paper evaluates fingerprinting against a range of current approaches to error detection. The results show that fingerprinting is the only error-detection mechanism that simultaneously allows high error coverage, low error-detection bandwidth, and high I/O performance.
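
    To make the idea concrete, below is a minimal sketch in C of how a hash-based fingerprint might accumulate a core's retired-instruction stream and be compared across a DMR pair. The CRC-32 accumulator, the per-retirement update granularity, and all names (retire_event_t, fp_update, fp_compare) are illustrative assumptions, not the paper's actual design; the paper's hash function and comparison interval differ.

    /* Sketch only: fingerprint-based error detection for a DMR pair,
     * assuming a bitwise CRC-32 as the hash. All names are hypothetical. */
    #include <stddef.h>
    #include <stdint.h>
    #include <stdbool.h>
    #include <stdio.h>

    typedef struct {
        uint64_t pc;      /* program counter of the retired instruction */
        uint64_t result;  /* value written to the register file or memory */
    } retire_event_t;

    /* Fold one retired-instruction event into the running fingerprint. */
    static uint32_t fp_update(uint32_t fp, const retire_event_t *ev)
    {
        const uint8_t *p = (const uint8_t *)ev;
        for (size_t i = 0; i < sizeof *ev; i++) {
            fp ^= p[i];
            for (int b = 0; b < 8; b++)            /* bitwise CRC-32 step */
                fp = (fp >> 1) ^ (0xEDB88320u & -(fp & 1u));
        }
        return fp;
    }

    /* At each checkpoint the mirrored cores exchange only their 32-bit
     * fingerprints instead of full execution traces; a mismatch signals
     * that an error occurred somewhere within the last interval. */
    static bool fp_compare(uint32_t fp_core0, uint32_t fp_core1)
    {
        return fp_core0 == fp_core1;  /* false => divergence detected */
    }

    int main(void)
    {
        retire_event_t ev = { .pc = 0x400000, .result = 42 };
        uint32_t fp0 = fp_update(0, &ev);   /* core 0 */
        uint32_t fp1 = fp_update(0, &ev);   /* core 1, identical execution */
        ev.result ^= 1;                     /* inject a single-bit error */
        uint32_t fp_bad = fp_update(0, &ev);
        printf("match: %d, after error: %d\n",
               fp_compare(fp0, fp1), fp_compare(fp0, fp_bad));
        return 0;
    }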

    TRUSS: A Reliable, Scalable Server Architecture

    No full text
    Traditional techniques that mainframes use to increase reliability - special hardware or custom software - are incompatible with commodity server requirements. The TRUSS architecture provides reliable, scalable computation for unmodified application software in a distributed shared-memory multiprocessor.

    Adapting to Intermittent Faults in Multicore Systems

    No full text
    Future multicore processors will become more susceptible to a variety of hardware failures. In particular, intermittent faults, caused in part by manufacturing process variation or in-progress wear-out, can cause bursts of frequent faults that last from several cycles to several seconds or more. Cost-effective tolerance of intermittent faults will likely require, or be greatly simplified by, the ability to temporarily suspend execution on a core during periods of frequent intermittent faults. We investigate three existing techniques for adapting to the dynamically changing resource availability caused by such core suspension, and demonstrate their different system-level implications. We show that system-software reconfiguration has very high overhead for short intermittent faults, that temporarily pausing the execution of a faulty core can lead to cascading livelock, and that using spare cores has a high fault-free cost. To remedy these and other drawbacks of current techniques, we propose using a thin hardware/firmware layer to manage an overcommitted system -- one where the OS is configured to use more virtual processors than the number of currently available physical cores. We show that this technique can gracefully degrade performance during intermittent faults of various durations with low overhead, without involving system software, and without requiring spare cores.
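
    As a rough illustration of the overcommitted-system idea, the sketch below in C maps a fixed set of OS-visible virtual processors onto whichever physical cores are currently healthy; suspending a core simply triggers a remap, so the OS never observes a change in processor count. The round-robin policy and all names here are hypothetical assumptions, not the paper's hardware/firmware mechanism.

    /* Sketch only: vCPU-to-core remapping in an overcommitted system.
     * The OS sees NUM_VCPUS processors regardless of how many physical
     * cores are currently usable. Names and policy are illustrative. */
    #include <stdbool.h>
    #include <stdio.h>

    #define NUM_VCPUS  8   /* virtual processors exposed to the OS */
    #define NUM_CORES  4   /* physical cores in the machine */

    static bool core_healthy[NUM_CORES] = { true, true, true, true };
    static int  vcpu_to_core[NUM_VCPUS];

    /* Remap all virtual processors onto the currently healthy cores.
     * When a core is suspended, its vCPUs are spread over the survivors,
     * degrading performance gracefully without OS involvement. */
    static void remap_vcpus(void)
    {
        int healthy[NUM_CORES], n = 0;
        for (int c = 0; c < NUM_CORES; c++)
            if (core_healthy[c]) healthy[n++] = c;
        for (int v = 0; v < NUM_VCPUS; v++)
            vcpu_to_core[v] = healthy[v % n];   /* simple round-robin */
    }

    int main(void)
    {
        remap_vcpus();              /* initial 2-vCPUs-per-core mapping */
        core_healthy[2] = false;    /* core 2 enters a fault burst */
        remap_vcpus();              /* its vCPUs migrate to healthy cores */
        for (int v = 0; v < NUM_VCPUS; v++)
            printf("vcpu %d -> core %d\n", v, vcpu_to_core[v]);
        return 0;
    }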