
    Reliable High Performance Peta- and Exa-Scale Computing

    As supercomputers become larger and more powerful, they grow increasingly complex. This is reflected both in the exponentially increasing number of components in HPC systems (LLNL is currently installing the 1.6-million-core Sequoia system) and in the wide variety of software and hardware components that a typical system includes. At this scale it becomes infeasible to make each component reliable enough to prevent regular faults somewhere in the system, or to account for all possible cross-component interactions. The resulting faults and instability cause HPC applications to crash, perform sub-optimally, or even produce erroneous results. As supercomputers continue to approach Exascale performance and full system reliability becomes prohibitively expensive, we will require novel techniques to bridge the gap between the lower reliability provided by hardware systems and users' unchanging need for consistent performance and reliable results. Previous research on HPC system reliability has developed techniques for tolerating and detecting various types of faults. However, these techniques have seen very limited real-world applicability because of our poor understanding of how real systems are affected by complex faults such as soft-fault-induced bit flips or performance degradations. Prior work has generally focused on analyzing the behavior of entire software/hardware systems, both during normal operation and in the face of faults. Because such behaviors are extremely complex, these studies have produced only coarse behavioral models of limited sets of software/hardware stacks. Since this provides little insight into the many different system stacks and applications used in practice, this work has had little real-world impact.
My project addresses this problem by developing a modular methodology to analyze the behavior of applications and systems during both normal and faulty operation. By synthesizing models of individual components into whole-system behavior models, my work is making it possible to automatically understand the behavior of arbitrary real-world systems and enable them to tolerate a wide range of system faults. My project follows a multi-pronged research strategy. Section II discusses my work on modeling the behavior of existing applications and systems: Section II.A discusses resilience in the face of soft faults, and Section II.B looks at techniques to tolerate performance faults. Finally, Section III presents an alternative approach that studies how a system should be designed from the ground up to make resilience natural and easy.

    Soft Error Vulnerability of Iterative Linear Algebra Methods

    Devices are increasingly vulnerable to soft errors as their feature sizes shrink. Previously, soft error rates were significant primarily in space and high-atmospheric computing. Modern architectures now use such small features at such low voltages that soft errors are becoming important even at terrestrial altitudes. Due to their large number of components, supercomputers are particularly susceptible to soft errors. Since many large-scale parallel scientific applications use iterative linear algebra methods, the soft error vulnerability of these methods constitutes a large fraction of the applications' overall vulnerability. Many users consider these methods invulnerable to most soft errors because they converge from an imprecise solution to a precise one. However, we show in this paper that iterative methods are vulnerable to soft errors, exhibiting both silent data corruptions and a poor ability to detect errors. Further, we evaluate a variety of soft error detection and tolerance techniques, including checkpointing, linear matrix encodings, and residual tracking.
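One of the detection techniques the abstract names, linear matrix encodings, can be illustrated with a Huang-Abraham-style checksum: an extra checksum row appended to the matrix lets a single corrupted entry of a matrix-vector product be detected without recomputing it. The sketch below is illustrative only; the function names and tolerance are our own choices, not the paper's implementation.

```python
# Sketch of a checksum (ABFT) check for a matrix-vector product y = A x.
# The column sums of A form one extra "checksum" row; in exact arithmetic,
# sum(y) must equal (checksum row) . x, so a mismatch flags corruption.

def matvec(A, x):
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in A]

def encode_checksum_row(A):
    # Column sums of A form the checksum row.
    return [sum(col) for col in zip(*A)]

def check_matvec(A, x, y, tol=1e-9):
    c = encode_checksum_row(A)
    expected = sum(c_j * x_j for c_j, x_j in zip(c, x))
    return abs(sum(y) - expected) <= tol

A = [[2.0, 1.0], [0.0, 3.0]]
x = [1.0, 4.0]
y = matvec(A, x)            # [6.0, 12.0]
assert check_matvec(A, x, y)

y[1] += 0.5                 # simulate a soft-error corruption of y
assert not check_matvec(A, x, y)
```

The check costs one extra dot product per matrix-vector multiply, which is why such encodings are attractive for large sparse solvers.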

    Soft Error Vulnerability of Iterative Linear Algebra Methods

    Devices become increasingly vulnerable to soft errors as their feature sizes shrink. Previously, soft errors primarily caused problems for space and high-atmospheric computing applications. Modern architectures now use such small features at such low voltages that soft errors are becoming significant even at terrestrial altitudes. The soft error vulnerability of iterative linear algebra methods, which many scientific applications use, is a critical aspect of overall application vulnerability. These methods are often considered invulnerable to many soft errors because they converge from an imprecise solution to a precise one. However, we show that iterative methods can be vulnerable to soft errors, with a high rate of silent data corruptions. We quantify this vulnerability, with algorithms generating up to 8.5% erroneous results when subjected to a single bit flip. Further, we show that detecting soft errors in an iterative method depends on its detailed convergence properties and requires more complex mechanisms than simply checking the residual. Finally, we explore inexpensive techniques to tolerate soft errors in these methods.
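The claim that checking the residual is not enough can be demonstrated with a small fault-injection experiment: if a bit flip lands in the stored matrix, the iteration happily converges on the corrupted system, so the residual a solver would compute (against its in-memory data) looks fine even though the answer is wrong. This sketch uses a plain Jacobi iteration and our own choice of flipped bit; it is not the paper's experimental setup.

```python
# Inject a single bit flip into the matrix during a Jacobi solve and
# observe a silent data corruption: the run still converges, but to
# the solution of the corrupted system.

import struct

def flip_bit(x, bit):
    # Flip one bit of an IEEE-754 double via its 64-bit representation.
    (bits,) = struct.unpack("<Q", struct.pack("<d", x))
    return struct.unpack("<d", struct.pack("<Q", bits ^ (1 << bit)))[0]

def jacobi(A, b, iters=300, flip_at=None):
    # Plain Jacobi iteration; optionally corrupt A[0][1] at step flip_at.
    x = [0.0] * len(b)
    for k in range(iters):
        if k == flip_at:
            A[0][1] = flip_bit(A[0][1], 60)  # exponent bit: 1.0 -> ~2**-256
        x = [(b[i] - sum(A[i][j] * x[j] for j in range(len(b)) if j != i))
             / A[i][i] for i in range(len(b))]
    return x

def residual(A, x, b):
    return max(abs(sum(A[i][j] * x[j] for j in range(len(x))) - b[i])
               for i in range(len(b)))

A_good = [[4.0, 1.0], [1.0, 3.0]]
b = [1.0, 2.0]
x_good = jacobi([r[:] for r in A_good], b)

A_bad = [r[:] for r in A_good]
x_bad = jacobi(A_bad, b, flip_at=5)   # A_bad is mutated by the flip

# Residual against the in-memory (corrupted) matrix is tiny, so a naive
# residual check passes -- yet the answer is wrong: a silent corruption.
assert residual(A_bad, x_bad, b) < 1e-9
assert residual(A_good, x_bad, b) > 1e-2
```

Flips that hit the iterate itself are often absorbed by reconvergence, which is exactly why the vulnerability depends on where the fault strikes and on the method's convergence behavior.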

    CLOMP: Accurately Characterizing OpenMP Application Overheads

    Despite its ease of use, OpenMP has failed to gain widespread use on large-scale systems, largely because it has not delivered sufficient performance. Our experience indicates that the cost of initiating OpenMP regions is simply too high for the desired OpenMP usage scenario of many applications. In this paper, we introduce CLOMP, a new benchmark that accurately characterizes this aspect of OpenMP implementations. CLOMP complements the existing EPCC benchmark suite by providing simple, easy-to-understand measurements of OpenMP overheads in the context of application usage scenarios. Our results for several OpenMP implementations demonstrate that CLOMP identifies the amount of work required to compensate for the overheads observed with EPCC. Further, we show that CLOMP also captures the limitations of OpenMP parallelization on NUMA systems.
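The "amount of work required to compensate for the overheads" has a simple back-of-envelope form: if starting a parallel region costs a fixed overhead and the region splits W seconds of serial work across p threads, the parallel version wins only when W exceeds overhead * p / (p - 1). The arithmetic below is our own illustration of this break-even point, not CLOMP's actual methodology, and the 5-microsecond overhead is an assumed placeholder.

```python
# Back-of-envelope model of OpenMP region overhead, in the spirit of
# what CLOMP measures. A region costing `overhead` seconds to start,
# splitting `work` seconds of serial work across `threads` threads,
# runs in roughly overhead + work/threads seconds.

def parallel_time(work, threads, overhead):
    return overhead + work / threads

def break_even_work(threads, overhead):
    # Smallest serial work that amortizes the region-start overhead:
    # overhead + W/p < W  =>  W > overhead * p / (p - 1)
    return overhead * threads / (threads - 1)

overhead = 5e-6          # assumed: 5 us to fork/join a region
threads = 8
w = break_even_work(threads, overhead)

# Below the break-even point, the "parallel" version is slower:
assert parallel_time(w / 2, threads, overhead) > w / 2
# Well above it, it wins:
assert parallel_time(100 * w, threads, overhead) < 100 * w
```

This is why region-start cost dominates for applications whose parallel regions are short: the work per region never reaches the break-even point.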

    Detailed Modeling, Design, and Evaluation of a Scalable Multi-level Checkpointing System

    High-performance computing (HPC) systems are growing more powerful by utilizing more hardware components. As the system mean time before failure correspondingly drops, applications must checkpoint more frequently to make progress. However, because system memory sizes are growing faster than the bandwidth to the parallel file system, the cost of checkpointing is beginning to dominate application run times. A potential solution to this problem is multi-level checkpointing, which employs multiple types of checkpoints with different costs and different levels of resiliency in a single run. The goal is to use light-weight checkpoints to handle the most common failure modes and to rely on more expensive checkpoints for less common but more severe failures. While this approach is theoretically promising, it has not been fully evaluated in a large-scale, production-system context. To this end, we have designed the Scalable Checkpoint/Restart (SCR) library, which writes checkpoints to storage on the compute nodes, utilizing RAM, Flash, or disk, in addition to the parallel file system. We present the performance and reliability properties of SCR as well as a probabilistic Markov model that predicts its performance on current and future systems. We show that multi-level checkpointing improves efficiency on existing large-scale systems and that this benefit increases as system size grows. In particular, we developed low-cost checkpoint schemes that are 100x-1000x faster than the parallel file system and effective against 85% of our system failures. This leads to a gain in machine efficiency of up to 35%, and it reduces the load on the parallel file system by a factor of two on current and future systems.
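The intuition behind multi-level checkpointing can be sketched with Young's classic approximation (optimal checkpoint interval ~ sqrt(2 * checkpoint_cost * MTBF)) applied per level. This is a much-simplified estimate, not the Markov model from the paper, and every number below is a made-up placeholder; it only shows why cheap node-local checkpoints covering 85% of failures cut the total checkpoint overhead.

```python
# Simplified two-level checkpoint overhead estimate using Young's
# approximation per level. NOT the paper's Markov model; all
# parameters are assumed placeholders.

import math

def young_interval(ckpt_cost, mtbf):
    # Young's approximation for the optimal checkpoint interval.
    return math.sqrt(2.0 * ckpt_cost * mtbf)

def overhead_fraction(ckpt_cost, mtbf):
    # First-order fraction of time spent writing checkpoints at the
    # optimal interval (ignores rework and restart time).
    return ckpt_cost / young_interval(ckpt_cost, mtbf)

mtbf = 24 * 3600.0        # assumed: one failure per day
pfs_cost = 600.0          # assumed: 10 min per parallel-file-system checkpoint
local_cost = 6.0          # assumed: node-local checkpoint ~100x faster

# Single level: every checkpoint goes to the parallel file system.
single = overhead_fraction(pfs_cost, mtbf)

# Two levels: node-local checkpoints cover the 85% of failures that are
# recoverable locally; PFS checkpoints cover the remaining 15%, which
# arrive with a correspondingly longer effective MTBF.
two_level = (overhead_fraction(local_cost, mtbf / 0.85)
             + overhead_fraction(pfs_cost, mtbf / 0.15))

assert two_level < single   # cheap local checkpoints cut total overhead
```

Because the rare severe failures see a longer effective MTBF, the expensive PFS checkpoints can be taken far less often, which is also where the reduced parallel-file-system load comes from.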

    Mammalian microRNA: an important modulator of host-pathogen interactions in human viral infections

    MicroRNAs (miRNAs), which are small non-coding RNAs expressed by almost all metazoans, have key roles in the regulation of cell differentiation, organism development and gene expression. Thousands of miRNAs regulating approximately 60% of the total human genome have been identified. They regulate gene expression either by direct cleavage or by translational repression of the target mRNAs recognized through partial complementary base pairing. The active and functional unit of miRNA is its complex with Argonaute proteins, known as the microRNA-induced silencing complex (miRISC). De-regulated miRNA expression in the human cell may contribute to a diverse group of disorders including cancer, cardiovascular dysfunction, liver damage, immunological dysfunction, metabolic syndromes and pathogenic infections. Recent studies have revealed that miRNAs are indeed a pivotal component of host-pathogen interactions and host immune responses toward microorganisms. miRNA is emerging as a tool for genetic study, therapeutic development and diagnosis of human pathogenic infections caused by viruses, bacteria, parasites and fungi. Many pathogens can exploit the host miRNA system for their own benefit, such as surviving inside the host cell, replication, pathogenesis and bypassing some host immune barriers, while some express pathogen-encoded miRNAs inside the host that contribute to their replication, survival and/or latency. In this review, we discuss the role and significance of miRNA in relation to some pathogenic viruses.

    Glia-to-neuron transfer of miRNAs via extracellular vesicles: a new mechanism underlying inflammation-induced synaptic alterations

    Recent evidence indicates synaptic dysfunction as an early mechanism affected in neuroinflammatory diseases, such as multiple sclerosis, which are characterized by chronic microglia activation. However, the mode(s) of action of reactive microglia in causing synaptic defects are not fully understood. In this study, we show that inflammatory microglia produce extracellular vesicles (EVs) which are enriched in a set of miRNAs that regulate the expression of key synaptic proteins. Among them, miR-146a-5p, a microglia-specific miRNA not present in hippocampal neurons, controls the expression of presynaptic synaptotagmin1 (Syt1) and postsynaptic neuroligin1 (Nlg1), an adhesion protein which plays a crucial role in dendritic spine formation and synaptic stability. Using a Renilla-based sensor, we provide formal proof that inflammatory EVs transfer their miR-146a-5p cargo to neurons. By western blot and immunofluorescence analysis, we show that vesicular miR-146a-5p suppresses Syt1 and Nlg1 expression in receiving neurons. Microglia-to-neuron miR-146a-5p transfer and Syt1 and Nlg1 downregulation do not occur when EV–neuron contact is inhibited by cloaking vesicular phosphatidylserine residues, nor when neurons are exposed to EVs either depleted of miR-146a-5p (produced by pro-regenerative microglia) or storing inactive miR-146a-5p (produced by cells transfected with an anti-miR-146a-5p). Morphological analysis reveals that prolonged exposure to inflammatory EVs leads to a significant decrease in dendritic spine density in hippocampal neurons in vivo and in primary culture, which is rescued in vitro by transfection of a miR-insensitive Nlg1 form. Dendritic spine loss is accompanied by a decrease in the density and strength of excitatory synapses, as indicated by reduced mEPSC frequency and amplitude.
These findings link inflammatory microglia and enhanced EV production to the loss of excitatory synapses, uncovering a previously unrecognized role for microglia-enriched miRNAs, released in association with EVs, in the silencing of key synaptic genes.