3,037 research outputs found
Generalizable Resource Allocation in Stream Processing via Deep Reinforcement Learning
This paper considers the problem of resource allocation in stream processing,
where continuous data flows must be processed in real time in a large
distributed system. To maximize system throughput, the resource allocation
strategy that partitions the computation tasks of a stream processing graph
onto computing devices must simultaneously balance workload distribution and
minimize communication. Since this problem of graph partitioning is known to be
NP-complete yet crucial to practical streaming systems, many heuristic-based
algorithms have been developed to find reasonably good solutions. In this
paper, we present a graph-aware encoder-decoder framework to learn a
generalizable resource allocation strategy that can properly distribute
computation tasks of stream processing graphs unobserved from training data.
We, for the first time, propose to leverage graph embedding to learn the
structural information of the stream processing graphs. Jointly trained with
the graph-aware decoder using deep reinforcement learning, our approach can
effectively find optimized solutions for unseen graphs. Our experiments show
that the proposed model outperforms both METIS, a state-of-the-art graph
partitioning algorithm, and an LSTM-based encoder-decoder model, in about 70%
of the test cases.Comment: Accepted by AAAI 202
Usefulness of regional right ventricular and right atrial strain for prediction of early and late right ventricular failure following a left ventricular assist device implant: A machine learning approach
Background: Identifying candidates for left ventricular assist device surgery at risk of right ventricular failure remains difficult. The aim was to identify the most accurate predictors of right ventricular failure among clinical, biological, and imaging markers, assessed by agreement of different supervised machine learning algorithms. Methods: Seventy-four patients, referred to HeartWare left ventricular assist device since 2010 in two Italian centers, were recruited. Biomarkers, right ventricular standard, and strain echocardiography, as well as cath-lab measures, were compared among patients who did not develop right ventricular failure (N = 56), those with acute–right ventricular failure (N = 8, 11%) or chronic–right ventricular failure (N = 10, 14%). Logistic regression, penalized logistic regression, linear support vector machines, and naïve Bayes algorithms with leave-one-out validation were used to evaluate the efficiency of any combination of three collected variables in an “all-subsets” approach. Results: Michigan risk score combined with central venous pressure assessed invasively and apical longitudinal systolic strain of the right ventricular–free wall were the most significant predictors of acute–right ventricular failure (maximum receiver operating characteristic–area under the curve = 0.95, 95% confidence interval = 0.91–1.00, by the naïve Bayes), while the right ventricular–free wall systolic strain of the middle segment, right atrial strain (QRS-synced), and tricuspid annular plane systolic excursion were the most significant predictors of Chronic-RVF (receiver operating characteristic–area under the curve = 0.97, 95% confidence interval = 0.91–1.00, according to naïve Bayes). Conclusion: Apical right ventricular strain as well as right atrial strain provides complementary information, both critical to predict acute–right ventricular failure and chronic–right ventricular failure, respectively
Machine Learning for Fluid Mechanics
The field of fluid mechanics is rapidly advancing, driven by unprecedented
volumes of data from field measurements, experiments and large-scale
simulations at multiple spatiotemporal scales. Machine learning offers a wealth
of techniques to extract information from data that could be translated into
knowledge about the underlying fluid mechanics. Moreover, machine learning
algorithms can augment domain knowledge and automate tasks related to flow
control and optimization. This article presents an overview of past history,
current developments, and emerging opportunities of machine learning for fluid
mechanics. It outlines fundamental machine learning methodologies and discusses
their uses for understanding, modeling, optimizing, and controlling fluid
flows. The strengths and limitations of these methods are addressed from the
perspective of scientific inquiry that considers data as an inherent part of
modeling, experimentation, and simulation. Machine learning provides a powerful
information processing framework that can enrich, and possibly even transform,
current lines of fluid mechanics research and industrial applications.Comment: To appear in the Annual Reviews of Fluid Mechanics, 202
Learning Scheduling Algorithms for Data Processing Clusters
Efficiently scheduling data processing jobs on distributed compute clusters
requires complex algorithms. Current systems, however, use simple generalized
heuristics and ignore workload characteristics, since developing and tuning a
scheduling policy for each workload is infeasible. In this paper, we show that
modern machine learning techniques can generate highly-efficient policies
automatically. Decima uses reinforcement learning (RL) and neural networks to
learn workload-specific scheduling algorithms without any human instruction
beyond a high-level objective such as minimizing average job completion time.
Off-the-shelf RL techniques, however, cannot handle the complexity and scale of
the scheduling problem. To build Decima, we had to develop new representations
for jobs' dependency graphs, design scalable RL models, and invent RL training
methods for dealing with continuous stochastic job arrivals. Our prototype
integration with Spark on a 25-node cluster shows that Decima improves the
average job completion time over hand-tuned scheduling heuristics by at least
21%, achieving up to 2x improvement during periods of high cluster load
- …