68,147 research outputs found
Resource and Application Models for Advanced Grid Schedulers
As Grid computing is becoming an inevitable future, managing, scheduling and monitoring dynamic, heterogeneous resources will present new challenges. Solutions will have to be agile and adaptive, support self-organization and autonomous management, while maintaining optimal resource utilisation. Presented in this paper are basic principles and architectural concepts for efficient resource allocation in heterogeneous Grid environment
Introducing risk management into the grid
Service Level Agreements (SLAs) are explicit statements about all expectations and obligations in the business partnership between customers and providers. They have been introduced in Grid computing to overcome the best effort approach, making the Grid more interesting for commercial applications. However, decisions on negotiation and system management still rely on static approaches, not reflecting the risk linked with decisions. The EC-funded project "AssessGrid" aims at introducing risk assessment and management as a novel decision paradigm into Grid computing. This paper gives a general motivation for risk management and presents the envisaged architecture of a "risk-aware" Grid middleware and Grid fabric, highlighting its functionality by means of three showcase scenarios
Enabling Adaptive Grid Scheduling and Resource Management
Wider adoption of the Grid concept has led to an increasing amount of federated
computational, storage and visualisation resources being available to scientists and
researchers. Distributed and heterogeneous nature of these resources renders most of the
legacy cluster monitoring and management approaches inappropriate, and poses new
challenges in workflow scheduling on such systems. Effective resource utilisation monitoring
and highly granular yet adaptive measurements are prerequisites for a more efficient Grid
scheduler. We present a suite of measurement applications able to monitor per-process
resource utilisation, and a customisable tool for emulating observed utilisation models. We
also outline our future work on a predictive and probabilistic Grid scheduler. The research is
undertaken as part of UK e-Science EPSRC sponsored project SO-GRM (Self-Organising
Grid Resource Management) in cooperation with BT
Quantifying Operational Constraints of Low-Latency Telerobotics for Planetary Surface Operations
NASA's SLS and Orion crew vehicle will launch humans to cislunar space to
begin the new era of space exploration. NASA plans to use the Orion crew
vehicle to transport humans between Earth and cislunar space where there will
be a stationed habitat known as the Deep Space Gateway (DSG). The proximity to
the lunar surface allows for direct communication between the DSG and surface
assets, which enables low-latency telerobotic exploration. The operational
constraints for telerobotics must be fully explored on Earth before being
utilized on space exploration missions. We identified two constraints on space
exploration using low-latency surface telerobotics and attempts to quantify
these constraints. A constraint associated with low-latency surface
telerobotics is the bandwidth available between the orbiting command station
and the ground assets. The bandwidth available will vary during operation. As a
result, it is critical to quantify the operational video conditions required
for effective exploration. We designed an experiment to quantify the threshold
frame rate required for effective exploration. The experiment simulated
geological exploration via low-latency surface telerobotics using a COTS rover
in a lunar analog environment. The results from this experiment indicate that
humans should operate above a threshold frame rate of 5 frames per second. In a
separate, but similar experiment, we introduced a 2.6 second delay in the video
system. This delay recreated the latency conditions present when operating
rovers on the lunar farside from an Earth-based command station. This time
delay was compared to low-latency conditions for teleoperation at the DSG
(0.4 seconds). The results from this experiment show a 150% increase in
exploration time when the latency is increased to 2.6 seconds. This indicates
that such a delay significantly complicates real-time exploration strategies.Comment: 10 pages, 8 figures, Proceedings of the IEEE Aerospace Conference,
Big Sky, MT. arXiv admin note: text overlap with arXiv:1706.0375
Managing Uncertainty: A Case for Probabilistic Grid Scheduling
The Grid technology is evolving into a global, service-orientated
architecture, a universal platform for delivering future high demand
computational services. Strong adoption of the Grid and the utility computing
concept is leading to an increasing number of Grid installations running a wide
range of applications of different size and complexity. In this paper we
address the problem of elivering deadline/economy based scheduling in a
heterogeneous application environment using statistical properties of job
historical executions and its associated meta-data. This approach is motivated
by a study of six-month computational load generated by Grid applications in a
multi-purpose Grid cluster serving a community of twenty e-Science projects.
The observed job statistics, resource utilisation and user behaviour is
discussed in the context of management approaches and models most suitable for
supporting a probabilistic and autonomous scheduling architecture
- …