7,303 research outputs found

    Components and Interfaces of a Process Management System for Parallel Programs

    Full text link
    Parallel jobs are different from sequential jobs and require a different type of process management. We present here a process management system for parallel programs such as those written using MPI. A primary goal of the system, which we call MPD (for multipurpose daemon), is to be scalable. By this we mean that startup of interactive parallel jobs comprising thousands of processes is quick, that signals can be quickly delivered to processes, and that stdin, stdout, and stderr are managed intuitively. Our primary target is parallel machines made up of clusters of SMPs, but the system is also useful in more tightly integrated environments. We describe how MPD enables much faster startup and better runtime management of parallel jobs. We show how close control of stdio can support the easy implementation of a number of convenient system utilities, even a parallel debugger. We describe a simple but general interface that can be used to separate any process manager from a parallel library, which we use to keep MPD separate from MPICH.Comment: 12 pages, Workshop on Clusters and Computational Grids for Scientific Computing, Sept. 24-27, 2000, Le Chateau de Faverges de la Tour, Franc

    Using Pilot Systems to Execute Many Task Workloads on Supercomputers

    Full text link
    High performance computing systems have historically been designed to support applications comprised of mostly monolithic, single-job workloads. Pilot systems decouple workload specification, resource selection, and task execution via job placeholders and late-binding. Pilot systems help to satisfy the resource requirements of workloads comprised of multiple tasks. RADICAL-Pilot (RP) is a modular and extensible Python-based pilot system. In this paper we describe RP's design, architecture and implementation, and characterize its performance. RP is capable of spawning more than 100 tasks/second and supports the steady-state execution of up to 16K concurrent tasks. RP can be used stand-alone, as well as integrated with other application-level tools as a runtime system

    Political and institutional factors affecting systems engineering

    Get PDF
    External groups have a significant impact on NASA's programs. Ten groups affecting NASA are identified, and examples are given for some of the them. Methods of dealing with these external inputs are discussed, the most important being good and open two way communications and an objective attitude on the part of the NASA participants. The importance of planning ahead, of developing rapport with these groups, and of effective use of NASA contractors is covered. The need for an overall strategic plan for the U.S. space program is stressed

    Global Grids and Software Toolkits: A Study of Four Grid Middleware Technologies

    Full text link
    Grid is an infrastructure that involves the integrated and collaborative use of computers, networks, databases and scientific instruments owned and managed by multiple organizations. Grid applications often involve large amounts of data and/or computing resources that require secure resource sharing across organizational boundaries. This makes Grid application management and deployment a complex undertaking. Grid middlewares provide users with seamless computing ability and uniform access to resources in the heterogeneous Grid environment. Several software toolkits and systems have been developed, most of which are results of academic research projects, all over the world. This chapter will focus on four of these middlewares--UNICORE, Globus, Legion and Gridbus. It also presents our implementation of a resource broker for UNICORE as this functionality was not supported in it. A comparison of these systems on the basis of the architecture, implementation model and several other features is included.Comment: 19 pages, 10 figure

    Information support and interactive planning in the digital factory : approach and industry-driven evaluation

    Get PDF
    In the modern world we are continuously surrounded by information. The human brain has to analyse and interpret this information to transform into useable knowledge that is then used in decision making activities. The advent and implementation of Industry 4.0 will make it a requirement for systems within factories to interact and share large quantities of information with each other. This large volume of information will make it even more difficult for the human resources within the factory to sift through the large amount of information required since there is a limit to the information that our brains can cope with. Just in time information retrieval (JITIR) within the digital factory environment aims to provide support to the human stakeholders in the system by proactively yet non-intrusively providing the required information at the right time based on the users context. This paper will therefore provide an insight into the cognitive difficulties experienced by humans in the digital factory and how JITIR can tackle these challenges. By validating the JITIR concept, several industry scenarios have been evaluated: an exemplary model, concerning the machine tool industry, is presented in the paper. The results of this research are a set of guidelines for the development of a digital factory support tool.peer-reviewe

    Technology in work organisations

    Get PDF

    An Employment-Oriented Definition of the Information Systems Field: An Educator\u27s View

    Get PDF
    Defining information systems has been a longstanding problem for the field. This paper suggests that, since it may not be possible to develop a universal definition, consideration should be given to a plurality of definitions aligned toward specific purposes. As an implementation of this approach it recommends the following shorter definition for the purpose of education, which emphasizes topics that are being or will be taught to prepare students for employment in the field: Information systems is the field that prepares students to interface between non-technical organizational employees and managers and very technical IT professionals, with a focus on functions that are unlikely to be offshored. It includes general categories of information and communications technology use that currently and/or will employ substantial numbers of employees in organizations. The more detailed definition presented in the body of the paper extends this by identifying five broad subcategories that currently fit within the above definition
    • …
    corecore