2,401 research outputs found

    A Taxonomy of Data Grids for Distributed Data Sharing, Management and Processing

    Data Grids have been adopted as the platform for scientific communities that need to share, access, transport, process and manage large data collections distributed worldwide. They combine high-end computing technologies with high-performance networking and wide-area storage management techniques. In this paper, we discuss the key concepts behind Data Grids and compare them with other data sharing and distribution paradigms such as content delivery networks, peer-to-peer networks and distributed databases. We then provide comprehensive taxonomies that cover various aspects of architecture, data transportation, data replication, and resource allocation and scheduling. Finally, we map the proposed taxonomy to various Data Grid systems, not only to validate the taxonomy but also to identify areas for future exploration. Through this taxonomy, we aim to categorise existing systems to better understand their goals and methodology, which would help evaluate their applicability to similar problems. The taxonomy also provides a "gap analysis" of the area through which researchers can potentially identify new issues for investigation. We also hope that the proposed taxonomy and mapping give new practitioners an easy way to understand this complex area of research. Comment: 46 pages, 16 figures, Technical Repor

    A Multi-Criteria Meta-Fuzzy-Scheduler for Independent Tasks in Grid Computing

    Grid computing, the paradigm of distributed computation over heterogeneous resources, has given rise to a large amount of research on resource scheduling. This paper presents a Meta-Scheduler for grid computing that, unlike traditional dynamic heuristics, does not need any prior information about task length or task arrival time. Our Meta-Scheduler is of the multi-criteria type because it addresses two conflicting objectives: minimizing the makespan of a set of tasks and distributing these tasks in a balanced way among the resources of the Grid. Experimental results using a fuzzy scheduler show that our proposal achieves these two objectives and improves on the dynamic heuristics presented in prior literature.
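The two objectives named in this abstract can be made concrete with a small sketch. It is not the paper's Meta-Scheduler or its fuzzy logic: it only evaluates a given task-to-resource assignment. The function names, the use of relative resource speeds, and the choice of the coefficient of variation as the balance measure are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def completion_times(assignment, task_lengths, speeds):
    """Return the total busy time of each resource.

    assignment[i] is the index of the resource that runs task i.
    task_lengths and speeds are hypothetical inputs (work units and
    work units per time unit); the abstract does not define them.
    """
    times = [0.0] * len(speeds)
    for task, resource in enumerate(assignment):
        times[resource] += task_lengths[task] / speeds[resource]
    return times


def makespan(times):
    """Makespan: the time at which the last resource finishes."""
    return max(times)


def imbalance(times):
    """Assumed balance measure: coefficient of variation of busy times.

    0.0 means perfectly balanced; larger values mean a more uneven load.
    """
    avg = mean(times)
    return pstdev(times) / avg if avg > 0 else 0.0


if __name__ == "__main__":
    lengths = [4, 2, 6, 8]   # illustrative task lengths
    speeds = [1, 2]          # resource 1 is twice as fast as resource 0
    for plan in ([0, 1, 0, 1], [0, 1, 1, 1]):
        t = completion_times(plan, lengths, speeds)
        print(plan, "makespan:", makespan(t), "imbalance:", round(imbalance(t), 3))
```

A multi-criteria scheduler would have to weigh both numbers, since an assignment that shortens the makespan does not necessarily spread the load more evenly.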

    Resource Management for Data Intensive Tasks on Grids
