10 research outputs found

    An architecture for distributed Grid brokering

    No full text
    Abstract. Computational resource brokering on the Grid is the process of discovering what systems are capable of running a job, obtaining estimates for when that job may run and how much it will cost, and submitting the job to the system that best meets the users' requirements. This paper identifies how resource brokers differ from superschedulers, and describes a resource brokering architecture which is adapted to the emergent structure of the large-scale Grid. We outline the architecture of the UNICORE resource broker which is the basis of our prototype implementation, and discusses both how the existing UNICORE architecture is relevant to the wider brokering picture and what will be done in the future to bring them into closer alignment

    Identifiers for the 21st century: How to design, provision, and reuse persistent identifiers to maximize utility and impact of life science data.

    No full text
    In many disciplines, data are highly decentralized across thousands of online databases (repositories, registries, and knowledgebases). Wringing value from such databases depends on the discipline of data science and on the humble bricks and mortar that make integration possible; identifiers are a core component of this integration infrastructure. Drawing on our experience and on work by other groups, we outline 10 lessons we have learned about the identifier qualities and best practices that facilitate large-scale data integration. Specifically, we propose actions that identifier practitioners (database providers) should take in the design, provision and reuse of identifiers. We also outline the important considerations for those referencing identifiers in various circumstances, including by authors and data generators. While the importance and relevance of each lesson will vary by context, there is a need for increased awareness about how to avoid and manage common identifier problems, especially those related to persistence and web-accessibility/resolvability. We focus strongly on web-based identifiers in the life sciences; however, the principles are broadly relevant to other disciplines

    Anatomy of a web-based identifier.

    No full text
    <p>An example of an exemplary unique resource identifier (URI) is below; it is comprised of American Standard Code for Information Interchange (ASCII) characters and follows a pattern that starts with a fixed set of characters (URI pattern). That URI pattern is followed by a local identifier (local ID)—an identifier which, by itself, is only guaranteed to be locally unique within the database or source. A local ID is sometimes referred to as an “accession.” Note this figure illustrates the simplest representation; nuances regarding versioning are covered in Lesson 6 and <a href="http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.2001414#pbio.2001414.g005" target="_blank">Fig 5</a>.</p

    Contributions and roles related to content as they correspond to identifier creation versus identifier reuse.

    No full text
    <p>The decision about whether to create a new identifier or reuse an existing one depends on the role you play in the creation, editing, and republishing of content; for certain roles (and when several roles apply) that decision is a judgement call. Asterisks convey cases in which the best course of action is often to correct/improve the original record in collaboration with the original source; the guidance about identifier creation versus reuse is meant to apply only when such collaboration is not practicable (and an alternate record is created). It is common that a given actor may have multiple roles along this spectrum; for instance, a given record in monarchinitiative.org may reflect a combination of (a) corrections Monarch staff made in collaboration with the original data source, (b) post-ingest curation by Monarch staff, (c) expanded content integrated from multiple sources.</p
    corecore