
    Analysis and Synthesis of Metadata Goals for Scientific Data

    The proliferation of discipline-specific metadata schemes contributes to artificial barriers that can impede interdisciplinary and transdisciplinary research. The authors considered this problem by examining the domains, objectives, and architectures of nine metadata schemes used to document scientific data in the physical, life, and social sciences. They used a mixed-methods content analysis and Greenberg’s (2005) metadata objectives, principles, domains, and architectural layout (MODAL) framework, and derived 22 metadata-related goals from textual content describing each metadata scheme. Relationships are identified between the domains (e.g., scientific discipline and type of data) and the categories of scheme objectives. For each strong correlation (>0.6), a Fisher’s exact test for nonparametric data was used to determine significance (p < .05). Significant relationships were found between the domains and objectives of the schemes. Schemes describing observational data are more likely to have “scheme harmonization” (compatibility and interoperability with related schemes) as an objective; schemes with the objective “abstraction” (a conceptual model exists separate from the technical implementation) also have the objective “sufficiency” (the scheme defines a minimal amount of information to meet the needs of the community); and schemes with the objective “data publication” do not have the objective “element refinement.” The analysis indicates that many metadata-driven goals expressed by communities are independent of scientific discipline or the type of data, although they are constrained by historical community practices and workflows as well as the technological environment at the time of scheme creation. The analysis reveals 11 fundamental metadata goals for metadata documenting scientific data in support of sharing research data across disciplines and domains. The authors report these results and highlight the need for more metadata-related research, particularly in the context of recent funding agency policy changes.
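    As a concrete illustration of the statistical test applied, the following Python sketch runs a Fisher’s exact test on a 2x2 contingency table relating a scheme domain to a scheme objective. The counts are hypothetical and not taken from the study’s nine schemes.

        # Minimal sketch of the paper's significance test on a hypothetical
        # 2x2 contingency table (counts are NOT from the study).
        from scipy.stats import fisher_exact

        # Rows: scheme documents observational data (yes / no)
        # Columns: "scheme harmonization" listed as an objective (yes / no)
        table = [[4, 1],   # observational: 4 with the objective, 1 without
                 [1, 3]]   # non-observational: 1 with, 3 without

        odds_ratio, p_value = fisher_exact(table, alternative="two-sided")
        print(f"odds ratio = {odds_ratio:.2f}, p = {p_value:.3f}")
        # The paper treats a relationship as significant when p < .05.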

    A Molecular Biology Database Digest

    Computational Biology or Bioinformatics has been defined as the application of mathematical and Computer Science methods to solving problems in Molecular Biology that require large scale data, computation, and analysis [18]. As expected, Molecular Biology databases play an essential role in Computational Biology research and development. This paper gives an introduction to current Molecular Biology databases, stressing data modeling, data acquisition, data retrieval, and the integration of Molecular Biology data from different sources. This paper is primarily intended for an audience of computer scientists with a limited background in Biology.
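    As one illustration of the data retrieval the paper surveys, the Python sketch below fetches a record from a Molecular Biology database through NCBI's Entrez E-utilities via Biopython; the choice of database and accession is ours, not the paper's.

        # Hypothetical example: retrieve one GenBank record via Biopython's
        # Entrez interface (accession NM_000546 is the TP53 mRNA entry).
        from Bio import Entrez, SeqIO

        Entrez.email = "you@example.org"  # NCBI requires a contact address

        with Entrez.efetch(db="nucleotide", id="NM_000546",
                           rettype="gb", retmode="text") as handle:
            record = SeqIO.read(handle, "genbank")

        print(record.id, record.description)
        print(len(record.seq), "bp")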

    Topic Maps as a Virtual Observatory tool

    One major component of the VO will be catalogs measuring gigabytes and terabytes, if not more. Some mechanism like XML will be used for structuring the information. However, such mechanisms are not good for information retrieval on their own. For retrieval we use queries. Topic Maps, which have recently become popular, are excellent for segregating information that results from a query. A Topic Map is a structured network of hyperlinks above an information pool. Different Topic Maps can form different layers above the same information pool and provide us with different views of it. This makes it possible to ask precise questions, helping us look for gold needles in the proverbial haystack. Here we discuss the specifics of what Topic Maps are and how they can be implemented within the VO framework. URL: http://www.astro.caltech.edu/~aam/science/topicmaps/
    Comment: 11 pages, 5 eps figures, to appear in SPIE Annual Meeting 2001 proceedings (Astronomical Data Analysis), uses spie.st
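    The layered-views idea is easy to sketch in Python: independent topic layers sit above one shared information pool, and each layer resolves its topics to occurrences in the pool. All names and records below are hypothetical.

        # Shared information pool: resource id -> catalog entry (hypothetical)
        pool = {
            "obj1": "NGC 224 optical catalog record",
            "obj2": "M 31 radio survey entry",
        }

        # One Topic Map layer: topics point at occurrences in the pool;
        # associations relate topics to one another.
        galaxy_layer = {
            "topics": {"M31": ["obj1", "obj2"], "NGC224": ["obj1"]},
            "associations": [("M31", "same-object-as", "NGC224")],
        }

        # A second, independent layer over the same pool gives another view.
        survey_layer = {
            "topics": {"radio-survey": ["obj2"]},
            "associations": [],
        }

        def occurrences(layer, topic):
            """Resolve a topic in a layer to the underlying resources."""
            return [pool[r] for r in layer["topics"].get(topic, [])]

        print(occurrences(galaxy_layer, "M31"))
        print(occurrences(survey_layer, "radio-survey"))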

    Stabilizing knowledge through standards - A perspective for the humanities

    Standards are usually thought to generate mixed feelings among scientists. They are often seen as not really reflecting the state of the art in a given domain and as a hindrance to scientific creativity. Still, scientists should in theory be in the best position to bring their expertise into standards development, being all the more neutral on issues that may typically relate to competing industrial interests. Even if developing standards in the humanities could be thought of as more complex still, we will show how it can be made feasible through the experience gained both within the Text Encoding Initiative consortium and the International Organisation for Standardisation. By taking the specific case of lexical resources, we will try to show how this brings about new ideas for designing future research infrastructures in the human and social sciences.

    EDI and intelligent agents integration to manage food chains

    Electronic Data Interchange (EDI) is a type of inter-organizational information system which permits the automatic and structured communication of data between organizations. Although EDI is used for internal communication, its main application is in facilitating closer collaboration between organizational entities, e.g. suppliers, credit institutions, and transportation carriers. This study illustrates how agent technology can be used to solve real food supply chain inefficiencies and optimise the logistics network. For instance, we explain how agribusiness companies can use agent technology in association with EDI to collect data from retailers, group them into meaningful categories, and then perform different functions. As a result, the distribution chain can be managed more efficiently. Intelligent agents also make timely data available to inventory management, reducing stocks and tied-up capital. Intelligent agents are adaptive to changes, so they are valuable in a dynamic environment where new products or partners enter the supply chain. This flexibility gives agent technology a relative advantage which, for pioneer companies, can be a competitive advantage. The study concludes with recommendations and directions for further research.
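    The collect-and-group step described above can be sketched as follows; the record fields, retailers, and categories are hypothetical, and a production agent would parse real EDI messages rather than tuples.

        # Hypothetical agent step: aggregate EDI-derived sales records from
        # retailers into per-category demand for inventory management.
        from collections import defaultdict

        # (retailer, product_category, units_sold) extracted from EDI messages
        records = [
            ("retailer_A", "dairy", 120),
            ("retailer_B", "dairy", 80),
            ("retailer_A", "produce", 45),
        ]

        def group_by_category(records):
            """Sum units sold per product category across all retailers."""
            totals = defaultdict(int)
            for _retailer, category, units in records:
                totals[category] += units
            return dict(totals)

        print(group_by_category(records))  # {'dairy': 200, 'produce': 45}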

    Querying Large Physics Data Sets Over an Information Grid

    Optimising use of the Web (WWW) for LHC data analysis is a complex problem and illustrates the challenges arising from the integration of, and computation across, massive amounts of information distributed worldwide. Finding the right piece of information can, at times, be extremely time-consuming, if not impossible. So-called Grids have been proposed to facilitate LHC computing, and many groups have embarked on studies of data replication, data migration, and networking philosophies. Other aspects, such as the role of 'middleware' for Grids, are emerging as requiring research. This paper argues the need for appropriate middleware that enables users to resolve physics queries across massive data sets. It identifies the role of meta-data for query resolution and the importance of Information Grids for high-energy physics analysis, rather than just Computational or Data Grids. This paper identifies software that is being implemented at CERN to enable the querying of very large collaborating HEP data-sets, initially being employed for the construction of CMS detectors.
    Comment: 4 pages, 3 figures
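    A minimal sketch of metadata-driven query resolution on an Information Grid: a catalogue of dataset descriptions is consulted first, so that only the remote data sets whose meta-data match are queried. The site names and metadata fields are hypothetical and do not describe the CERN software itself.

        # Hypothetical metadata catalogue: which site holds which runs.
        catalogue = [
            {"site": "site-a", "experiment": "CMS", "runs": (100, 199)},
            {"site": "site-b", "experiment": "CMS", "runs": (200, 299)},
            {"site": "site-c", "experiment": "ATLAS", "runs": (100, 399)},
        ]

        def resolve(experiment, run):
            """Return the sites whose meta-data say they can answer the query."""
            return [e["site"] for e in catalogue
                    if e["experiment"] == experiment
                    and e["runs"][0] <= run <= e["runs"][1]]

        # Route the physics query only to the relevant replicas.
        print(resolve("CMS", 150))  # ['site-a']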