Search CORE

13,014 research outputs found

Searching Data: A Review of Observational Data Retrieval Practices in Selected Disciplines

Author: Aloia N.
Beran B.
Borgman C.L.
Carlson J.
Fielding N.G.
Honor L.B.
Ingwersen P.
Maier D.
Meyer E.T.
Pasquetto I.V.
Zimmerman A.S.
Publication venue: 'Wiley'
Publication date: 03/04/2019
Field of study

A cross-disciplinary examination of the user behaviours involved in seeking and evaluating data is surprisingly absent from the research data discussion. This review explores the data retrieval literature to identify commonalities in how users search for and evaluate observational research data. Two analytical frameworks rooted in information retrieval and science technology studies are used to identify key similarities in practices as a first step toward developing a model describing data retrieval

arXiv.org e-Print Archive

Maastricht University Research Portal

Crossref

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Analysis and Synthesis of Metadata Goals for Scientific Data

Author: Bain
Baker
Blank
Bountouri
Bosch
Brazma
Bruce
Buschmann
Committee on Science Engineering, and Public Policy (US), and Committee on Ensuring the Utility and Integrity of Research Data in a Digital Age
Consultative Committee for Space Data Systems (CCSDS)
Duval
Frenkel
Garvey
Greenberg
Greenberg
Greenberg
Greenberg
Hall
Hall
Heidorn
Hey
Higgins
Hjørland
Hubenthal
Jones
Kelling
Klein
Krippendorff
Lide
Lim
Michener
Murray-Rust
National Science Foundation
NSF Task Force on Cyberlearning
Rayner
Ryssevik
Sommerville
Spellman
Spurgin
Stvilia
Westbrook
Westbrook
Zhang
Publication venue: Duke University School of Law
Publication date: 01/01/2012
Field of study

The proliferation of discipline-specific metadata schemes contributes to artificial barriers that can impede interdisciplinary and transdisciplinary research. The authors considered this problem by examining the domains, objectives, and architectures of nine metadata schemes used to document scientific data in the physical, life, and social sciences. They used a mixed-methods content analysis and Greenberg’s (2005) metadata objectives, principles, domains, and architectural layout (MODAL) framework, and derived 22 metadata-related goals from textual content describing each metadata scheme. Relationships are identified between the domains (e.g., scientific discipline and type of data) and the categories of scheme objectives. For each strong correlation (\u3e0.6), a Fisher’s exact test for nonparametric data was used to determine significance (p \u3c .05). Significant relationships were found between the domains and objectives of the schemes. Schemes describing observational data are more likely to have “scheme harmonization” (compatibility and interoperability with related schemes) as an objective; schemes with the objective “abstraction” (a conceptual model exists separate from the technical implementation) also have the objective “sufficiency” (the scheme defines a minimal amount of information to meet the needs of the community); and schemes with the objective “data publication” do not have the objective “element refinement.” The analysis indicates that many metadata-driven goals expressed by communities are independent of scientific discipline or the type of data, although they are constrained by historical community practices and workflows as well as the technological environment at the time of scheme creation. The analysis reveals 11 fundamental metadata goals for metadata documenting scientific data in support of sharing research data across disciplines and domains. The authors report these results and highlight the need for more metadata-related research, particularly in the context of recent funding agency policy changes

Publikationer från KTH

Crossref

Duke Law Scholarship Repository

Digitala Vetenskapliga Arkivet - Academic Archive On-line

espace@Curtin

Encyclopedia of software components

Author: Beckman Brian C.
Vanwarren Lloyd
Publication venue
Publication date
Field of study

Intelligent browsing through a collection of reusable software components is facilitated with a computer having a video monitor and a user input interface such as a keyboard or a mouse for transmitting user selections, by presenting a picture of encyclopedia volumes with respective visible labels referring to types of software, in accordance with a metaphor in which each volume includes a page having a list of general topics under the software type of the volume and pages having lists of software components for each one of the generic topics, altering the picture to open one of the volumes in response to an initial user selection specifying the one volume to display on the monitor a picture of the page thereof having the list of general topics and altering the picture to display the page thereof having a list of software components under one of the general topics in response to a next user selection specifying the one general topic, and then presenting a picture of a set of different informative plates depicting different types of information about one of the software components in response to a further user selection specifying the one component

NASA Technical Reports Server

Publishing Primary Data on the World Wide Web: Opencontext.org and an Open Future for the Past

Author: Eric C. Kansa
Publication venue: The Alexandria Archive Institute
Publication date: 04/04/2007
Field of study

More scholars are exploring forms of digital dissemination, including open access (OA) systems where content is made available free of charge. These include peer -reviewed e -journals as well as traditional journals that have an online presence. Besides SHA's Technical Briefs in Historical Archaeology, the American Journal of Archaeology now offers open access to downloadable articles from their printed issues. Similarly, Evolutionary Anthropology offers many full -text articles free for download. More archaeologists are also taking advantage of easy Web publication to post copies of their publications on personal websites. Roughly 15% of all scholars participate in such "self -archiving." To encourage this practice, Science Commons (2006) and the Scholarly Publishing and Academic Resources Coalition (SPARC) recently launched the Scholar Copyright Project, an initiative that will develop standard "Author Addenda" -- a suite of short amendments to attach to copyright agreements from publishers (http://sciencecommons. org/projects/publishing/index.html). These addenda make it easier for paper authors to retain and clarify their rights to self -archive their papers electronically. Several studies now clearly document that self -archiving and OA publication enhances uptake and citation rates (Hajjem et al. 2005). Researchers enhance their reputations and stature by opening up their scholarship.Mounting pressure for greater public access also comes from many research stakeholders. Granting foundations interested in maximizing the return on their investment in basic research are often encouraging and sometimes even requiring some form of OA electronic dissemination. Interest in maximizing public access to publicly financed research is catching on in Congress. A new bipartisan bill, the Federal Research Public Access Act, would require OA for drafts of papers that pass peer review and result from federally funded research (U.S. Congress 2006). The bill would create government -funded digital repositories that would host and maintain these draft papers. University libraries are some of the most vocal advocates for OA research. Current publishing frameworks have seen dramatically escalated costs, sometimes four times higher than the general rate of inflation (Create Change 2003). Increasing costs have forced many libraries to cancel subscriptions and thereby hurt access and scholarship (Association for College and Research Libraries 2003; Suber 2004).This article originally published in Technical Briefs In Historical Archaeology, 2007, 2: -11

IssueLab

Nanoinformatics: developing new computing applications for nanomedicine

Author: Alberto Anguita
Alejandro Pazos
Antoine Geissbuhler
B Smith
BY Kim
C Kulikowski
C Rosse
CA Kulikowski
Casimir Kulikowski
Cristian Munteanu
D Dela Iglesia
David Perez-Rey
DG Thomas
Diana De la Iglesia
ED Green
F Martin-Sanchez
Fernando Gonzalez-Nilo
Fernando Martin-Sanchez
Ferran Sanz
George Potamias
Guillermo De la Calle
Guillermo Lopez-Campos
H Berman
IS Kohane
Isabel Hermosilla
Jose Crespo
Jose Maria Barreiro
Josipa Kern
Joyce A. Mitchell
Julio C. Facelli
K Jain
Luciano Milanesi
M Gerstein
M Viceconti
Martin Fritts
Miguel Garcia-Remesal
N Gordon
NA Baker
Nathan Baker
Norbert Graf
P Kiberstis
Paula Otero
Peter Ghazal
Pierre Grangeat
Rada Hussein
Raul E. Cachau
RB Altman
S Bewick
Sabine Koch
SI O’Donoghue
Sonia E. Benitez
V Maojo
V Maojo
V Maojo
V Maojo
Vassilis Moustakis
Victor Maojo
Victoria Lopez-Alonso
Yannick Legre
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Nanoinformatics has recently emerged to address the need of computing applications at the nano level. In this regard, the authors have participated in various initiatives to identify its concepts, foundations and challenges. While nanomaterials open up the possibility for developing new devices in many industrial and scientific areas, they also offer breakthrough perspectives for the prevention, diagnosis and treatment of diseases. In this paper, we analyze the different aspects of nanoinformatics and suggest five research topics to help catalyze new research and development in the area, particularly focused on nanomedicine. We also encompass the use of informatics to further the biological and clinical applications of basic research in nanoscience and nanotechnology, and the related concept of an extended ?nanotype? to coalesce information related to nanoparticles. We suggest how nanoinformatics could accelerate developments in nanomedicine, similarly to what happened with the Human Genome and other -omics projects, on issues like exchanging modeling and simulation methods and tools, linking toxicity information to clinical and personal databases or developing new approaches for scientific ontologies, among many others

Repositorio da Universidade da Coruña

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Online Research @ Cardiff

Springer - Publisher Connector

DSpace Universidad de Talca

PubMed Central

Edinburgh Research Explorer

Archivo Digital UPM

Archive ouverte UNIGE

Advanced Knowledge Technologies at the Midterm: Tools and Methods for the Semantic Web

Author: Ciravegna Fabio
Domingue John
Hall Wendy
Motta Enrico
O'Hara Kieron
Robertson David
Shadbolt Nigel
Sleeman Derek
Tate Austin
Wilks Yorick
Publication venue: School of Electronics and Computer Science, University of Southampton
Publication date: 01/01/2004
Field of study

The University of Edinburgh and research sponsors are authorised to reproduce and distribute reprints and on-line copies for their purposes notwithstanding any copyright annotation hereon. The views and conclusions contained herein are the author’s and shouldn’t be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of other parties.In a celebrated essay on the new electronic media, Marshall McLuhan wrote in 1962:Our private senses are not closed systems but are endlessly translated into each other in that experience which we call consciousness. Our extended senses, tools, technologies, through the ages, have been closed systems incapable of interplay or collective awareness. Now, in the electric age, the very instantaneous nature of co-existence among our technological instruments has created a crisis quite new in human history. Our extended faculties and senses now constitute a single field of experience which demands that they become collectively conscious. Our technologies, like our private senses, now demand an interplay and ratio that makes rational co-existence possible. As long as our technologies were as slow as the wheel or the alphabet or money, the fact that they were separate, closed systems was socially and psychically supportable. This is not true now when sight and sound and movement are simultaneous and global in extent. (McLuhan 1962, p.5, emphasis in original)Over forty years later, the seamless interplay that McLuhan demanded between our technologies is still barely visible. McLuhan’s predictions of the spread, and increased importance, of electronic media have of course been borne out, and the worlds of business, science and knowledge storage and transfer have been revolutionised. Yet the integration of electronic systems as open systems remains in its infancy.Advanced Knowledge Technologies (AKT) aims to address this problem, to create a view of knowledge and its management across its lifecycle, to research and create the services and technologies that such unification will require. Half way through its sixyear span, the results are beginning to come through, and this paper will explore some of the services, technologies and methodologies that have been developed. We hope to give a sense in this paper of the potential for the next three years, to discuss the insights and lessons learnt in the first phase of the project, to articulate the challenges and issues that remain.The WWW provided the original context that made the AKT approach to knowledge management (KM) possible. AKT was initially proposed in 1999, it brought together an interdisciplinary consortium with the technological breadth and complementarity to create the conditions for a unified approach to knowledge across its lifecycle. The combination of this expertise, and the time and space afforded the consortium by the IRC structure, suggested the opportunity for a concerted effort to develop an approach to advanced knowledge technologies, based on the WWW as a basic infrastructure.The technological context of AKT altered for the better in the short period between the development of the proposal and the beginning of the project itself with the development of the semantic web (SW), which foresaw much more intelligent manipulation and querying of knowledge. The opportunities that the SW provided for e.g., more intelligent retrieval, put AKT in the centre of information technology innovation and knowledge management services; the AKT skill set would clearly be central for the exploitation of those opportunities.The SW, as an extension of the WWW, provides an interesting set of constraints to the knowledge management services AKT tries to provide. As a medium for the semantically-informed coordination of information, it has suggested a number of ways in which the objectives of AKT can be achieved, most obviously through the provision of knowledge management services delivered over the web as opposed to the creation and provision of technologies to manage knowledge.AKT is working on the assumption that many web services will be developed and provided for users. The KM problem in the near future will be one of deciding which services are needed and of coordinating them. Many of these services will be largely or entirely legacies of the WWW, and so the capabilities of the services will vary. As well as providing useful KM services in their own right, AKT will be aiming to exploit this opportunity, by reasoning over services, brokering between them, and providing essential meta-services for SW knowledge service management.Ontologies will be a crucial tool for the SW. The AKT consortium brings a lot of expertise on ontologies together, and ontologies were always going to be a key part of the strategy. All kinds of knowledge sharing and transfer activities will be mediated by ontologies, and ontology management will be an important enabling task. Different applications will need to cope with inconsistent ontologies, or with the problems that will follow the automatic creation of ontologies (e.g. merging of pre-existing ontologies to create a third). Ontology mapping, and the elimination of conflicts of reference, will be important tasks. All of these issues are discussed along with our proposed technologies.Similarly, specifications of tasks will be used for the deployment of knowledge services over the SW, but in general it cannot be expected that in the medium term there will be standards for task (or service) specifications. The brokering metaservices that are envisaged will have to deal with this heterogeneity.The emerging picture of the SW is one of great opportunity but it will not be a wellordered, certain or consistent environment. It will comprise many repositories of legacy data, outdated and inconsistent stores, and requirements for common understandings across divergent formalisms. There is clearly a role for standards to play to bring much of this context together; AKT is playing a significant role in these efforts. But standards take time to emerge, they take political power to enforce, and they have been known to stifle innovation (in the short term). AKT is keen to understand the balance between principled inference and statistical processing of web content. Logical inference on the Web is tough. Complex queries using traditional AI inference methods bring most distributed computer systems to their knees. Do we set up semantically well-behaved areas of the Web? Is any part of the Web in which semantic hygiene prevails interesting enough to reason in? These and many other questions need to be addressed if we are to provide effective knowledge technologies for our content on the web

Southampton (e-Prints Soton)

Edinburgh Research Archive

Recommended from our members

ICOPER Project - Deliverable 4.3 ISURE: Recommendations for extending effective reuse, embodied in the ICOPER CD&R

Author: Connolly Teresa
Klemke Roland
Okada Alexandra
Scott Peter
Publication venue: ICOPER
Publication date: 01/01/2011
Field of study

The purpose of this document is to capture the ideas and recommendations, within and beyond the ICOPER community, concerning the reuse of learning content, including appropriate methodologies as well as established strategies for remixing and repurposing reusable resources. The overall remit of this work focuses on describing the key issues that are related to extending effective reuse embodied in such materials. The objective of this investigation, is to support the reuse of learning content whilst considering how it could be originally created and then adapted with that ‘reuse’ in mind. In these circumstances a survey on effective reuse best practices can often provide an insight into the main challenges and benefits involved in the process of creating, remixing and repurposing what we are now designating as Reusable Learning Content (RLC). Several key issues are analysed in this report: Recommendations for extending effective reuse, building upon those described in the previous related deliverables 4.1 Content Development Methodologies and 4.2 Quality Control and Web 2.0 technologies. The findings of this current survey, however, provide further recommendations and strategies for using and developing this reusable learning content. In the spirit of ‘reuse’, this work also aims to serve as a foundation for the many different stakeholders and users within, and beyond, the ICOPER community who are interested in reusing learning resources. This report analyses a variety of information. Evidence has been gathered from a qualitative survey that has focused on the technical and pedagogical recommendations suggested by a Special Interest Group (SIG) on the most innovative practices with respect to new media content authors (for content authoring or modification) and course designers (for unit creation). This extended community includes a wider collection of OER specialists. This collected evidence, in the form of video and audio interviews, has also been represented as multimedia assets potentially helpful for learning and useful as learning content in the New Media Space (See section 4 for further details). Section 2 of this report introduces the concept of reusable learning content and reusability. Section 3 discusses an application created by the ICOPER community to enhance the opportunities for developing reusable content. Section 4 of this report provides an overview of the methodology used for the qualitative survey. Section 5 presents a summary of thematic findings. Section 6 highlights a list of recommendations for effective reuse of educational content, which were derived from thematic analysis described in Appendix A. Finally, section 7 summarises the key outcomes of this work

Open Research Online (The Open University)

A Linked Data Approach to Sharing Workflows and Workflow Results

Author: Bechhofer S
Margaria T
Marshall MS
Missier P
Newman DR
Roos M
Roure DD
Steffen B
Zhao J
Publication venue
Publication date: 01/01/2010
Field of study

A bioinformatics analysis pipeline is often highly elaborate, due to the inherent complexity of biological systems and the variety and size of datasets. A digital equivalent of the ‘Materials and Methods’ section in wet laboratory publications would be highly beneficial to bioinformatics, for evaluating evidence and examining data across related experiments, while introducing the potential to find associated resources and integrate them as data and services. We present initial steps towards preserving bioinformatics ‘materials and methods’ by exploiting the workflow paradigm for capturing the design of a data analysis pipeline, and RDF to link the workflow, its component services, run-time provenance, and a personalized biological interpretation of the results. An example shows the reproduction of the unique graph of an analysis procedure, its results, provenance, and personal interpretation of a text mining experiment. It links data from Taverna, myExperiment.org, BioCatalogue.org, and ConceptWiki.org. The approach is relatively ‘light-weight’ and unobtrusive to bioinformatics users

Southampton (e-Prints Soton)

Crossref

University of Birmingham Research Portal

Oxford University Research Archive

The University of Manchester - Institutional Repository

Encyclopedia of Software Components

Author: Beckman Brian C.
Warren Lloyd V.
Publication venue
Publication date: 20/05/1997
Field of study

NASA Technical Reports Server

LIFE: Costing the digital preservation lifecycle

Author: Ayris Paul
Davies Richard
McLeod Rory
Shenton Helen
Wheatley Paul
Publication venue
Publication date: 10/10/2007
Field of study

Having confidence in the permanence of a digital resource requires a deep understanding of the preservation activities that will need to be performed throughout its lifetime, and an ability to plan and resource for those activities. The LIFE (Lifecycle Information for E-Literature) Project1 has advanced understanding of the short and long-term costs in this complex area, facilitating better planning, comparison and evaluation of digital lifecycles. The LIFE Project created a digital lifecycle model based on previous work undertaken on the lifecycles of paper-based materials. It applied the model to real-life collections, modelling their lifecycles and studying their constituent processes. The results were then used to estimate the costs of each element of the digital lifecycle. Organisations can now apply this process, enabling evaluation and refinement of their existing lifecycles and facilitating more effective planning for the preservation of newly acquired content. Phase 2 of the LIFE Project began in February 2007. It is evaluating and refining the models and methodology developed in the first phase of the project and developing lifecycle costings for a range of further case studies

UCL Discovery