20,143 research outputs found
Data curation standards and social science occupational information resources
Occupational information resources - data about the characteristics of different occupational positions - are widely used in the social sciences, across a range of disciplines and international contexts. They are available in many formats, most often constituting small electronic files that are made freely downloadable from academic web-pages. However there are several challenges associated with how occupational information resources are distributed to, and exploited by, social researchers. In this paper we describe features of occupational information resources, and indicate the role digital curation can play in exploiting them. We report upon the strategies used in the GEODE research project (Grid Enabled Occupational Data Environment, http://www.geode.stir.ac.uk). This project attempts to develop long-term standards for the distribution of occupational information resources, by providing a standardized framework-based electronic depository for occupational information resources, and by providing a data indexing service, based on e-Science middleware, which collates occupational information resources and makes them readily accessible to non-specialist social scientists
Libraries and the management of research data
A discussion of the role of university libraries in the management of digital research data outputs. Reviews some of the recent history of progress in this area from a UK perspective, with reference to international developments
Towards Exascale Scientific Metadata Management
Advances in technology and computing hardware are enabling scientists from
all areas of science to produce massive amounts of data using large-scale
simulations or observational facilities. In this era of data deluge, effective
coordination between the data production and the analysis phases hinges on the
availability of metadata that describe the scientific datasets. Existing
workflow engines have been capturing a limited form of metadata to provide
provenance information about the identity and lineage of the data. However,
much of the data produced by simulations, experiments, and analyses still need
to be annotated manually in an ad hoc manner by domain scientists. Systematic
and transparent acquisition of rich metadata becomes a crucial prerequisite to
sustain and accelerate the pace of scientific innovation. Yet, ubiquitous and
domain-agnostic metadata management infrastructure that can meet the demands of
extreme-scale science is notable by its absence.
To address this gap in scientific data management research and practice, we
present our vision for an integrated approach that (1) automatically captures
and manipulates information-rich metadata while the data is being produced or
analyzed and (2) stores metadata within each dataset to permeate
metadata-oblivious processes and to query metadata through established and
standardized data access interfaces. We motivate the need for the proposed
integrated approach using applications from plasma physics, climate modeling
and neuroscience, and then discuss research challenges and possible solutions
Development of a pilot data management infrastructure for biomedical researchers at University of Manchester – approach, findings, challenges and outlook of the MaDAM Project
Management and curation of digital data has been becoming ever more important in a higher education and research environment characterised by large and complex data, demand for more interdisciplinary and collaborative work, extended funder requirements and use of e-infrastructures to facilitate new research methods and paradigms. This paper presents the approach, technical infrastructure, findings, challenges and outlook (including future development within the successor project, MiSS) of the ‘MaDAM: Pilot data management infrastructure for biomedical researchers at University of Manchester’ project funded under the infrastructure strand of the JISC Managing Research Data (JISCMRD) programme. MaDAM developed a pilot research data management solution at the University of Manchester based on biomedical researchers’ requirements, which includes technical and governance components with the flexibility to meet future needs across multiple research groups and disciplines
Curating E-Mails; A life-cycle approach to the management and preservation of e-mail messages
E-mail forms the backbone of communications in many modern institutions and organisations and is a valuable type of organisational, cultural, and historical record. Successful management and preservation of valuable e-mail messages and collections is therefore vital if organisational accountability is to be achieved and historical or cultural memory retained for the future. This requires attention by all stakeholders across the entire life-cycle of the e-mail records.
This instalment of the Digital Curation Manual reports on the several issues involved in managing and curating e-mail messages for both current and future use. Although there is no 'one-size-fits-all' solution, this instalment outlines a generic framework for e-mail curation and preservation, provides a summary of current approaches, and addresses the technical, organisational and cultural challenges to successful e-mail management and longer-term curation.
Open research data: Report to the Australian National Data Service (ANDS)
Main points
Research data are an asset we have been building for decades, through billions of dollars of public investment in research annually. The information and communication technology (ICT) revolution presents an unprecedented opportunity to ‘leverage’ that asset. Given this, there is increasing awareness around the world that there are benefits to be gained from curating and openly sharing research data (Kvalheim and Kvamme 2014).
Conservatively, we estimate that the value of data in Australia’s public research to be at least 6 billion a year at current levels of expenditure and activity. Research data curation and sharing might be worth at least 5.5 billion a year, of which perhaps 4.9 billion annually is yet to be realized. Hence, any policy around publicly-funded research data should aim to realise as much of this unrealised value as practicable.
Aims and scope
This study offers conservative estimates of the value and benefits to Australia of making publicly-funded research data freely available, and examines the role and contribution of data repositories and associated infrastructure. It also explores the policy settings required to optimise research data sharing, and thereby increase the return on public investment in research. The study’s focus is Australia’s Commonwealth-funded research and agencies. It includes research commissioned or funded by Commonwealth bodies as well as in-house research within research-oriented agencies wholly or largely funded by the Commonwealth. Government data or public sector information is a separate category of publicly-funded data – although there is some overlap at the margins (e.g. Commonwealth Government funding for Geoscience Australia).
Main findings
For the purposes of estimation, we explore a range of research funding and expenditure from total Australian Government funding support for research to the sum of government and higher education expenditure on research by sector of execution. The lower bound estimates are based on the labour-cost share of research funding and expenditure (6.4 billion per annum), and upper bound estimates on total research funding and expenditure (13.3 billion per annum)
SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long Term Data Preservation
Major research universities are grappling with their response to the deluge of scientific data emerging through research by their faculty. Many are looking to their libraries and the institutional repository as a solution. Scientific data introduces substantial challenges that the document-based institutional repository may not be suited to deal with. The Sustainable Environment - Actionable Data (SEAD) Virtual Archive specifically addresses the challenges of “long tail” scientific data. In this paper, we propose requirements, policy and architecture to support not only the preservation of scientific data today using institutional repositories, but also its rich access and use into the future
Data curation standards and the messy world of social science occupational information resources
Occupational information resources – data about the characteristics of different occupational positions – play a unique role in social science research. They are of relevance across diverse research disciplines and in numerous disparate contexts. They are also very widely available, typically freely downloadable from research-oriented academic web-pages. But they are also one of the most uncoordinated types of information resource that social scientists routinely come across. In this paper we describe issues in curating occupational information resources during the GEODE research project (Grid Enabled Occupational Data Environment, http:/www.geode.stir.ac.uk). This project attempts to develop long-term standards for the distribution of occupational information resources, by providing a standardised framework electronic depository for occupational information resources, and by providing a data-indexing service, premised upon eScience middleware, which collates occupational information resources and makes them readily accessible to non-specialist social scientists
Institutional Challenges in the Data Decade
Throughout the year, the DCC stages regional data management roadshows to present best practice and showcase new tools and resources. This article reports on the second roadshow, organised in conjunction with the White Rose University Consortium and held on 1-3 March 2011 at the University of Sheffield.
The goal for Day 1 was to describe the emerging trends and challenges associated with research data management and their potential impact on higher education institutions, and to introduce the Digital Curation Centre (DCC) and its role in supporting research data management. This was achieved through a substantial morning presentation followed by an afternoon of illustrative case studies at both disciplinary and institutional levels, highlighting different models, approaches and working practice. Day 2 was aimed at those in senior management roles and looked at strategic and policy implementation objectives. The Day 3 workshop explored data management requirements from the perspective of the institution and the main UK funding bodies, the different roles and responsibilities involved in effective data management and provided an introduction to data management planning. The portfolio of DCC resources, tools and services was explored in greater detail.
The roadshow provided delegates with advice and guidance to support institutional Research Data Management and has helped to facilitate regional networking and the exchange of skills and experience
- …