16,229 research outputs found
Searching Data: A Review of Observational Data Retrieval Practices in Selected Disciplines
A cross-disciplinary examination of the user behaviours involved in seeking
and evaluating data is surprisingly absent from the research data discussion.
This review explores the data retrieval literature to identify commonalities in
how users search for and evaluate observational research data. Two analytical
frameworks rooted in information retrieval and science technology studies are
used to identify key similarities in practices as a first step toward
developing a model describing data retrieval
Enforcing public data archiving policies in academic publishing: A study of ecology journals
To improve the quality and efficiency of research, groups within the
scientific community seek to exploit the value of data sharing. Funders,
institutions, and specialist organizations are developing and implementing
strategies to encourage or mandate data sharing within and across disciplines,
with varying degrees of success. Academic journals in ecology and evolution
have adopted several types of public data archiving policies requiring authors
to make data underlying scholarly manuscripts freely available. Yet anecdotes
from the community and studies evaluating data availability suggest that these
policies have not obtained the desired effects, both in terms of quantity and
quality of available datasets. We conducted a qualitative, interview-based
study with journal editorial staff and other stakeholders in the academic
publishing process to examine how journals enforce data archiving policies. We
specifically sought to establish who editors and other stakeholders perceive as
responsible for ensuring data completeness and quality in the peer review
process. Our analysis revealed little consensus with regard to how data
archiving policies should be enforced and who should hold authors accountable
for dataset submissions. Themes in interviewee responses included hopefulness
that reviewers would take the initiative to review datasets and trust in
authors to ensure the completeness and quality of their datasets. We highlight
problematic aspects of these thematic responses and offer potential starting
points for improvement of the public data archiving process.Comment: 35 pages, 1 figure, 1 tabl
Recommended from our members
UK Research Information Shared Service (UKRISS) Final Report, July 2014
The reporting of research information is a complex and expensive activity for research organisations (ROs). There is little alignment between funders of the reporting requests made to institutions and requests made to individual researchers about their research outputs and outcomes. This inevitably results in duplication and increased costs across the sector, whilst limiting the potential sharing and reuse of the information. The UK Research Information Shared Service (UKRISS) project conducted a feasibility and scoping study for the reporting of research information at a national level based on CERIF (Common European Research Information Format), with the objective of increasing efficiency, productivity and quality across the sector. The aim was to define and prototype solutions which are compelling, easy to use, have a low entry barrier, and support innovative information sharing and benchmarking. CERIF has emerged as the preferred format for expressing research information across Europe. To date, CERIF has been piloted for specific applications, but not as a format for reporting requirements across all UK ROs. The final report presents the work carried out by the UKRISS project, including requirements gathering, modelling and prototyping, as well as recommendation for sustainability. UKRISS was divided into two phases. Phase 1, mapping the reporting landscape, ran from March 2012 to December 2012. Phase 2, exploring delivery of potential solutions, began in February 2013 and ended in December 2013
Requirements for Provenance on the Web
From where did this tweet originate? Was this quote from the New York Times modified? Daily, we rely on data from the Web but often it is difficult or impossible to determine where it came from or how it was produced. This lack of provenance is particularly evident when people and systems deal with Web information or with any environment where information comes from sources of varying quality. Provenance is not captured pervasively in information systems. There are major technical, social, and economic impediments that stand in the way of using provenance effectively. This paper synthesizes requirements for provenance on the Web for a number of dimensions focusing on three key aspects of provenance: the content of provenance, the management of provenance records, and the uses of provenance information. To illustrate these requirements, we use three synthesized scenarios that encompass provenance problems faced by Web users toda
Framework for Prioritization of Open Data Publication: An Application to Smart Cities
Public Sector Information is considered to play a fundamental role in the growth of the knowledge economy and improvements in society. Given the difficulty in publishing and maintaining all available data, due to budget constraints, institutions need to select which data to publish, giving priority to data most likely to generate social and economic impact. Priority of publication could become an even more significant problem in Smart Cities: as huge amounts of information are generated from different domains, the way data is prioritized and thus reused, could be a determining factor in promoting, among others, new and sustainable business opportunities for local entrepreneurs, and to improve citizen quality of life. However, people in charge of prioritizing which data to publish through open data portals (such as Chief Data Officers, or CDOs) do not have available any specific support in their decision-making process. In this work, a proposal of a framework for prioritization of open data publication as well as its application to Smart Cities is presented. This specific application of the framework relies on OSS (Open Source Software) indicators to help making decisions on the most relevant data to publish focused on developers and businesses operating within the Smart City context.This work was funded by (i) Ministerio de Economía e Innovación (Spain) TIN2015-69957-R (MINECO/ERDF, EU) project and TIN2016-78103-C2-2-R (MINECO/ERDF, EU) project, (ii) POCTEP 4IE project (0045-4IE-4-P), and (iii) Consejería de Economía e Infraestructuras/Junta de Extremadura (Spain) - European Regional Development Fund (ERDF)- GR18112 project and IB16055 project
Open Data, Grey Data, and Stewardship: Universities at the Privacy Frontier
As universities recognize the inherent value in the data they collect and
hold, they encounter unforeseen challenges in stewarding those data in ways
that balance accountability, transparency, and protection of privacy, academic
freedom, and intellectual property. Two parallel developments in academic data
collection are converging: (1) open access requirements, whereby researchers
must provide access to their data as a condition of obtaining grant funding or
publishing results in journals; and (2) the vast accumulation of 'grey data'
about individuals in their daily activities of research, teaching, learning,
services, and administration. The boundaries between research and grey data are
blurring, making it more difficult to assess the risks and responsibilities
associated with any data collection. Many sets of data, both research and grey,
fall outside privacy regulations such as HIPAA, FERPA, and PII. Universities
are exploiting these data for research, learning analytics, faculty evaluation,
strategic decisions, and other sensitive matters. Commercial entities are
besieging universities with requests for access to data or for partnerships to
mine them. The privacy frontier facing research universities spans open access
practices, uses and misuses of data, public records requests, cyber risk, and
curating data for privacy protection. This paper explores the competing values
inherent in data stewardship and makes recommendations for practice, drawing on
the pioneering work of the University of California in privacy and information
security, data governance, and cyber risk.Comment: Final published version, Sept 30, 201
Why Geospatial Linked Open Data for Smart Mobility?
While the concept of Smart Cities is gaining momentum around the world and government data are increasingly available and accessible on the World Wide Web, key issues remain about Open Data and data standards for smart cities. A better integration and interoperabilty of data through the World Wide Web is only possible when everyone agrees on the standards for data representation and sharing. Linked Open Data positions itself as a solution for such standardization, being a method of publishing structured data using standard Web technologies. This facilitates the interlinking between datasets, makes them readable by computers, and easily accesible on the World Wide Web. We illustrate this through the example of an evolution from a traditional Content Management System with a geoportal, to a semantic based aproach. The Traffic Safety Monitor was developed in the period of 2012-2015 to monitor the road safety and to support policy development on road safety in Flanders (the northern part of Belgium). The system is built as a Content Management System (CMS), with publication tools to present geospatial indicators on road safety (e.g. the number of accidents with cars and the number of positive alcohol tests) as Web maps using stardardized Open Geospatial Consortium Webservices. The Traffic Safety Monitor is currently further developed towards a Mobility Monitor. Here, the focus is on the development of a business process model for the semantic exchange and publication of spatial data using Linked Open Data principles targeting indicators of sustainable and smart mobility. In the future, the usability of cycling Infrastructure for vehicles such as mobility scooters, bicycle trailers etc. can be assessed using Linked Open Data. The data and metadata is published in Linked open data format, opening the door for their reuse by a wide range of (smart) applications
- …