2 research outputs found
A Digital Library for Research Data and Related Information in the Social Sciences
In the social sciences, researchers search for information on the Web, but
this is most often distributed on different websites, search portals, digital
libraries, data archives, and databases. In this work, we present an integrated
search system for social science information that allows finding information
around research data in a single digital library. Users can search for research
data sets, publications, survey variables, questions from questionnaires,
survey instruments, and tools. Information items are linked to each other so
that users can see, for example, which publications contain data citations to
research data. The integration and linking of different kinds of information
increase their visibility so that it is easier for researchers to find
information for re-use. In a log-based usage study, we found that users search
across different information types, that search sessions contain a high rate of
positive signals and that link information is often explored
DBRepo: a Semantic Digital Repository for Relational Databases
Data curation is a complex, multi-faceted task. While dedicated data stewards are starting to take care of these activities in close collaboration with researchers for many types of (usually file-based) data in many institutions, this is rarely yet the case for data held in relational databases. Beyond large-scale infrastructures hosting e.g. climate or genome data, researchers usually have to create, build and maintain their database, care about security patches, and feed data into it in order to use it in their research. Data curation, if at all, usually happens after a project is finished, when data may be exported for digital preservation into file repository systems.
We present DBRepo, a semantic digital repository for relational databases in a private cloud setting designed to (1) host research data stored in relational databases right from the beginning of a research project, (2) provide separation of concerns, allowing the researchers to focus on the domain aspects of the data and their work while bringing in experts to handle classic data management tasks, (3) improve findability, accessibility and reusability by offering semantic mapping of metadata attributes, and (4) focus on reproducibility in dynamically evolving data by supporting versioning and precise identification/cite-ability for arbitrary subsets of data.